WO2017134798A1 - Dispositif de communication vocale - Google Patents

Dispositif de communication vocale Download PDF

Info

Publication number
WO2017134798A1
WO2017134798A1 PCT/JP2016/053375 JP2016053375W WO2017134798A1 WO 2017134798 A1 WO2017134798 A1 WO 2017134798A1 JP 2016053375 W JP2016053375 W JP 2016053375W WO 2017134798 A1 WO2017134798 A1 WO 2017134798A1
Authority
WO
WIPO (PCT)
Prior art keywords
learning
elevator
signal
parameter
communication device
Prior art date
Application number
PCT/JP2016/053375
Other languages
English (en)
Japanese (ja)
Inventor
茂明 鈴木
訓 古田
智治 粟野
Original Assignee
三菱電機株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 三菱電機株式会社 filed Critical 三菱電機株式会社
Priority to JP2016544644A priority Critical patent/JP6011751B1/ja
Priority to PCT/JP2016/053375 priority patent/WO2017134798A1/fr
Priority to CN201680079879.0A priority patent/CN108495803B/zh
Publication of WO2017134798A1 publication Critical patent/WO2017134798A1/fr

Links

Images

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B66HOISTING; LIFTING; HAULING
    • B66BELEVATORS; ESCALATORS OR MOVING WALKWAYS
    • B66B3/00Applications of devices for indicating or signalling operating conditions of elevators
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M9/00Arrangements for interconnection not involving centralised switching
    • H04M9/08Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B66HOISTING; LIFTING; HAULING
    • B66BELEVATORS; ESCALATORS OR MOVING WALKWAYS
    • B66B5/00Applications of checking, fault-correcting, or safety devices in elevators
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/60Substation equipment, e.g. for use by subscribers including speech amplifiers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M9/00Arrangements for interconnection not involving centralised switching
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/02Circuits for transducers, loudspeakers or microphones for preventing acoustic reaction, i.e. acoustic oscillatory feedback

Definitions

  • the present invention relates to a voice call device used when making a voice call.
  • An echo canceller for preventing howling and echo is used in a voice call device capable of hands-free calling such as a speakerphone phone.
  • the conditions for echo generation depend on the environment in which the voice communication device is set. Generally, a room surrounded by a narrow and hard wall has a large reverberation, so a large echo is generated. Sometimes small.
  • the echo canceller includes an adaptive filter that sequentially learns the characteristics (e.g., impulse response) of the echo path, and the adaptive filter sequentially learns the characteristics of the echo path that varies depending on the environment and cancels the echo.
  • this sequential learning becomes difficult or takes a long time to learn when the installation environment is greatly different, etc. Therefore, a technique is disclosed in which initial learning is performed and a pre-learned parameter is given before a call. (For example, Patent Document 1).
  • a voice communication device used for an emergency call of an elevator is a device that performs a hands-free call using a microphone and a speaker of an interphone installed in the elevator, and an echo canceller is required. Since the echo environment varies greatly depending on the variation in the microphone and speaker characteristics and the size of the elevator, the introduction of the initial learning shown in the prior art is effective. However, since an elevator passenger who encounters a situation such as confinement in an elevator is used for a voice call device for an emergency call of an elevator, the user may specify the start of initial learning in a confined situation. There are difficulties. Therefore, if the conventional technique is used as it is, there is a problem that it cannot be said that it is suitable as the timing of the initial learning.
  • the present invention has been made to solve the above problems, and an object of the present invention is to provide a voice communication device that can automatically perform initial learning in an appropriate time zone without giving a sense of incongruity to a user.
  • a voice call device is a voice call device for an elevator, a parameter holding unit for holding a parameter depending on a signal echo path, and a signal for processing a voice signal at the time of a voice call using the parameter A processing unit, a learning opportunity determination unit that determines whether or not the parameter can be learned prior to the voice call according to an environment in an elevator in which the device is installed, and the learning opportunity determination unit And a learning signal generator for generating a learning signal for learning the parameter when it is determined that learning is possible.
  • FIG. 1 is a configuration diagram of an elevator emergency call system in which a voice communication device according to Embodiment 1 is used.
  • the internal structure of the voice call device 4 according to the first embodiment. 2 is a diagram showing a hardware configuration of a voice call device 4 according to Embodiment 1.
  • FIG. 4 is a flowchart showing the operation of the voice call device 4 according to the first embodiment. 6 is a flow for determining an initial learning time of the learning opportunity determination unit 41 according to the first embodiment.
  • FIG. 3 is a diagram showing an internal configuration of a signal processing unit 45 according to the first embodiment.
  • FIG. 10 is a flowchart showing the operation of the voice call device 4 according to the fifth embodiment.
  • FIG. 10 is an internal configuration diagram of a signal processing unit 45 according to a sixth embodiment.
  • FIG. 10 is an internal configuration diagram of a signal processing unit 45 according to a sixth embodiment.
  • FIG. 1 is a block diagram of an elevator emergency call system in which a voice communication device according to the present invention is used.
  • 1 is an elevator
  • 2 is an intercom installed in the elevator
  • 3 is an elevator operation control unit that controls the operation of the elevator
  • 4 is a voice communication device
  • 5 is a communication network
  • 6 is a destination of the elevator 1.
  • the monitoring center 7 is a telephone terminal in the monitoring center 6.
  • This system is used, for example, when a call is made by calling an operator of the monitoring center 6 when the elevator 1 fails and a passenger is confined in the elevator 1.
  • the interphone 2 is used for voice input / output of a person confined in the elevator 1, and the voice communication device 4 performs interconversion between the analog voice of the interphone 2 and the digital voice transmitted on the communication network 5, in the elevator 1. Suppresses echoes generated in.
  • the communication network 5 transmits voice data between the voice communication device 4 and the monitoring center 6.
  • the telephone terminal 7 is used for a telephone call by an operator in the monitoring center 6.
  • the elevator operation control unit 3 is installed in the elevator 1 and controls the operation of the elevator 1, that is, controls the raising / lowering and opening / closing of the door, and is not directly related to the voice call, but in this embodiment, the voice call device 4 provides information for determining the time of initial learning.
  • FIG. 2 shows the internal configuration of the voice communication device 4 together with the interphone 2 and the elevator operation control unit 3 connected to the device.
  • 41 is a learning trigger determination unit that determines the trigger of initial learning
  • 42 is a call control unit that controls the start / end of a call
  • 43 is a learning signal generation unit that generates a learning signal
  • 44 is from a communication line.
  • 45 is a signal processing unit having an echo canceller function for suppressing an echo that is a signal obtained by sneaking the sound output from the speaker 21 of the interphone 2 into the microphone of the interphone 2.
  • 46 is a D / A converter that converts a digital signal into an analog signal
  • 47 is an A / D (Analog / Digital) converter that converts an analog signal into a digital signal
  • 48 holds parameters obtained by initial learning.
  • Parameter holding unit 49 is a communication line interface for transmitting / receiving data to / from the communication line
  • 21 is an internal phone 2 internal switch.
  • 22 intercom 2 internal microphones (hereinafter, referred to as microphone), 23 is very call button to be used in very call initiation by intercom 2.
  • FIG. 3 is a diagram showing a hardware configuration of the voice call device 4 according to Embodiment 1 of the present invention.
  • the learning opportunity determination unit 41, the call control unit 42, the learning signal generation unit 43, the switch 44, the signal processing unit 45, and the parameter holding unit 48 are realized by a processor 401 that executes a program stored in the memory 402. This is merely an example, and a hardware configuration using a dedicated processing circuit other than this may be used.
  • the D / A converter 46 and the A / D converter 47 are realized by an A / D, D / A conversion LSI (Large Scale Integration) 403.
  • the processor 401 may be a system LSI integrated with an A / D or D / A conversion LSI 403 or the like.
  • the switch open / close detection LSI 404 electrically converts the button pressing state of the emergency call button 23 so that it can be referred to from a specific port of the processor 401.
  • This is an example, and for example, a system LSI in which the processor 401 is integrated with the switch open / close detection LSI 404 may be used.
  • the elevator operation control unit 3 outputs various information output to the learning opportunity determination unit 41 on a network frame, and the network interface A405 receives the information.
  • the network interface B 406 transmits / receives the input / output data on the communication line side carried in the network frame. That is, the network interface B 406 corresponds to the communication line interface 49 in FIG. This is only an example, and for example, one network interface may receive information from the elevator operation control unit 3 and transmit / receive input / output data on the communication line side.
  • the processing realized by the processor 401 may be realized by a plurality of processors.
  • signal processing such as the signal processing unit 45 and the learning signal generation unit 43 is performed by a digital signal processing processor. 407 is realized.
  • FIG. 5 is a flowchart showing the operation of the voice call device 4.
  • the operation of the voice call device 4 will be described with reference to FIG.
  • the call control unit 42 monitors whether or not the emergency call button 23 has been pressed (ST1), and when a passenger confined in the elevator 1 presses the emergency call button 23, the call control unit 42 is triggered by this. Control signals are transmitted to and received from the monitoring center 6 via the communication line interface 49, and the monitoring center 6 is called to establish communication (ST2).
  • This communication establishment is performed by, for example, SIP (Session Initiation Protocol) defined by RFC (Request for Comments) 3261 of IETF (Internet Engineering Task Force).
  • the call control unit 42 outputs to the learning opportunity determination unit 41 and the signal processing unit 45 whether the communication state, that is, the center call, the call, or any idle state.
  • the signal processing unit 45 inputs the parameters stored in the parameter holding unit 48 (ST3), and a call is started.
  • the switch 44 selects an input signal from the communication line interface 49 and outputs it to the signal processing unit 45, so that the input signal from the communication line interface 49 passes through the signal processing unit 45 to the D / A converter 46. (ST4). Then, it is converted into an analog signal by the D / A converter 46 and output from the speaker 21 in the intercom 2.
  • the voice of the person in the elevator 1 from the microphone 22 in the interphone 2 is converted into a digital signal by the A / D converter 47, and then the echo is eliminated by the signal processing unit 45, via the communication line interface 49. It is output to the communication line (ST5). This operation is continued until the call is finished (ST6).
  • the switch 44 selects as described above when the determination result of whether or not the initial learning performed by the learning opportunity determination unit 41 is possible is not possible.
  • the operation for determining whether or not the initial learning performed by the learning opportunity determination unit 41 is possible will be described later.
  • the learning trigger determination unit 41 determines that initial learning is possible in an environment in which the call control unit 42 determines that the emergency call button 23 has not been pressed (ST1)
  • the learning opportunity determination unit 41 determines that the initial learning is possible (initial learning time) (ST7)
  • the learning process is started.
  • the learning signal generator 43 outputs white noise as a signal for initial parameter learning (ST8). Since the switch 44 selects the input from the learning signal generation unit 43 instead of the communication line interface 49 and outputs it to the signal processing unit 45, the output signal of the learning signal generation unit 43 passes through the signal processing unit 45 to D / The data is output to the A converter 46 (ST9).
  • parameter initial learning signal may be a signal other than white noise as long as the signal processing unit 45 can cancel the echo.
  • the learning opportunity determination unit 41 outputs a determination result as to whether or not initial learning is possible to the switch 44, the signal processing unit 45, and the learning signal generation unit 43.
  • the determination shown in ST7 in the flow of FIG. 5, that is, the operation for determining whether or not the learning opportunity determination unit 41 can perform initial learning will be described.
  • the elevator operation control unit 3 controls the raising / lowering of the elevator 1 and the opening / closing control of the door. Therefore, the elevator operation / stop information indicating whether the elevator 1 is stopped or operating (ascending or descending), It has door opening / closing information indicating whether it is open or closed, and button pressing information indicating whether a destination button or door opening / closing button in the elevator 1 has been pressed, and these information is output to the learning opportunity determination unit 41.
  • the initial learning of the signal processing unit 45 is desirably performed in a state where the elevator 1 is stopped and the door is closed, there is no passenger in the elevator 1 and noise is low.
  • the learning opportunity determination unit 41 receives each information from the elevator operation control unit 3, information from the call control unit 42, and an output signal of the A / D converter 47. Use.
  • the power is turned on at the time of installation of the communication device, but since it is in a state before operation, there is a possibility that it is in a state different from the echo environment at the time of the call, for example, the elevator door is open. There is.
  • this embodiment when initial learning is performed in a state where the door is closed, there are no passengers in the elevator 1 and noise is low, the environment is close to the echo environment at the time of a call. Initial learning can be performed.
  • FIG. 6 is an example of a determination flow for determining whether or not initial learning is possible in the learning opportunity determination unit 41.
  • the process starts from a determination state that the initial learning time is not reached (ST13).
  • the elevator 1 is stopped by the elevator operation / stop information from the elevator operation control unit 3 (ST14), the door is closed by the door opening / closing information from the elevator operation control unit 3 (ST15), and More than a predetermined time has passed since the button in the elevator 1 was last pressed from the button pressing information from the elevator operation control unit 3 (ST16), and the information for the call is determined by the information from the call control unit 42.
  • condition (ST16) that a predetermined time or more has passed since the button in the elevator 1 was last pressed from the button pressing information from the elevator operation control unit 3 is the condition that there is no passenger in the elevator 1. Equivalent to. Therefore, it is also possible to use a configuration for identifying that there is no passenger in the elevator 1 from other conditions. Further, the determination flow shown in FIG. 6 is an example, and other conditions can be used as long as the environment in which the voice communication device 4 is placed is recognized and whether or not initial learning is possible according to the environment is determined. It is also possible to use a decision flow based on settings.
  • 451 is an adaptive filter that estimates an impulse response of an echo path and generates a pseudo echo signal
  • 452 is a subtractor that subtracts the pseudo echo from a transmission signal input from the interphone 2 side
  • 453 is a signal on the transmission side.
  • 454 is a residual echo suppression unit that further suppresses the residual echo after echo suppression by the subtracter 452.
  • the adaptive filter 451 reads the parameter held in the parameter holding unit 48 at the start of the call.
  • the parameter input from the parameter holding unit 48 is an estimated echo response value of the echo path.
  • the adaptive filter 451 starts operation, generates a pseudo echo signal through the adaptive filter 451 from the signal on the reception side (input signal on the communication line side), and outputs the pseudo echo signal.
  • the subtracter 452 transmits the pseudo echo signal to the transmission signal. It is subtracted from the (interphone 2 side input signal) and output to the adaptive operation availability determination unit 453 and the residual echo suppression unit 454 and to the adaptive filter 451.
  • the adaptive filter 451 performs impulse response estimation of the echo path using the signal from the subtractor 452.
  • a LMS (Least Mean Squared) algorithm is used for the impulse response estimation of the echo path.
  • whether or not to perform the impulse response estimation operation of the echo path depends on the determination of the adaptive operation availability determination unit 453.
  • the adaptive operation availability determination unit 453 determines whether the echo path estimation operation in the adaptive filter 451 is possible, and outputs the determination result to the adaptive filter 451.
  • a state in which there is a call signal on the receiving side, there is no call signal on the transmitting side, and there is only an echo signal in which the signal on the receiving side wraps around in the elevator 1, is the most desirable state for the echo path estimation operation.
  • the adaptive operation availability determination unit 453 compares the power of the receiving side signal (communication line side input signal) and the transmitting side signal after subtraction of the pseudo echo by the subtractor 452, and compares the power of the receiving side signal. If it is greater than a certain threshold value than the signal power of the talk side, it is determined that an echo path estimation operation is possible.
  • the output signal of the subtracter 452 is a signal after echo cancellation obtained by subtracting the pseudo echo signal generated by the adaptive filter 451 from the transmission signal, but generally an echo component remains. In order to suppress this, the residual echo suppression unit 454 gives a loss to the output signal of the subtractor 452.
  • the adaptive operation availability determination unit 453 and the operation of the residual echo suppression unit 454 described above are examples, and for example, the adaptive operation availability determination unit 453 uses a determination method in which the transmission side signal before pseudo echo subtraction is used together. Alternatively, the residual echo suppression unit 454 may be omitted.
  • the adaptive filter 451 initializes the echo path impulse response estimated value at the timing of the initial learning start (all “0”), and then performs the same processing as during a call until the learning time is completed. That is, a pseudo echo signal is generated and output from the signal on the receiving side (input signal on the communication line side) through the adaptive filter 451, and the impulse response of the echo path is estimated using the signal from the subtracter 452.
  • the adaptive operation availability determination unit 453 performs the same operation as during a call.
  • the residual echo suppression unit 454 performs the same operation as during a call, but since the call is not in progress, its output is not transmitted to the communication line and is not used.
  • the learning opportunity determination unit 41 recognizes the surrounding environment and determines whether or not initial learning is possible according to the environment. Thus, it is possible to obtain the voice call device 4 that can automatically perform initial learning in an appropriate time zone without giving the user a sense of incongruity.
  • initial learning can be performed in an environment close to the environment where the voice call device 4 is actually used, and the voice call device 4 is actually used. It can be applied to the environment.
  • the voice call device 4 is a voice call device for an elevator, and includes a parameter holding unit 48 that holds parameters depending on a signal echo path, and a voice call using the parameters.
  • a signal processing unit 45 that processes voice signals and a learning opportunity determination unit that determines whether or not the parameter can be learned prior to a voice call according to the environment in the elevator 1 in which the device is installed 41 and a learning signal generation unit 43 that generates a learning signal for learning the parameter when it is determined that the learning opportunity determination unit 41 can perform learning.
  • the voice call device 4 is characterized in that the learning opportunity determination unit 41 determines that the parameter can be learned when there is no person in the elevator 1. With this configuration, the voice communication device 4 can perform initial learning at a timing when there is no person in the elevator 1.
  • the learning opportunity determination unit 41 is in a state where the elevator 1 is stopped and the door of the elevator 1 is closed, and a button in the elevator 1 is pressed.
  • a predetermined time elapses, it is determined that the environment in the elevator 1 is an environment in which the parameter can be learned, and the timing for learning the parameter is determined.
  • the parameters held by the parameter holding unit 48 are learned from the time when the learning signal is transmitted from the voice call device 4 to the time when the voice call device 4 receives the learning signal.
  • FIG. 1 it is determined whether or not it is time for the initial learning to be possible based on the flow shown in FIG. Specifically, the condition that there is no passenger in the elevator 1 is that a predetermined time or more has passed since the button in the elevator 1 was last pressed based on the button pressing information from the elevator operation control unit 3 (ST16). ) To determine the initial learning time. On the other hand, in this Embodiment, the structure which determines with the other means that there is no passenger in the elevator 1 is shown.
  • FIG. 8 shows an initial learning time determination flow. In this flow, instead of the condition that a predetermined time has passed since the button in the elevator 1 was last pressed, the condition that the weight in the elevator 1 is equal to or less than a certain value (ST21) is used for the determination. Otherwise, the flow is the same as in FIG. If the weight in the elevator 1 is not more than a certain value, it can be said that there is no passenger.
  • the learning opportunity determination unit 41 determines that there is no passenger by determining that the weight in the elevator 1 is equal to or less than the predetermined value, and learning. Determine if it is possible time.
  • the learning opportunity determination unit 41 is in a state where the elevator 1 is stopped, the door of the elevator 1 is closed, and the weight in the elevator 1 is a predetermined value.
  • the environment in the elevator 1 is an environment in which the parameter can be learned, and a time zone in which the parameter is learned is determined.
  • Embodiment 3 FIG. In this Embodiment, the structure which determines the conditions that there is no passenger in the elevator 1 with a camera image is shown. Some elevators can perform not only voice communication but also image communication at the same time.
  • FIG. 9 shows a flow in which the learning opportunity determination unit 41 determines the initial learning time.
  • the condition that the camera image is unmanned (ST22) is used for the determination.
  • the other flow is the same as in FIG.
  • the learning opportunity determination unit 41 determines that there is no passenger by determining that the camera image is unmanned, and is initial learning possible? Determine whether or not.
  • the learning opportunity determination unit 41 is in a state where the elevator 1 is stopped, the door of the elevator 1 is closed, and the camera installed in the elevator 1 When the image is unattended, it is determined that the environment in the elevator 1 is an environment in which the parameter can be learned. With this configuration, it is possible to automatically perform initial learning in an appropriate time zone without causing the user to feel uncomfortable.
  • the parameter held in the parameter holding unit 48 by the signal processing unit 45 is the impulse response estimated value of the echo path.
  • the present embodiment a configuration is shown in which not all of the impulse response estimated values are retained, but only the impulse response estimated values within a predetermined delay time are retained.
  • n impulse response estimation values provided in the adaptive filter 451
  • H0, H1 Only the impulse responses H0, H1,... Hm ⁇ 1 (m ⁇ n) are stored.
  • the signal processing unit 45 inputs only m impulse responses from the parameter holding unit 48 at the start of a call, the remaining n ⁇ m impulse responses Hm, Hm + 1,. Starts operation with a zero value.
  • the impulse response expresses the echo path characteristic with a response signal at the time of impulse input, and the impulse response with a short delay time is due to the direct sound that the sound output from the speaker 21 of the interphone 2 is directly transmitted to the microphone 22 of the interphone 2.
  • the impulse response with a long delay time corresponds to an echo due to a reflected sound transmitted from the speaker 21 of the interphone 2 to the microphone 22 of the interphone 2 after the sound output from the speaker 21 of the interphone 2 is reflected by the wall or door of the elevator 1.
  • the reflected sound changes depending on the number and positions of passengers in the elevator 1, but the direct sound does not change. Therefore, at the start of the call, even if the call is started in a state where all the impulse response estimated values Hm, Hm + 1,...
  • Hn ⁇ 1 having a long delay time are all 0, all the impulse response estimated values H0, H1, ...
  • H / W Hardware
  • the impulse response estimated value within a predetermined delay time is held in the parameter holding unit 48, so that the H / W scale can be reduced.
  • the parameters held by the parameter holding unit 48 are learned from the time when the learning signal is transmitted from the voice call device 4 to the time when the voice call device 4 receives the learning signal. This is the value of a part of the impulse response with a short delay time among the impulse response values of the path through which the signal passes.
  • Embodiment 5 FIG. In Embodiments 1 to 4, it is determined that there is no elevator occupant and that initial learning of parameters can be performed while a call is not being made with the center. On the other hand, this embodiment shows a mode in which initial learning is performed immediately before a call is made due to an elevator confinement.
  • a learning signal such as white noise needs to be output from the speaker, but there is a problem that it is difficult to allow an operation in which an harsh learning signal is heard to a user who encounters an emergency in an elevator. Therefore, in the present embodiment, a configuration using a learning signal that does not feel harsh to a user who encounters an emergency situation is shown.
  • FIG. 10 is a flowchart showing the operation of the voice call device 4.
  • the call control unit 42 monitors whether or not the emergency call button 23 is pressed (ST1), and when a passenger confined in the elevator presses the emergency call button 23, the call control unit 42 communicates with this as a trigger.
  • a control signal is transmitted / received to / from the monitoring center 6 via the line interface 49, and a monitoring center call is started (ST23).
  • the learning opportunity determination unit 41 determines the initial learning time (ST24), and starts the initial learning.
  • the learning signal generation unit 43 When the initial learning starts, the learning signal generation unit 43 outputs a chirp signal as a parameter initial learning signal (ST25). The chirp signal will be described later. Since the switch 44 selects the input from the learning signal generation unit 43 instead of the communication line interface 49 and outputs it to the signal processing unit 45, the output signal of the learning signal generation unit 43 passes through the signal processing unit 45 to D / The data is output to the A converter 46 (ST9). Then, it is converted into an analog signal by the D / A converter 46 and output from the speaker 21 in the intercom 2. The voice of the person in the elevator from the microphone 22 in the interphone 2 is converted into a digital signal by the A / D converter 47, and then the echo is eliminated by the signal processing unit 45 (ST10).
  • the learning opportunity determination unit 41 determines that a predetermined time, for example, 10 seconds elapses from the start of initial learning, as the initial learning time (ST20), and continues the learning operation (ST25, ST9, ST10) during that time.
  • a predetermined time for example, 10 seconds elapses from the start of initial learning, as the initial learning time (ST20), and continues the learning operation (ST25, ST9, ST10) during that time.
  • the initial learning time ends, the parameters learned by the signal processing unit 45 along with the echo cancellation operation are stored in the parameter holding unit 48 (ST12).
  • the initial learning operation and the call control unit 42 call to the monitoring center 6 are performed in parallel, and when communication with the monitoring center 6 is established and the call is completed (ST26), the signal processing unit 45 is a parameter holding unit.
  • the parameters stored in 48 are input (ST3), and the call starts.
  • the switch 44 selects an input signal from the communication line interface 49 and outputs it to the signal processing unit 45, so that the input signal from the communication line interface 49 passes through the signal processing unit 45 to the D / A converter 46. (ST4). Then, it is converted into an analog signal by the D / A converter 46 and output from the speaker 21 in the intercom 2.
  • the voice of the person in the elevator from the microphone 22 in the interphone 2 is converted into a digital signal by the A / D converter 47, and then the echo is eliminated by the signal processing unit 45 and communicated via the communication line interface 49. It is output to the line (ST5). This operation is continued until the call is finished (ST6).
  • a ringing tone used in general telephone communication is a signal obtained by amplitude-modulating a 400 Hz tone signal at 16 Hz, but such a narrow frequency band signal is not suitable as a learning signal. Therefore, a chirp signal is intermittently output and used as a ringing tone.
  • a chirp signal is a signal whose frequency increases or decreases with time.
  • the chirp signal at a certain time t (0 ⁇ t ⁇ T) is CH (t) and this signal increases from the frequency F0 to F1 during the time 0 to T, it is expressed as follows: Can do.
  • A is the maximum amplitude of the chirp signal.
  • the chirp signal is not harsh like white noise because it is a sound similar to a chirp of insects and birds. Moreover, since it is naturally felt as a ringing tone by using an intermittent tone, it does not give a feeling of strangeness to the elevator passenger. Furthermore, since it has a wide frequency band, it is suitable for learning of the adaptive filter 451.
  • the learning opportunity determination unit 41 can initially learn whichever is longer between a fixed time from the start of the center call and the time from the start of the call to the establishment of the call. to decide.
  • the learning signal generator 43 generates a ringing tone having a wide frequency band as a learning signal.
  • the center call is started when there is a person in the elevator 1. That is, in the voice call device 4 according to the fifth embodiment, when there is a person in the elevator 1, the learning signal generation unit uses a chirp signal as the learning signal. With this configuration, even when the initial learning is performed in an environment where the user exists immediately before a call is made due to an elevator being confined, the initial learning can be performed while maintaining an environment that is not harsh to the user. As a result, it is possible to realize a favorable call environment without giving a sense of incongruity to the user.
  • the impulse response estimated value of the echo path is a parameter learned by initial learning, whereas in this embodiment, parameters other than the impulse response estimated value of the echo path are learned by initial learning. Indicates the configuration as a parameter.
  • FIG. 11 is an internal configuration diagram of the signal processing unit 45 according to the present embodiment.
  • reference numeral 455 denotes a gain / loss calculation unit that calculates a gain or loss applied to signals before and after the subtractor 452, and reference numerals 456 and 457 denote. It is a gain / loss insertion section that gives a gain or loss to a signal.
  • the operations of the adaptive operation availability determination unit 453 and the residual echo suppression unit 454 are the same as those of the signal processing unit 45 shown in FIG.
  • the operation of the adaptive filter 451 is different from the signal processing unit 45 shown in FIG. 7 only in the operation during the initial learning that the learning opportunity determination unit 41 determines that the initial learning is possible.
  • the adaptive filter 451 initializes the impulse response estimated value of the echo path after a certain time elapses from the learning start timing, for example, 1 second later (all “0”), and then performs the same processing as during a call until the learning time is completed. Do. When the initial learning time is completed, the estimated impulse response value of the echo path estimated during the initial learning time is output to the parameter holding unit 48 and stored at the completion timing.
  • the gain / loss calculation unit 455 performs gain / loss learning from the timing when the learning opportunity determination unit 41 determines that initial learning is possible until the adaptive filter 451 starts learning. During this time, the gain / loss calculation unit 455 calculates the signal level L1 output to the interphone 2 and the signal level L2 input from the interphone 2, and at the timing when the adaptive filter 451 starts learning, the gain / loss insertion unit ⁇ ⁇ L2 / L1 is given as a gain / loss value to be given to 456, L1 / ( ⁇ ⁇ L2) which is the reciprocal number is given to the gain / loss insertion unit 457, and a gain / loss value for the gain / loss insertion unit 456 is given.
  • .Alpha..times.L2 / L1 and L1 / (. Alpha..times.L2), which is the reciprocal and gain / loss value for the gain / loss insertion unit 457, are output to the parameter holding unit 48 and stored.
  • is a safety coefficient, and a fixed value of 1 or less is used.
  • the gain / loss calculation unit 455 knows the timing of the call start from the information from the call control unit 42, and the gain / loss value and gain for the gain / loss insertion unit 456 from the parameter holding unit 48 at the call start timing.
  • the gain / loss value for the / loss insertion unit 457 is extracted and set in the gain / loss insertion unit 456 and the gain / loss insertion unit 457, respectively.
  • the gain / loss insertion unit 456 and the gain / loss insertion unit 457 perform the same operation during a call and during initial learning, and multiply the input signal by the gain / loss value set by the gain / loss calculation unit 455. Output.
  • the gain / loss insertion as described above has the following effects when the adaptive filter 451 is realized by fixed-point arithmetic.
  • the impulse response of the echo path has a large value when the gain of the echo path is large, and conversely has a small value when the gain of the echo path is small. Therefore, if the gain / loss insertion as described above is not performed, if the echo path impulse response estimated values H0, H1,..., Hn-1 in the adaptive filter 451 are fixed-point numbers, the numerical value overflows.
  • the decimal point position is determined in advance according to the maximum echo path gain assumed for prevention. If this method is adopted, when the gain of the echo path is small, the higher-order bits of the fixed-point numbers H0, H1,.
  • the estimation accuracy of the echo path impulse response is lowered.
  • the gain / loss insertion as described above is performed, the gain of the echo path viewed from the adaptive filter 451 is always the same by the gain / loss inserted by the gain / loss insertion unit 456, and H0, H1,. , Hn ⁇ 1 can be prevented from decreasing. Further, the gain / loss insertion unit 457 multiplies the reciprocal of the gain / loss inserted by the gain / loss insertion unit 456, and thus does not affect other processes.
  • the gain / loss calculation unit 455 obtains the gain / loss value corresponding to the gain of the echo path, and obtains the gain / loss value obtained from the input and output signals of the adaptive filter 451 and its inverse. Multiply With this configuration, it is possible to obtain a voice communication device that can maintain high estimation accuracy of the echo path impulse response by the adaptive filter 451.
  • the parameters held by the parameter holding unit 48 are the learning signals from the time when the learning signal is transmitted from the voice call device 4 to the time when the voice call device 4 receives it. It is the gain of the path which passes. With this configuration, it is possible to obtain a voice communication device that can maintain high estimation accuracy of the echo path impulse response by the adaptive filter 451.
  • initial learning means learning that is performed prior to a call. Therefore, the initial learning may be performed only once, or the initial learning may be performed several times.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Indicating And Signalling Devices For Elevators (AREA)
  • Interconnected Communication Systems, Intercoms, And Interphones (AREA)
  • Telephone Function (AREA)

Abstract

La présente invention concerne un dispositif de communication vocale, lequel dispositif est un dispositif de communication vocale pour un ascenseur, et lequel dispositif de communication vocale est caractérisé en ce qu'il comprend : une unité de conservation de paramètre pour conserver un paramètre dépendant de la trajectoire d'un écho d'un signal ; un processeur de signal pour traiter, à l'aide du paramètre, un signal vocal pendant une communication vocale ; une unité de détermination d'opportunité d'apprentissage pour déterminer, en fonction de l'environnement dans un ascenseur dans lequel le dispositif hôte est installé, s'il est ou non possible d'effectuer un apprentissage du paramètre avant la communication vocale ; et un générateur de signaux d'apprentissage pour générer, quand il est déterminé par l'unité de détermination d'opportunité d'apprentissage qu'il est possible d'effectuer un apprentissage, un signal d'apprentissage pour effectuer l'apprentissage du paramètre. La configuration rend possible d'effectuer automatiquement un apprentissage initial selon une temporisation appropriée sans donner à l'utilisateur un sentiment d'inconfort.
PCT/JP2016/053375 2016-02-04 2016-02-04 Dispositif de communication vocale WO2017134798A1 (fr)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP2016544644A JP6011751B1 (ja) 2016-02-04 2016-02-04 音声通話装置
PCT/JP2016/053375 WO2017134798A1 (fr) 2016-02-04 2016-02-04 Dispositif de communication vocale
CN201680079879.0A CN108495803B (zh) 2016-02-04 2016-02-04 语音通话装置

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2016/053375 WO2017134798A1 (fr) 2016-02-04 2016-02-04 Dispositif de communication vocale

Publications (1)

Publication Number Publication Date
WO2017134798A1 true WO2017134798A1 (fr) 2017-08-10

Family

ID=57140234

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2016/053375 WO2017134798A1 (fr) 2016-02-04 2016-02-04 Dispositif de communication vocale

Country Status (3)

Country Link
JP (1) JP6011751B1 (fr)
CN (1) CN108495803B (fr)
WO (1) WO2017134798A1 (fr)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114258129A (zh) * 2021-12-31 2022-03-29 展讯通信(上海)有限公司 终端注册方法及装置、存储介质、终端设备

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08237175A (ja) * 1995-02-24 1996-09-13 Casio Comput Co Ltd エコーキャンセラ装置、該エコーキャンセラ装置のフィルタ係数設定方法および該エコーキャンセラ装置を備える音声通信装置
JP2008056442A (ja) * 2006-08-31 2008-03-13 Toshiba Elevator Co Ltd エレベータの通話点検システム
JP2010168155A (ja) * 2009-01-21 2010-08-05 Mitsubishi Electric Corp エレベータの非常用通話装置
JP5034607B2 (ja) * 2006-11-02 2012-09-26 株式会社日立製作所 音響エコーキャンセラシステム
JP5784539B2 (ja) * 2012-04-10 2015-09-24 株式会社日立ビルシステム エレベータ非常通話装置の点検装置

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1284309C (zh) * 1994-05-07 2006-11-08 株式会社Ntt都科摩 回波消除器学习方法
JP5705782B2 (ja) * 2012-05-09 2015-04-22 株式会社日立ビルシステム エレベータ用非常通話点検システム
CN105007364B (zh) * 2015-05-26 2017-04-05 腾讯科技(深圳)有限公司 语音通话方法及语音通话装置

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08237175A (ja) * 1995-02-24 1996-09-13 Casio Comput Co Ltd エコーキャンセラ装置、該エコーキャンセラ装置のフィルタ係数設定方法および該エコーキャンセラ装置を備える音声通信装置
JP2008056442A (ja) * 2006-08-31 2008-03-13 Toshiba Elevator Co Ltd エレベータの通話点検システム
JP5034607B2 (ja) * 2006-11-02 2012-09-26 株式会社日立製作所 音響エコーキャンセラシステム
JP2010168155A (ja) * 2009-01-21 2010-08-05 Mitsubishi Electric Corp エレベータの非常用通話装置
JP5784539B2 (ja) * 2012-04-10 2015-09-24 株式会社日立ビルシステム エレベータ非常通話装置の点検装置

Also Published As

Publication number Publication date
CN108495803B (zh) 2019-08-06
JP6011751B1 (ja) 2016-10-19
CN108495803A (zh) 2018-09-04
JPWO2017134798A1 (ja) 2018-02-08

Similar Documents

Publication Publication Date Title
JP4569618B2 (ja) エコーキャンセラ及び通話音声処理装置
CN110838300B (zh) 回声消除的处理方法及处理***
JP3727258B2 (ja) エコー抑制処理システム
US8160239B2 (en) Echo canceller and speech processing apparatus
JP4568439B2 (ja) エコー抑圧装置
WO2013166761A1 (fr) Procédé et dispositif d'élimination d'écho
JP3009647B2 (ja) 音響反響制御システム、音響反響制御システムの同時通話検出器及び音響反響制御システムの同時通話制御方法
JP3607625B2 (ja) 多チャネル反響抑圧方法、その装置、そのプログラム及びその記録媒体
JP2003324370A (ja) エコーキャンセラ
JP2009094802A (ja) 通信装置
KR100547765B1 (ko) 동시통화구간 검출에 따른 음향 반향 제거 방법 및 장치
JP4105681B2 (ja) エコーサプレス方法、エコーサプレッサ、エコーサプレッサプログラム、通信路上の損失量制御方法、通信路上の損失量制御装置、通信路上の損失量制御プログラム、記録媒体
JP6011751B1 (ja) 音声通話装置
JP3756839B2 (ja) 反響低減方法、反響低減装置、反響低減プログラム
JP4600423B2 (ja) エコーキャンセラ
JP4544040B2 (ja) エコーキャンセル装置およびそれを用いた電話機、並びにエコーキャンセル方法
EP1584178A1 (fr) Dispositif et procede destines a la suppression d'echo, en particulier dans des telephones
WO2019169272A1 (fr) Détecteur d'intervention amélioré
JP4696776B2 (ja) 音声処理装置及びマイク装置
JP4857652B2 (ja) ノイズキャンセラ及びマイク装置
JP4650163B2 (ja) 拡声通話装置
JPH09116613A (ja) 反響消去装置
KR102172608B1 (ko) 동시통화에 강인한 심층학습 기반 음향반향 제거 장치 및 그 방법
JP5963077B2 (ja) 通話装置
JP4877083B2 (ja) 残留エコー抑圧制御装置、方法及びプログラム

Legal Events

Date Code Title Description
ENP Entry into the national phase

Ref document number: 2016544644

Country of ref document: JP

Kind code of ref document: A

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16889284

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16889284

Country of ref document: EP

Kind code of ref document: A1