WO2011001578A1 - Appareil de communication - Google Patents

Appareil de communication Download PDF

Info

Publication number
WO2011001578A1
WO2011001578A1 PCT/JP2010/002786 JP2010002786W WO2011001578A1 WO 2011001578 A1 WO2011001578 A1 WO 2011001578A1 JP 2010002786 W JP2010002786 W JP 2010002786W WO 2011001578 A1 WO2011001578 A1 WO 2011001578A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio signal
band
level
noise
degree
Prior art date
Application number
PCT/JP2010/002786
Other languages
English (en)
Japanese (ja)
Inventor
竹田博昭
Original Assignee
パナソニック株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by パナソニック株式会社 filed Critical パナソニック株式会社
Publication of WO2011001578A1 publication Critical patent/WO2011001578A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques

Definitions

  • the present invention relates to a communication device, and in particular, receives a voice signal with a narrow frequency band (narrow band voice signal) in voice communication, expands the frequency band of the received narrow band voice signal, and widens the voice signal (broadband).
  • the present invention relates to a communication apparatus and a band expansion method for performing band expansion to be converted into a voice signal.
  • a voice signal whose frequency band is limited to about 300 to 3400 Hz which is called a narrowband voice signal, is transmitted on the telephone line. For this reason, voice quality is deteriorated.
  • band extension a narrowband audio signal as an input is analyzed to generate a signal in a frequency band that does not originally exist.
  • the band expansion often uses the characteristics of an audio signal such as pitch and formant.
  • noise generated around a speaker is often superimposed as noise or line noise is often superimposed.
  • Patent Document 2 since noise removal processing is performed on an audio signal whose frequency band is extended, malfunction of the band expansion processing cannot be prevented.
  • An object of the present invention is to improve a voice quality by converting a narrowband audio signal to a wideband audio signal, and to perform a band extension on a narrowband audio signal containing noise, due to a malfunction of the band extension process.
  • a communication apparatus capable of suppressing noise generation is provided.
  • the communication apparatus includes an expansion unit that performs band expansion to expand a frequency band of a narrowband audio signal and convert it to a wideband audio signal, a noise level of the narrowband audio signal, and an audio level of the narrowband audio signal And a reference table in which a relationship between the band extension effect degree of the band extension is set, and a determination unit that determines the degree of the band extension effect with reference to the reference table based on the noise level and the audio level And.
  • a narrowband audio signal is converted into a wideband audio signal to improve audio quality, and what kind of band expansion effect can be achieved when performing band extension on a narrowband audio signal containing noise. It can be set by using a reference table. For example, it is possible to simply avoid noise such as performing an operation to avoid malfunction of broadband processing due to noise or performing broadband processing even if noise generation is allowed.
  • the bandwidth extension effect can be controlled more effectively than the bandwidth extension effect is controlled in proportion to the level.
  • the figure which shows the degree of the band expansion effect which concerns on Embodiment 1 of this invention The block diagram which shows the structure of the communication apparatus which concerns on Embodiment 2 of this invention.
  • the figure which shows the degree of the band expansion effect which concerns on Embodiment 2 of this invention The figure which shows the degree of the band expansion effect which concerns on Embodiment 2 of this invention.
  • FIG. 1 shows the configuration of communication apparatus 100 according to the present embodiment.
  • the communication unit 102 establishes a call connection with the communication apparatus of the communication partner via the public network 101, and transmits / receives a digital narrowband audio signal to / from the communication partner.
  • the received narrowband audio signal is input to the noise level estimation unit 103 and the band extension unit 105.
  • the noise level estimation unit 103 estimates the level of noise included in the digital narrowband audio signal (the noise level of the narrowband audio signal), and outputs the estimation result to the band extension effect determination unit 104.
  • the noise level may be estimated using the technique disclosed in Japanese Patent No. 3244252, for example.
  • the band expansion effect determination unit 104 determines the degree (strength) of the band expansion effect when the band expansion unit 105 expands the narrowband audio signal to the wideband audio signal according to the noise level estimated by the noise level estimation unit 103. Then, the bandwidth extension unit 105 is instructed to determine the degree of the bandwidth extension effect. A method for determining the degree of the bandwidth expansion effect will be described later.
  • the band extension unit 105 adjusts the degree (strength) of the band extension effect in accordance with an instruction from the band extension effect determination unit 104, and uses the technique described in Patent Document 1, for example, to adjust the frequency band of the digital narrowband audio signal. Extends the bandwidth to be converted into a digital wideband audio signal. For example, when band extension is performed using the technique described in Patent Document 1, the adjustment of the degree of the band extension effect is performed by increasing or decreasing the addition ratio of the frequency component having a higher frequency than the original sound generated based on the original sound to the original sound. Can be performed.
  • the DAC / AMP unit 106 analogizes and amplifies the digital broadband audio signal output from the band expansion unit 105 to obtain an analog broadband audio signal.
  • the output device unit 107 converts an analog broadband audio signal into sound and outputs the sound in the air.
  • the output device unit 107 includes a receiver, a speaker, a headphone, and the like.
  • the degree of the band expansion effect is determined according to the noise level of the digital narrowband audio signal. Specifically, the band expansion effect determination unit 104 determines the degree of the band expansion effect as shown in FIG.
  • the band extension effect determination unit 104 strengthens the band extension effect in order to improve the voice quality.
  • the band expansion effect determination unit 104 sets the band expansion effect to medium in order to improve voice quality.
  • the band expansion effect determination unit 104 weakens the band expansion effect in order to suppress the occurrence of noise due to the malfunction of the band expansion process in the band expansion unit 105.
  • the band expansion effect determination unit 104 determines the degree (strength) of the band expansion effect according to the noise level estimated by the noise level estimation unit 103. Then, according to the determined degree (strength), the band extension unit 105 performs band extension while adjusting the degree of the band extension effect.
  • the degree of the band expansion effect that emphasizes the voice quality is used.
  • the optimum degree of band expansion effect is determined in accordance with the degree of band expansion effect that can suppress the occurrence of noise due to the malfunction of and the noise level.
  • the narrowband audio signal is converted into the wideband audio signal to improve the audio quality, and the band extension is performed even when the band extension is performed on the narrowband audio signal including noise. Generation of noise due to processing malfunction can be suppressed.
  • FIG. 3 shows the configuration of communication apparatus 200 according to the present embodiment.
  • the same components as those in FIG. 1 (Embodiment 1) are denoted by the same reference numerals, and description thereof is omitted.
  • the narrowband audio signal received by the communication unit 102 is input to the level ratio estimation unit 201 and the band extension unit 105.
  • the level ratio estimation unit 201 estimates the noise level of the narrowband audio signal and also estimates the level of the audio signal included in the digital narrowband audio signal (the audio level of the narrowband audio signal). Then, the level ratio estimation unit 201 estimates the ratio (level ratio) between the audio level and the noise level, and outputs the estimation result to the band extension effect determination unit 202.
  • the noise level may be estimated using the technology disclosed in, for example, Japanese Patent No. 3244252 as described above.
  • the estimation of the voice level may be performed by a method such as subtracting the noise level from the level of the entire digital narrowband voice signal.
  • the band expansion effect determination unit 202 determines the degree (strength) of the band expansion effect when the band expansion unit 105 expands the narrowband audio signal to the wideband audio signal according to the level ratio estimated by the level ratio estimation unit 201. Then, the bandwidth extension unit 105 is instructed to determine the degree of the bandwidth extension effect.
  • the degree of the band expansion effect is determined according to the ratio (level ratio) between the audio level and the noise level of the digital narrowband audio signal. Specifically, the band expansion effect determination unit 202 determines the degree of the band expansion effect as shown in FIG.
  • the band extension effect determination unit 202 sets the band extension effect to a medium level in order to improve voice quality.
  • the band expansion effect determination unit 202 performs the band expansion effect in order to suppress the occurrence of noise due to the malfunction of the band expansion process in the band expansion unit 105 when the noise level is high and the sound level is medium. Weaken.
  • the band expansion effect determination unit 202 weakens the band expansion effect in order to suppress the occurrence of noise due to the malfunction of the band expansion process in the band expansion unit 105 if the noise level is large and the sound level is small.
  • the band extension effect determination unit 202 strengthens the band extension effect in order to improve the voice quality if the noise level is medium and the voice level is high.
  • the band expansion effect determination unit 202 sets the band expansion effect to medium in order to improve the voice quality if the noise level is medium and the sound level is medium.
  • the band expansion effect determination unit 202 weakens the band expansion effect in order to suppress the occurrence of noise due to the band expansion processing malfunction in the band expansion unit 105. To do.
  • the band extension effect determination unit 202 strengthens the band extension effect in order to improve voice quality if the noise level is low and the voice level is high.
  • the band extension effect determination unit 202 strengthens the band extension effect in order to improve the voice quality if the noise level is low and the voice level is medium.
  • the band extension effect determination unit 202 makes the band extension effect moderate in order to improve voice quality.
  • the band extension effect determination unit 202 determines the degree (strength) of the band extension effect according to the level ratio estimated by the level ratio estimation unit 201. Then, according to the determined degree (strength), the band extension unit 105 performs band extension while adjusting the degree of the band extension effect.
  • the present invention can also realize an operation in which the degree of the band expansion effect is not proportional to the ratio between the audio level and the noise level by replacing the contents of the reference table for determining the degree of the band expansion effect as shown in FIG. .
  • the band expansion process may malfunction, so it is desirable not to perform the band expansion process from the viewpoint of voice quality, but dare to make it easier to hear the voice
  • band expansion processing is performed while allowing noise to be generated is also conceivable.
  • the degree of bandwidth expansion effect may be determined using a reference table as shown in FIG. A case where the sound level is low will be described with reference to FIG.
  • the band extension effect determination unit 202 sets the band extension effect to a medium level in order to improve voice quality.
  • the band expansion effect determination unit 202 weakens the band expansion effect in order to suppress the occurrence of noise due to the band expansion processing malfunction in the band expansion unit 105. To do.
  • the band extension effect determination unit 202 weakens the band extension effect in order to suppress the occurrence of noise due to the malfunction of the band extension process in the band extension unit 105 when the noise level is high and the sound level is low. It should be, however, that the voice is very difficult to hear when the voice is low and the noise level is high. In this case, the band expansion effect is strengthened for the purpose of allowing generation of noise and making it easy to hear the voice.
  • the noise level in a narrowband audio signal is sufficiently small with respect to the audio level
  • the noise level in the narrowband audio signal is reduced to the degree of the band expansion effect that emphasizes the audio quality.
  • the optimal bandwidth expansion effect is determined according to the level of the bandwidth expansion effect that can suppress the occurrence of noise due to malfunction of the bandwidth expansion process and the ratio between the audio level and the noise level. .
  • the degree of the bandwidth expansion effect using the reference table as in the embodiment of the present invention, specifically, by using a reference table that is not proportional to the ratio between the audio level and the noise level. For example, when the audio level is low and the noise level is high, the band expansion effect should be weakened, but it is not proportional to the ratio between the audio level and the noise level.
  • the degree of the bandwidth expansion effect can be determined, and a bandwidth expansion effect that makes it easier to hear the voice can be provided.
  • the narrowband audio signal is converted into the wideband audio signal to improve the audio quality, and the band extension is performed even when the band extension is performed on the narrowband audio signal including noise. It is possible to control the excellent degree of bandwidth expansion effect, which can suppress the generation of noise due to a malfunction of the process, and can set the emphasis on ease of listening even if the generation of noise is allowed.
  • the present invention can be used for all communication devices capable of narrowband voice communication.
  • the present invention can be used for a wired telephone connected to a fixed communication network and a mobile phone connected to a mobile communication network.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Noise Elimination (AREA)
  • Telephone Function (AREA)

Abstract

L'invention porte sur un appareil de communication qui augmente une qualité audio par conversion d'un signal audio à bande étroite en un signal audio à large bande, et peut minimaliser une génération de bruit due à des erreurs dans le processus d'étalement de bande, même lors de la réalisation d'un étalement de bande sur un signal audio à bande étroite qui contient du bruit. Dans l'appareil de communication fourni, une unité d'estimation de niveau de bruit (103) estime le niveau d'un bruit dans un signal audio à bande étroite ; une unité de décision d'effet d'étalement de bande (104) décide, conformément au niveau de bruit estimé par l'unité d'estimation de niveau de bruit (103), le degré (intensité) de l'effet d'étalement de bande à utiliser lorsqu'une unité d'étalement de bande (105) étale le signal audio à bande étroite en un signal audio à large bande ; et l'unité d'étalement de bande (105) effectue un étalement de bande tout en ajustant le degré (intensité) de l'effet d'étalement de bande conformément à des instructions provenant de l'unité de décision d'effet d'étalement de bande (104).
PCT/JP2010/002786 2009-06-29 2010-04-16 Appareil de communication WO2011001578A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2009-154056 2009-06-29
JP2009154056 2009-06-29

Publications (1)

Publication Number Publication Date
WO2011001578A1 true WO2011001578A1 (fr) 2011-01-06

Family

ID=43410671

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2010/002786 WO2011001578A1 (fr) 2009-06-29 2010-04-16 Appareil de communication

Country Status (1)

Country Link
WO (1) WO2011001578A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2015215528A (ja) * 2014-05-13 2015-12-03 日本電信電話株式会社 音声強調装置、音声強調方法及びプログラム

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002536679A (ja) * 1999-01-27 2002-10-29 コーディング テクノロジーズ スウェーデン アクチボラゲット 情報源符号化システムの性能向上方法と装置
JP2004514179A (ja) * 2000-11-14 2004-05-13 コーディング テクノロジーズ アクチボラゲット 適応ろ波による高周波復元符号化方法の知覚性能の強化方法
WO2004104987A1 (fr) * 2003-05-20 2004-12-02 Matsushita Electric Industrial Co., Ltd. Procede et dispositif permettant d'elargir la plage de frequences d'un signal audio
JP2008197247A (ja) * 2007-02-09 2008-08-28 Yamaha Corp 音声処理装置
JP2010020251A (ja) * 2008-07-14 2010-01-28 Ntt Docomo Inc 音声符号化装置及び方法、音声復号化装置及び方法、並びに、音声帯域拡張装置及び方法

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002536679A (ja) * 1999-01-27 2002-10-29 コーディング テクノロジーズ スウェーデン アクチボラゲット 情報源符号化システムの性能向上方法と装置
JP2004514179A (ja) * 2000-11-14 2004-05-13 コーディング テクノロジーズ アクチボラゲット 適応ろ波による高周波復元符号化方法の知覚性能の強化方法
WO2004104987A1 (fr) * 2003-05-20 2004-12-02 Matsushita Electric Industrial Co., Ltd. Procede et dispositif permettant d'elargir la plage de frequences d'un signal audio
JP2008197247A (ja) * 2007-02-09 2008-08-28 Yamaha Corp 音声処理装置
JP2010020251A (ja) * 2008-07-14 2010-01-28 Ntt Docomo Inc 音声符号化装置及び方法、音声復号化装置及び方法、並びに、音声帯域拡張装置及び方法

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2015215528A (ja) * 2014-05-13 2015-12-03 日本電信電話株式会社 音声強調装置、音声強調方法及びプログラム

Similar Documents

Publication Publication Date Title
KR101482488B1 (ko) 개선된 오디오를 위한 통합된 심리음향 베이스 강화 (pbe)
US9653091B2 (en) Echo suppression device and echo suppression method
US9984705B2 (en) Non-intrusive quality measurements for use in enhancing audio quality
WO2013084810A1 (fr) Dispositif de capture de son de type à fixation au conduit auriculaire, dispositif de traitement de signal et procédé de capture de son
US20060142999A1 (en) Band correcting apparatus
KR20140019023A (ko) 전자 디바이스 상에서의 마스킹 신호 생성
JP2012533967A (ja) メディアストリーム内のデジタルオーディオサンプルに対する適応ゲイン制御
JP2005354683A (ja) 分散式サウンド向上技術
WO2020248769A1 (fr) Amplificateur de puissance audio, circuit de commande de gain de celui-ci et son procédé de commande
KR101898911B1 (ko) 인이어 마이크와 아웃이어 마이크 수음특성을 이용한 소음 제거 이어셋 및 소음 제거 방법
US20190073992A1 (en) Signal processing device, signal processing method and computer program
JP4843691B2 (ja) 信号特性変化装置
JP2002368839A (ja) 電話機及び該電話機の音声信号周波数補正方法
JP2011081033A (ja) 信号処理装置、及び携帯端末装置
JP4460256B2 (ja) 雑音低減処理方法、この方法を実施する装置、プログラム、記録媒体
JP2008148179A (ja) 音声信号処理装置および自動利得制御装置における雑音抑圧処理方法
WO2011001578A1 (fr) Appareil de communication
JP2010237288A (ja) 帯域拡張装置、方法及びプログラム、並びに、電話端末
US9961441B2 (en) Near-end listening intelligibility enhancement
JP2012195813A (ja) 電話機、制御方法、及びプログラム
JP2003092537A (ja) ワイヤレスマイクロホンシステム
JP2010204564A (ja) 通信装置
JP5535428B2 (ja) 音声信号の出力方法、スピーカシステム、携帯機器及びコンピュータプログラム
TWI753672B (zh) 應用在無線耳機中的通話品質改善裝置及方法
US20230058583A1 (en) Transmission error robust adpcm compressor with enhanced response

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10793756

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10793756

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP