EP1081685A2 - Rauschverminderungsverfahren in einem Sprachsignal mit einem einzigen Mikrophon - Google Patents

Rauschverminderungsverfahren in einem Sprachsignal mit einem einzigen Mikrophon Download PDF

Info

Publication number: EP1081685A2
Authority: EP; European Patent Office
Prior art keywords: noise; speech; data; blocks; block
Prior art date: 1999-09-01
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Withdrawn

Application number

EP00118147A

Other languages

English (en)

French (fr)

Other versions

EP1081685A3 (de

Inventor

Russell H. Lambert

Karina L. Edmonds

Shi-Ping Hsu

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

Northrop Grumman Corp

Original Assignee

TRW Inc

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

1999-09-01

Filing date

2000-08-29

Publication date

2001-03-07

2000-08-29 Application filed by TRW Inc filed Critical TRW Inc

2001-03-07 Publication of EP1081685A2 publication Critical patent/EP1081685A2/de

2002-04-24 Publication of EP1081685A3 publication Critical patent/EP1081685A3/de

Status Withdrawn legal-status Critical Current

Links

238000000034 method Methods 0.000 title claims abstract description 24
230000009467 reduction Effects 0.000 title claims abstract description 24
238000001228 spectrum Methods 0.000 claims abstract description 30
238000012545 processing Methods 0.000 claims abstract description 16
238000004891 communication Methods 0.000 claims abstract description 10
230000001131 transforming effect Effects 0.000 claims description 9
230000000694 effects Effects 0.000 claims description 8
238000001914 filtration Methods 0.000 claims description 4
230000003595 spectral effect Effects 0.000 description 5
230000006835 compression Effects 0.000 description 4
238000007906 compression Methods 0.000 description 4
238000010586 diagram Methods 0.000 description 4
238000004378 air conditioning Methods 0.000 description 3
238000005311 autocorrelation function Methods 0.000 description 3
238000001514 detection method Methods 0.000 description 3
238000012360 testing method Methods 0.000 description 3
238000013459 approach Methods 0.000 description 2
230000005540 biological transmission Effects 0.000 description 2
230000015556 catabolic process Effects 0.000 description 2
238000006243 chemical reaction Methods 0.000 description 2
238000006731 degradation reaction Methods 0.000 description 2
238000005516 engineering process Methods 0.000 description 2
239000000203 mixture Substances 0.000 description 2
238000012544 monitoring process Methods 0.000 description 2
230000000737 periodic effect Effects 0.000 description 2
230000008569 process Effects 0.000 description 2
230000004044 response Effects 0.000 description 2
230000005534 acoustic noise Effects 0.000 description 1
230000001413 cellular effect Effects 0.000 description 1
230000000295 complement effect Effects 0.000 description 1
230000003111 delayed effect Effects 0.000 description 1
238000012217 deletion Methods 0.000 description 1
230000037430 deletion Effects 0.000 description 1
238000002592 echocardiography Methods 0.000 description 1
238000003780 insertion Methods 0.000 description 1
230000037431 insertion Effects 0.000 description 1
238000012986 modification Methods 0.000 description 1
230000004048 modification Effects 0.000 description 1
238000005457 optimization Methods 0.000 description 1
238000005070 sampling Methods 0.000 description 1
238000006467 substitution reaction Methods 0.000 description 1
238000010998 test method Methods 0.000 description 1
230000009466 transformation Effects 0.000 description 1
230000000007 visual effect Effects 0.000 description 1

Images

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering

Definitions

This invention relates generally to techniques for reliable conversion of speech data from acoustic signals to electrical signals in an acoustically noisy and reverberant environment.
ASR automatic speech recognition
background noise from both inside and outside an automobile renders in-vehicle communication both difficult and stressful.
Reverberation within the automobile combines with high noise levels to greatly degrade the speech signal received by a microphone in the automobile.
the microphone receives not only the original speech signal but also distorted and delayed duplicates of the speech signal, generated by multiple echoes from walls, windows and objects in the automobile interior. These duplicate signals in general arrive at the microphone over different paths.
multipath is often applied to the environment.
the quality of the speech signal is extremely degraded in such an environment, and the accuracy of any associated ASR systems is also degraded, perhaps to the point where they no longer operate.
recognition accuracy of an ASR system as high as 96% in a quiet environment could drop to well below 50% in a moving automobile.
speech compression Another related technology affected by noise and reverberation is speech compression, which digitally encodes speech signals to achieve reductions in communication bandwidth and for other reasons. In the presence of noise, speech compression becomes increasingly difficult and unreliable.
the active noise reduction approaches cancel acoustic noise signals by generating an opposite signal, sometimes referred to as "anti-noise,” through one or more transducers near the noise source, to cancel the unwanted noise signal.
This technique often creates noise at some other location in the vicinity of the speaker, and is not a practical solution for canceling multiple unknown noise sources, especially in the presence of multipath effects.
the present invention resides in a system and method for reducing noise in speech signals obtained from a single microphone in a noisy environment.
the present invention is a general noise reduction framework that allows multiple parameters to be adjusted optimally for any given application, noise environment or automatic speech recognition (ASR) system.
ASR automatic speech recognition
the system of the invention comprises a fast Fourier transform (FFT) circuit for transforming blocks of input microphone data to a frequency domain representation; a bandpass filter to remove selected frequency bands in which noise is known to be present; a speech detector for sensing the presence of speech signals in microphone data; a noise spectrum estimator updated only for data blocks in which no speech signals are detected; a spectrum subtraction circuit, for subtracting the estimated noise spectrum from microphone signals containing noise and speech signal components; and a speech emphasis circuit, for emphasizing speech signal components with respect to any residual noise after operation of the spectrum subtraction circuit, to provide a noise-reduced speech signal in the frequency domain.
FFT fast Fourier transform
the system may further comprise means for reconstructing time-domain data from the noise-reduced speech signal in the frequency domain, including an inverse fast Fourier transform circuit for transforming blocks of data from the frequency domain back into the time domain, whereby the noise-reduced speech signals are more intelligible in voice communication systems.
the system may further comprise an automatic speech recognition (ASR) system connected to receive the noise-reduced speech signals in the frequency domain, whereby the ASR system operates more reliably to generate selected control signals.
ASR automatic speech recognition
the speech emphasis circuit raises signals in the frequency domain by a power N, where N is a positive quantity greater than one.
the input signals are presented to the noise reduction system in blocks of "A” samples each, and data blocks of size “2A” samples each are presented to the FFT circuit.
the system further comprises means for combining input signal blocks of "A” samples in pairs to form data blocks.
the means for combining input signal blocks uses each input signal block twice, such that a currently input signal block is placed in a second half of a current data block and is then placed in a first half of a next data block.
the system may further comprise means for applying a triangular weighting window to each data block; and the means for reconstructing time-domain data includes means for combining the first half of each reconstructed data block with the second half of a reconstructed data block saved from processing the previous data block, time-domain samples with a uniform envelope are reconstructed and unwanted artifacts of block processing are minimized.
the system further comprises a noise monitor to provide an indication of when use of noise reduction would be desirable; and means for selecting the noise-reduced signal when noise level detected in the noise monitor is detected as relatively high, and for selecting the original speech with noise signal when the detected noise level is relatively low.
the invention may also be defined in terms of a method for reducing noise in signals received by a single microphone in a noise environment.
the method comprises the steps of transforming blocks of input data from a single microphone from a time-domain representation to a frequency-domain representation; filtering out selected frequency bands to minimize the effect known noise sources; detecting the presence of speech in each block of data signals; estimating noise by updating a noise spectrum estimate when no speech is detected; subtracting the noise spectrum estimate from the input speech and noise signals; and emphasizing speech signal components with respect to noise signal components, by raising the result of the subtracting step to the Nth power, where N is a positive quantity greater than one, to provide frequency-domain speech signal data with a reduced noise content.
the method may also include the step of reconstructing time-domain data from the noise-reduced speech signal in the frequency domain, including transforming blocks of data from the frequency domain back into the time domain, whereby the noise-reduced speech signals are more intelligible in voice communication systems.
the method includes the step of transmitting the noise-reduced speech signals in the frequency domain to an automatic speech recognition (ASR) system, whereby the ASR system operates more reliably to generate selected control signals.
ASR automatic speech recognition
the method step of emphasizing speech signal components includes raising signals in the frequency domain by a power N, where N is a positive quantity greater than one.
the method further includes the steps of presenting input signals to the noise reduction system in blocks of "A" samples each; presenting data blocks of size "2A” samples to the FFT circuit; combining input signal blocks of "A" samples in pairs to form data blocks, the combining step including using each input signal block twice, such that a currently input signal block is placed in a second half of a current data block and is then placed in a first half of a next data block; applying a triangular weighting window to each data block; and in the reconstructing step, combining the first half of each reconstructed data block with the second half of a reconstructed data block saved from processing the previous data block. Time-domain samples with a uniform envelope are reconstructed and unwanted artifacts of block processing are minimized with use of this method.
the method may further comprise the steps of continually monitoring the noise level with a noise monitor, to provide an indication of when use of noise reduction would be desirable; selecting the noise-reduced signal when the noise level detected by the noise monitor is detected as relatively high; and selecting the original speech and noise signal when the detected noise level is relatively low.
the present invention is concerned with a technique for significantly reducing the effects of noise in the detection of speech in a noisy and reverberant environment, such as the interior of a moving automobile.
the quality of speech transmission from mobile telephones in automobiles has long been known to be poor much of the time.
Noise from within and outside the vehicle result in a relatively low signal-to-noise ratio and reverberation of sounds within the vehicle further degrades the speech signals.
Available technologies for automatic speech recognition (ASR) and speech compression are at best degraded, and may not operate at all in the environment of the automobile.
a noisy speech signal is converted to digital samples and is input a block of samples at a time for processing in a fast Fourier transform (FFT) circuit, as indicated in block 10.
FFT fast Fourier transform
the signal is first bandpass filtered, as also indicated in block 10.
the magnitude spectrum is computed, as indicated in block 12, as the absolute value of the FFT function.
each block of data still in the frequency domain, is analyzed to detect the presence or absence of speech, as indicated in block 14.
An essential aspect of the invention is to reduce noise by spectral subtraction of noise spectrum estimate. Ideally, this estimate should be based on data obtained when speech is absent.
the noise spectrum estimate is not updated, but if speech is absent the noise estimate is updated.
the noise spectrum estimate is subtracted from the noisy speech signal spectrum, still in the frequency domain. Then, as indicated in block 20, speech is further emphasized over any residual noise by raising the speech signal (obtained after spectral subtraction of the noise) to the n th power, where n is optimized to provide the most desirable result. Finally, as indicated in block 22, the blocks of data in the frequency domain are subjected to inverse transformation by an inverse FFT circuit, which outputs a "cleaned" speech signal in the time domain.
FIG. 2 The functions depicted in FIG. 1 are depicted in more detail in FIG. 2.
the general parameter set referred to in FIG. 2 is defined in the following table: Parameter Name Description Range Units
a Block size (FFT size is 2A) Real positive integer (usually a power of 2) Samples B Input low cut-off point 0-parameter C Frequency (Hz) C Input high cut-off point Parameter B-sample rate/2 Frequency (Hz) D Spectral compression factor Real positive (greater than 1) Unitless E Speech location lower limit 0-parameter F Frequency (Hz) F Speech location upper limit Parameter E- sample rate/2 Frequency (Hz) G Running average energy update parameter Real positive (between 0 and 1) Unitless H Speech detect threshold parameter Real positive Unitless I Running average noise spectrum update parameter Real positive (between 0 and 1) Unitless J Speech enhancement parameter Real positive (greater than 1) Unitless
a Block size (FFT size is 2A) Real positive integer (usually a power of 2) Samples B Input low cut-off point
the functions shown in FIG. 2 may be implemented in any desired hardware or software configuration.
the noise cancellation system was implemented as software with code in a Microsoft Visual C++ compiler running on a personal computer in real time.
Input speech signals are sampled and input in blocks of A samples each.
Computation blocks for FFT processing are formed to contain 2A data samples each.
the FFT point size is 2A.
A may be 128 samples and 2A, 256 samples.
Rectangle 40 in FIG. 2 indicates the input of blocks of data.
Rectangle 42 indicates that each data computation block of 2A samples is formed from the stream of A-sized blocks in overlapping fashion. More specifically, if the incoming stream of A-sized blocks are designated as block (a), block (b), block (c), block (d) and so forth, then the first data computation block is formed from blocks (a) and (b) together, the next data computation block is formed from blocks (b) and (c) together, the next from blocks (c) and (d) together, and so forth.
the reason for overlapping the blocks in this way is to minimize sound artifacts that can be introduced by serially processing the blocks of data.
each data computation block is subjected to "windowing" by a triangular weighting function having the profile of an isosceles triangle centered on the data computation block.
a triangular weighting function having the profile of an isosceles triangle centered on the data computation block.
a maximum weight is applied to a sample or samples at the center of the data computation block, and progressively less weight is applied to samples towards the leading and trailing edges of the block.
these triangular windows also overlap.
the signals are later converted to the frequency domain and back to the time domain, the contributions from each adjacent pair of overlapping data computation blocks combine to produce a set of samples having a relatively uniform amplitude envelope.
each successive data block is formed and windowed, it is introduced to FFT processing, as indicated in rectangle 46, and then subjected to bandpass filtering between limits defined by parameters B and C, as indicated in rectangle 48.
This filtering step eliminates noise at very low and very high frequencies, such as below 300 Hz and above 3,850 Hz.
a magnitude spectrum S is computed and placed in a compressed domain using parameter D.
S compressed S 1/D .
the speech energy of the current data block is computed by summing the energy in the frequency range given by parameters E and F, such as 400 to 800 Hz, where speech is most likely to be dominant.
decision block 56 the current speech energy is compared with H times the average speech energy E avg , which provides a continually adapting speech detection threshold. If the current speech energy is greater that H* E avg , then the noise spectrum is not updated, as indicated by path 58.
the speech spectrum is then computed as the difference between the current spectrum and the noise spectrum estimate, as indicated in rectangle 62.
speech enhancement step 64 in which the speech spectrum, together with any residual noise component, is raised to the power J, where J is selected to be greater than one. Raising the signal to a power greater than one further distinguishes speech components from noise components.
the speech signals are to be transmitted to a human user of the system, they must next be transformed back to the time domain.
Reconstruction of the time domain waveform is also performed on a block by block basis.
An inverse FFT operation is performed on each data block, as indicated in rectangle 66.
the triangularly windowed data samples that result must be added together in a manner that will produce a uniform data envelope for the reconstructed waveform.
the first half of a reconstructed data block is added to the second half of the previously converted block of data, as indicated in block 68. Because these two half-blocks were originally subject to triangular windowing, they now combine in a complementary way to produce a uniform signal envelope.
the second half of the current block is saved for the next block iteration, as indicated in rectangle 70.
the combined A samples from the current and previous blocks are output, as indicated in rectangle 72.
a standard "star search” technique may be used, varying one parameter of the method described above while holding all others fixed. Ideally, this should be repeated for each type of speech and for different noise conditions.
One of the most critical parameters is the speech emphasis term, J. This was varied from 1.5 to 2.5 while testing the recognition accuracy for each setting of J. The optimum parameter value indicated was for use of the invention in the presence of freeway road and vehicle noise and for spoken connected digits data.
random noise indicated by graph 80
graph 80 has a distinctive 'spike' in its autocorrelation function 82
a sine wave has a periodic auto-correlation function.
a segment of speech 84 has strong components that are periodic sine waves. Therefore, the speech correlates strongly over several milliseconds, as indicated at 86.
the noise 80 correlates strongly only at the zero delay point, as indicated by the spike in its autocorrelation function 82. In the correlation domain, the spike due to noise can be easily zeroed out and this is the basis of the spectral subtraction approach used in the present invention.
the system of the invention has been tested under practical conditions in a moving vehicle, on a freeway with the windows closed and air-conditioning on, and also with the windows partly open.
Two types of microphones were considered, omni-directional and unidirectional. Not unexpectedly, the unidirectional microphone led to significantly better recognition accuracy for all background noise levels. The highest recognition accuracy obtained was 86% from freeway driving with the windows up and air conditioning on using connected digits speech data.
the in-vehicle data were initially collected using a digital recorder and the microphone placement was selected to maximize signal-to-noise ratio (SNR). For both the omni-directional and the unidirectional microphone the position that yields the greatest signal was just above the driver's visor (i.e., directly in front of the source). All the tests were conducted using the passenger as the point source for speech. Since the car cabin is symmetric, the results for the driver's side are expected to be equivalent to those obtained from the passenger side.
the speech recorded on the digital recorder in the automobile was sampled at 44.1 kHz and subsequently down-sampled to 8 kHz. In order to ensure the integrity of the audio files after down sampling, the files were tested with an automatic speech recognition (ASR) system. No degradation in ASR performance was observed for a file recorded at 44.1 kHz and down-sampled to 8 kHz.
ASR automatic speech recognition
a software package designed by Lemout and Hauspie ASR1500 was utilized for testing since it allowed for connected digits and has a relatively short response time.
the vocabulary tested consisted of eleven digits; 1-9, zero and oh. Connected digits were selected in order to account for the co-articulation factors in recognition process. In the test procedure, each digit is pronounced approximately fifteen times during a dialogue of a random series of connected digits.
the recognition accuracy for the digits is significantly improved after the removal of the background noise.
recognition rates improved from 47% to 86% for a unidirectional microphone, and from 16% to 78% for an omni-directional microphone.
recognition rates improved from 46% to 83% for a unidirectional microphone, and from less than 10% to 39% for the omni-directional microphone.
background noise level monitoring system 90 may be incorporated into the standard noise cancellation system of the invention, which would then operate only when a specified level of background noise is present. This would eliminate speech degradation from the processing when there is no background noise.
the decision need not be a "hard” (on or off) one. Rather the modified system would appropriately blend the processed and unprocessed speech in a continuously varying manner such that the effect of turning on the processing in high noise conditions would not be noticeable to the system user.
the monitored noise level is compared against an upper threshold, as indicated in decision block 92, and if the noise exceeds the threshold, the system selects processed (noise-reduced) speech as indicated in rectangle 94.
the monitored noise level is currently below the upper threshold, it is compared with a tower threshold, as indicated in decision block 96. If the noise is below the lower threshold, the original unprocessed speech is selected, as indicated in rectangle 98. If the monitored noise is between the upper and lower thresholds, the system selects a blend of inputs from the original speech and noise-reduced speech signals, as indicated in rectangle 100.
the noise reduction system is incorporated into an automatic speech recognition (ASR) system 104 (FIG. 5).
ASR automatic speech recognition
the noise reduction system is the same as the one illustrated in FIG. 1, but without the final inverse FFT process. This will eliminate some of the speech artifacts that are created when transforming back to the time domain waveform. Where the application calls for voice control of the ASR system only, there is no need to reconstruct the time domain waveform.
the inverse FFT function is eliminated from the noise cancellation system and the output of the noise cancellation system is coupled directly to frequency domain inputs 106 of the ASR system 104, which generates appropriate output control signals 108 in response to detection of input speech commands.
the present invention represents a significant advance in noise reduction for a single-microphone installed in noisy environment, such as a moving automobile.
the invention provides a "cleaned” or noise-reduced speech signal that is more intelligible to the human ear and improves reliability of ASR systems.
the system of the invention produces either time-domain output for transmission over voice communication systems, or frequency-domain output for direct connection to an ASR system.

Landscapes

Engineering & Computer Science (AREA)
Computational Linguistics (AREA)
Quality & Reliability (AREA)
Signal Processing (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Circuit For Audible Band Transducer (AREA)
Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)

EP00118147A 1999-09-01 2000-08-29 Rauschverminderungsverfahren in einem Sprachsignal mit einem einzigen Mikrophon Withdrawn EP1081685A3 (de)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
US388266		1989-08-01
US38826699A	1999-09-01	1999-09-01

Publications (2)

Publication Number	Publication Date
EP1081685A2 true EP1081685A2 (de)	2001-03-07
EP1081685A3 EP1081685A3 (de)	2002-04-24

Family

ID=23533388

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
EP00118147A Withdrawn EP1081685A3 (de)	1999-09-01	2000-08-29	Rauschverminderungsverfahren in einem Sprachsignal mit einem einzigen Mikrophon

Country Status (2)

Country	Link
EP (1)	EP1081685A3 (de)
JP (1)	JP2001092491A (de)

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
KR100421013B1 (ko) *	2001-08-10	2004-03-04	삼성전자주식회사	음성 향상 시스템 및 방법
GB2437559A (en) *	2006-04-26	2007-10-31	Zarlink Semiconductor Inc	System for reducing background noise in a speech signal by use of a fast Fourier transform
CN101320566B (zh) *	2008-06-30	2010-10-20	中国人民解放军第四军医大学	基于多带谱减法的非空气传导语音增强方法
CN102930870A (zh) *	2012-09-27	2013-02-13	福州大学	利用抗噪幂归一化倒谱系数的鸟类声音识别方法
US8538749B2 (en)	2008-07-18	2013-09-17	Qualcomm Incorporated	Systems, methods, apparatus, and computer program products for enhanced intelligibility
US8615393B2 (en)	2006-11-15	2013-12-24	Microsoft Corporation	Noise suppressor for speech recognition
US8831936B2 (en)	2008-05-29	2014-09-09	Qualcomm Incorporated	Systems, methods, apparatus, and computer program products for speech signal processing using spectral contrast enhancement
US9053697B2 (en)	2010-06-01	2015-06-09	Qualcomm Incorporated	Systems, methods, devices, apparatus, and computer program products for audio equalization
JP2015169915A (ja) *	2014-03-10	2015-09-28	公立大学法人広島市立大学	アクティブノイズ制御装置およびアクティブノイズ制御方法
CN104978955A (zh) *	2014-04-14	2015-10-14	美的集团股份有限公司	语音控制方法和***
US9202456B2 (en)	2009-04-23	2015-12-01	Qualcomm Incorporated	Systems, methods, apparatus, and computer-readable media for automatic control of active noise cancellation
WO2016094418A1 (en) *	2014-12-09	2016-06-16	Knowles Electronics, Llc	Dynamic local asr vocabulary
US9536540B2 (en)	2013-07-19	2017-01-03	Knowles Electronics, Llc	Speech signal separation and synthesis based on auditory scene analysis and speech modeling
US9820042B1 (en)	2016-05-02	2017-11-14	Knowles Electronics, Llc	Stereo separation and directional suppression with omni-directional microphones
US9838784B2 (en)	2009-12-02	2017-12-05	Knowles Electronics, Llc	Directional audio capture
US9978388B2 (en)	2014-09-12	2018-05-22	Knowles Electronics, Llc	Systems and methods for restoration of speech components
WO2018140020A1 (en) *	2017-01-26	2018-08-02	Nuance Communications, Inc.	Methods and apparatus for asr with embedded noise reduction
US10045140B2 (en)	2015-01-07	2018-08-07	Knowles Electronics, Llc	Utilizing digital microphones for low power keyword detection and noise suppression
CN111724805A (zh) *	2020-06-29	2020-09-29	北京百度网讯科技有限公司	用于处理信息的方法和装置
CN114650484A (zh) *	2022-05-23	2022-06-21	东莞市云仕电子有限公司	具有自动降噪功能的无线耳机及其使用方法

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
FI19992453A (fi) *	1999-11-15	2001-05-16	Nokia Mobile Phones Ltd	Kohinanvaimennus
JP7231181B2 (ja) *	2018-07-17	2023-03-01	国立研究開発法人情報通信研究機構	耐雑音音声認識装置及び方法、並びにコンピュータプログラム
JPWO2023100374A1 (de) *	2021-12-03	2023-06-08

Citations (3)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US5012519A (en) *	1987-12-25	1991-04-30	The Dsp Group, Inc.	Noise reduction system
EP0637012A2 (de) *	1990-01-18	1995-02-01	Matsushita Electric Industrial Co., Ltd.	Vorrichtung zur Rauschreduzierung
US5742927A (en) *	1993-02-12	1998-04-21	British Telecommunications Public Limited Company	Noise reduction apparatus using spectral subtraction or scaling and signal attenuation between formant regions

2000
- 2000-08-29 EP EP00118147A patent/EP1081685A3/de not_active Withdrawn
- 2000-09-01 JP JP2000265121A patent/JP2001092491A/ja active Pending

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US5012519A (en) *	1987-12-25	1991-04-30	The Dsp Group, Inc.	Noise reduction system
EP0637012A2 (de) *	1990-01-18	1995-02-01	Matsushita Electric Industrial Co., Ltd.	Vorrichtung zur Rauschreduzierung
US6038532A (en) *	1990-01-18	2000-03-14	Matsushita Electric Industrial Co., Ltd.	Signal processing device for cancelling noise in a signal
US5742927A (en) *	1993-02-12	1998-04-21	British Telecommunications Public Limited Company	Noise reduction apparatus using spectral subtraction or scaling and signal attenuation between formant regions

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
KR100421013B1 (ko) *	2001-08-10	2004-03-04	삼성전자주식회사	음성 향상 시스템 및 방법
GB2437559A (en) *	2006-04-26	2007-10-31	Zarlink Semiconductor Inc	System for reducing background noise in a speech signal by use of a fast Fourier transform
GB2437559B (en) *	2006-04-26	2010-12-22	Zarlink Semiconductor Inc	Low complexity noise reduction method
US8615393B2 (en)	2006-11-15	2013-12-24	Microsoft Corporation	Noise suppressor for speech recognition
US8831936B2 (en)	2008-05-29	2014-09-09	Qualcomm Incorporated	Systems, methods, apparatus, and computer program products for speech signal processing using spectral contrast enhancement
CN101320566B (zh) *	2008-06-30	2010-10-20	中国人民解放军第四军医大学	基于多带谱减法的非空气传导语音增强方法
US8538749B2 (en)	2008-07-18	2013-09-17	Qualcomm Incorporated	Systems, methods, apparatus, and computer program products for enhanced intelligibility
US9202456B2 (en)	2009-04-23	2015-12-01	Qualcomm Incorporated	Systems, methods, apparatus, and computer-readable media for automatic control of active noise cancellation
US9838784B2 (en)	2009-12-02	2017-12-05	Knowles Electronics, Llc	Directional audio capture
US9053697B2 (en)	2010-06-01	2015-06-09	Qualcomm Incorporated	Systems, methods, devices, apparatus, and computer program products for audio equalization
CN102930870A (zh) *	2012-09-27	2013-02-13	福州大学	利用抗噪幂归一化倒谱系数的鸟类声音识别方法
CN102930870B (zh) *	2012-09-27	2014-04-09	福州大学	利用抗噪幂归一化倒谱系数的鸟类声音识别方法
US9536540B2 (en)	2013-07-19	2017-01-03	Knowles Electronics, Llc	Speech signal separation and synthesis based on auditory scene analysis and speech modeling
JP2015169915A (ja) *	2014-03-10	2015-09-28	公立大学法人広島市立大学	アクティブノイズ制御装置およびアクティブノイズ制御方法
CN104978955A (zh) *	2014-04-14	2015-10-14	美的集团股份有限公司	语音控制方法和***
US9978388B2 (en)	2014-09-12	2018-05-22	Knowles Electronics, Llc	Systems and methods for restoration of speech components
WO2016094418A1 (en) *	2014-12-09	2016-06-16	Knowles Electronics, Llc	Dynamic local asr vocabulary
US10045140B2 (en)	2015-01-07	2018-08-07	Knowles Electronics, Llc	Utilizing digital microphones for low power keyword detection and noise suppression
US9820042B1 (en)	2016-05-02	2017-11-14	Knowles Electronics, Llc	Stereo separation and directional suppression with omni-directional microphones
WO2018140020A1 (en) *	2017-01-26	2018-08-02	Nuance Communications, Inc.	Methods and apparatus for asr with embedded noise reduction
CN110268471A (zh) *	2017-01-26	2019-09-20	诺昂世通讯公司	具有嵌入式降噪的asr的方法和设备
EP3574499A4 (de) *	2017-01-26	2020-09-09	Nuance Communications, Inc.	Verfahren und vorrichtung für asr mit eingebetteter rauschminderung
US11308946B2 (en)	2017-01-26	2022-04-19	Cerence Operating Company	Methods and apparatus for ASR with embedded noise reduction
CN110268471B (zh) *	2017-01-26	2023-05-02	赛伦斯运营公司	具有嵌入式降噪的asr的方法和设备
CN111724805A (zh) *	2020-06-29	2020-09-29	北京百度网讯科技有限公司	用于处理信息的方法和装置
CN114650484A (zh) *	2022-05-23	2022-06-21	东莞市云仕电子有限公司	具有自动降噪功能的无线耳机及其使用方法
CN114650484B (zh) *	2022-05-23	2022-09-06	东莞市云仕电子有限公司	具有自动降噪功能的无线耳机及其使用方法

Also Published As

Publication number	Publication date
EP1081685A3 (de)	2002-04-24
JP2001092491A (ja)	2001-04-06

Legal Events

Date	Code	Title	Description
2001-01-19	PUAI	Public reference made under article 153(3) epc to a published international application that has entered the european phase	Free format text: ORIGINAL CODE: 0009012
2001-03-07	AK	Designated contracting states	Kind code of ref document: A2 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE
2001-03-07	AX	Request for extension of the european patent	Free format text: AL;LT;LV;MK;RO;SI
2002-03-08	PUAL	Search report despatched	Free format text: ORIGINAL CODE: 0009013
2002-04-24	AK	Designated contracting states	Kind code of ref document: A3 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE
2002-04-24	AX	Request for extension of the european patent	Free format text: AL;LT;LV;MK;RO;SI
2002-09-11	17P	Request for examination filed	Effective date: 20020712
2003-01-15	AKX	Designation fees paid	Free format text: AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE
2003-11-19	RAP1	Party data changed (applicant data changed or rights of an application transferred)	Owner name: NORTHROP GRUMMAN CORPORATION
2003-12-03	RAP1	Party data changed (applicant data changed or rights of an application transferred)	Owner name: NORTHROP GRUMMAN CORPORATION
2005-07-22	STAA	Information on the status of an ep patent application or granted ep patent	Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN
2005-09-07	18D	Application deemed to be withdrawn	Effective date: 20050301

Publication	Publication Date	Title
EP1081685A2 (de)	2001-03-07	Rauschverminderungsverfahren in einem Sprachsignal mit einem einzigen Mikrophon
EP1739657B1 (de)	2013-01-09	Sprachsignalverbesserung
US8010355B2 (en)	2011-08-30	Low complexity noise reduction method
US6487257B1 (en)	2002-11-26	Signal noise reduction by time-domain spectral subtraction using fixed filters
US8249861B2 (en)	2012-08-21	High frequency compression integration
EP1080465B1 (de)	2003-01-22	Rauschunterdrückung mittels spektraler subtraktion unter verwendung von linearem faltungsprodukt und kausaler filterung
KR100851716B1 (ko)	2008-08-11	바크 대역 위너 필터링 및 변형된 도블링거 잡음 추정에기반한 잡음 억제
EP2244254B1 (de)	2019-06-12	Gegen hohe Anregungsgeräusche unempfindliches System zum Ausgleich von Umgebungsgeräuschen
EP1855456B1 (de)	2009-10-14	Echoverringerung für zeitvariante Systeme
Yang	1993	Frequency domain noise suppression approaches in mobile telephone systems
EP2416315B1 (de)	2015-05-20	Rauschunterdrückungseinrichtung
US5878389A (en)	1999-03-02	Method and system for generating an estimated clean speech signal from a noisy speech signal
US20060222184A1 (en)	2006-10-05	Multi-channel adaptive speech signal processing system with noise reduction
US6510224B1 (en)	2003-01-21	Enhancement of near-end voice signals in an echo suppression system
KR20070085729A (ko)	2007-08-27	바크 밴드 위너 필터 및 선형 감쇠를 이용한 노이즈 감소및 컴포트 노이즈 이득 제어
US20140244245A1 (en)	2014-08-28	Method for soundproofing an audio signal by an algorithm with a variable spectral gain and a dynamically modulatable hardness
KR100470523B1 (ko)	2005-03-08	마이크로폰 신호로부터 스피커 간섭을 제거하기 위한 필터 시스템
US7917359B2 (en)	2011-03-29	Noise suppressor for removing irregular noise
Itoh et al.	1997	Environmental noise reduction based on speech/non-speech identification for hearing aids
EP2490218B1 (de)	2019-09-25	Verfahren zur Interferenzunterdrückung
US6507623B1 (en)	2003-01-14	Signal noise reduction by time-domain spectral subtraction
US20060184361A1 (en)	2006-08-17	Method and apparatus for reducing an interference noise signal fraction in a microphone signal
Esch et al.	2010	Combined reduction of time varying harmonic and stationary noise using frequency warping
US11227622B2 (en)	2022-01-18	Speech communication system and method for improving speech intelligibility
JP2003517761A (ja)	2003-05-27	通信システムにおける音響バックグラウンドノイズを抑制するための方法と装置