CN111968659B - Microphone array voice enhancement method based on optimized IMCRA - Google Patents
Microphone array voice enhancement method based on optimized IMCRA Download PDFInfo
- Publication number
- CN111968659B CN111968659B CN202010719382.9A CN202010719382A CN111968659B CN 111968659 B CN111968659 B CN 111968659B CN 202010719382 A CN202010719382 A CN 202010719382A CN 111968659 B CN111968659 B CN 111968659B
- Authority
- CN
- China
- Prior art keywords
- signal
- noise
- microphone array
- imcra
- optimized
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 49
- 230000003595 spectral effect Effects 0.000 claims abstract description 20
- 230000000903 blocking effect Effects 0.000 claims abstract description 7
- 239000011159 matrix material Substances 0.000 claims abstract description 7
- 238000001228 spectrum Methods 0.000 claims description 8
- 230000004044 response Effects 0.000 claims description 4
- 239000000654 additive Substances 0.000 claims description 3
- 230000000996 additive effect Effects 0.000 claims description 3
- 238000009499 grossing Methods 0.000 claims description 3
- 230000009466 transformation Effects 0.000 claims description 3
- 238000001914 filtration Methods 0.000 description 4
- 238000004088 simulation Methods 0.000 description 4
- 230000003044 adaptive effect Effects 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 230000009467 reduction Effects 0.000 description 2
- 241000282414 Homo sapiens Species 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 239000008186 active pharmaceutical agent Substances 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D30/00—Reducing energy consumption in communication networks
- Y02D30/70—Reducing energy consumption in communication networks in wireless communication networks
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
A microphone array speech enhancement method based on optimized IMCRA, comprising: respectively calculating signals collected by each microphone to form a microphone array; calculating output signals of the microphone array; calculating an average noise signal in the blocking matrix output signal; noise power spectral density estimated using an optimized IMCRA method; the output signal of the microphone array and the noise power spectral density are input to an MMSE-LSA estimator to obtain a final speech enhancement signal. The invention not only improves the precision of noise estimation, but also reduces the calculation amount of voice enhancement; the spatial characteristics of microphone array voice enhancement are utilized, and noise residues of traditional microphone array voice enhancement are further removed; more accurate noise power spectral density can be estimated, thereby enabling the MMSE-LSA estimator to output enhanced speech signals of higher quality.
Description
Technical Field
The invention relates to a voice enhancement method. And more particularly to a microphone array speech enhancement method based on optimized IMCRA.
Background
The voice is used as the most basic communication means of human beings, and is the most convenient and direct information interaction tool between people. With the rapid development of science and technology, voice is also a main tool for people to communicate with machines. But in daily life, the speech signal is often disturbed by noise. Therefore, how to reduce noise, or how to improve speech quality, especially speech intelligibility, is a hot spot for many scholars to study. The goal of speech enhancement is to suppress noise as much as possible. In recent years, in order to reduce noise, many speech enhancement methods have been proposed.
The number of microphones can be classified into single-channel speech enhancement and microphone array speech enhancement. Among them, the single-channel speech enhancement algorithm is the earliest one. Classical single-channel speech enhancement algorithms include spectral subtraction, wiener filtering, and kalman filtering. However, during the processing, the spectral subtraction may generate musical noise, the wiener filtering method may have poor performance in a non-stationary environment, and the kalman filtering method may cause damage to the voice.
The microphone array speech enhancement method has more advantages than the conventional single channel speech enhancement method. The method not only utilizes time domain information among samples, but also utilizes space information among channels, thereby improving the voice enhancement performance. Currently, there are many well-established microphone array speech enhancement algorithms. Such as fixed beam forming algorithms, adaptive beam forming algorithms, and generalized sidelobe canceling methods. Fixed beam forming algorithms are easy to implement but require more microphones to effectively enhance speech. Adaptive beamforming algorithms were developed based on fixed beamforming. The key change in comparison to fixed beam forming is the weighting coefficients. The weighting coefficients of adaptive beamforming are no longer fixed values but vary with the input. The flexibility of the algorithm is improved, and the algorithm can be applied to more acoustic environments. The generalized sidelobe cancellation method can eliminate strong correlation noise, but has poor suppression capability on weak correlation noise and incoherent noise, and is more convenient and flexible to calculate.
Estimating the power spectral density of noise is an important step in speech enhancement. Noise environments can be classified into stationary noise environments and non-stationary noise environments. In a stationary noise environment, the noise distribution is uniform and varies slowly. It only uses noise segments of the noisy speech signal to estimate the noise spectrum, which are typically identified by speech activity detection methods. In practical application, background noise is often non-stationary, so that it is of practical significance to study noise estimation algorithms in non-stationary noise environments. In a non-stationary noise environment, the noise is constantly changing. Common estimation methods are minimum statistics, minimum control recursive average and modified minimum control recursive average.
For single channel speech enhancement algorithms, the noise power spectrum estimator generally exploits the time spectral characteristics of the signal. In a non-stationary noise environment its estimation accuracy will decrease. Because these methods assume that noise is only updated when speech is not present and not updated when speech is present. In practice, noise is always present. This will result in non-stationary noise being not accurately estimated when the noise changes very rapidly.
However, microphone arrays can overcome the limitations of single channel methods by spatially separating the signals, which is relatively easy to extract speech or noise components. Therefore, the microphone array voice enhancement is combined with the noise estimation algorithm, so that the accuracy of noise estimation can be remarkably improved, and the calculated amount is reduced.
Disclosure of Invention
The invention aims to solve the technical problem of providing a microphone array voice enhancement method based on optimized IMCRA, which can improve the precision of noise estimation and voice enhancement performance.
The technical scheme adopted by the invention is as follows: a microphone array voice enhancement method based on an optimized IMCRA comprises the following steps:
1) Respectively calculating the signal x collected by each microphone n (t) forming a microphone array;
2) Calculating the output signal y of the microphone array a (t);
3) Calculating average noise signal B in blocking matrix output signals AV (t);
4) Noise power spectral density estimated using optimized IMCRA method
5) Output signal y of microphone array a (t) and noise Power Spectrum DensityAnd inputting the speech enhancement signal to an MMSE-LSA estimator to obtain a final speech enhancement signal.
Step 1) comprises:
there are J signals s in the set space j (t), J e 1,2,3,..j, n+1 microphones, the nth microphone receiving a signal x at time t n (t):
in the formula ,hn,j Is the channel impulse response from the jth sound source to the nth microphone, v n (t) represents an additive noise,represents convolution, where N represents the microphones in the array, N e 1,2, N; at signal s j In (t), j=1 represents a desired speech signal, j=2, 3, & gt, and J represents an interference signal.
Step 2) comprises:
signal X received at time t for microphone array N (t) calculating the output signal y of the array using beamforming a (t):
In which the input signal of the array is X N (t),X N (t)=[x 1 (t),x 2 (t),...,x N (t)]The weight of beam forming is w a (t),w a (t)=[w a,1 (t),w a,2 (t),K,w a,N (t)]。
The step 3) comprises the following steps:
signal X to be received at time t based on microphone array N (t) obtaining a noise signal by blocking matrix B as follows:
then the noise signal is averaged by the following formula to obtain an average noise signal B AV (t):
Where N represents the total number of microphones.
Step 4) is to output the signal y of the microphone array a (t) average noise Signal B AV (t) obtaining a corresponding frequency domain signal y after Fourier transformation a (k, l) and B AV (k, l) estimating the power spectral density of the noise by using the optimized IMCRA method to the frequency domain signalWhere k represents frequency.
The optimized IMCRA method is that an average noise signal B is added into an updating formula of noise power spectrum density in the IMCRA method AV (t) the specific formula is as follows:
wherein ,is the estimated noise power spectral density at frequency k and frame i +.>Is the estimated noise power spectral density of the 1 st+1 frame; alpha d Is a smoothing factor; p (k, l) is the probability of signal presence.
According to the microphone array voice enhancement method based on the optimized IMCRA, the noise estimation algorithm is combined with the microphone array voice enhancement, so that the accuracy of noise estimation is improved, and the calculated amount of voice enhancement is reduced. The method combines microphone array voice enhancement and single-channel voice enhancement, not only utilizes the airspace characteristic of microphone array voice enhancement, but also further removes the noise residue of the traditional microphone array voice enhancement. The invention firstly uses a wave beam forming method to process the noise-containing signals received by the microphone array, so that the signal-to-noise ratio of the noise-containing signals is improved. The optimized IMCRA method can estimate more accurate noise power spectral density, so that the MMSE-LSA estimator can output enhanced voice signals with higher quality.
Drawings
Fig. 1 is a block diagram of a microphone array speech enhancement method based on an optimized IMCRA of the present invention.
Detailed Description
A microphone array speech enhancement method based on an optimized IMCRA of the present invention is described in detail below with reference to the examples and figures.
As shown in fig. 1, the microphone array voice enhancement method based on the optimized IMCRA of the present invention includes the following steps:
1) Respectively calculating the signal x collected by each microphone n (t) forming a microphone array; comprising the following steps:
there are J signals s in the set space j (t), J e 1,2,3,..j, n+1 microphones, the nth microphone receiving a signal x at time t n (t):
in the formula ,hn,j Is the channel impulse response from the jth sound source to the nth microphone, v n (t) represents an additive noise,represents convolution, where N represents the microphones in the array, N e 1,2, N; at signal s j In (t), j=1 represents a desired speech signal, j=2, 3, & gt, and J represents an interference signal.
2) Calculating the output signal y of the microphone array a (t); comprising the following steps:
signal X received at time t for microphone array N (t) calculating the output signal y of the array using beamforming a (t):
In which the input signal of the array is X N (t),X N (t)=[x 1 (t),x 2 (t),…,x N (t)]The weight of beam forming is w a (t),w a (t)=[w a,1 (t),w a,2 (t),K,w a,N (t)]。
3) Calculating average noise signal B in blocking matrix output signals AV (t); comprising the following steps:
signal X to be received at time t based on microphone array N (t) obtaining a noise signal by blocking matrix B as follows:
then the noise signal is averaged by the following formula to obtain an average noise signal B AV (t):
Where N represents the total number of microphones.
4) Noise power spectral density estimated using optimized IMCRA method
In particular the output signal y of the microphone array a (t) average noise Signal B AV (t) obtaining a corresponding frequency domain signal y after Fourier transformation a (k, l) and B AV (k, l) estimating the power spectral density of the noise by using the optimized IMCRA method to the frequency domain signalWhere k represents frequency.
The optimized IMCRA method of the invention is to add a flat to an update formula of noise power spectrum density in the IMCRA methodNoise equalizing signal B AV (t) the specific formula is as follows:
wherein ,is the estimated noise power spectral density at frequency k and frame i +.>Is the estimated noise power spectral density of the 1 st+1 frame; alpha p Is a smoothing factor; p (k, l) is the probability of signal presence.
5) Output signal y of microphone array a (t) and noise Power Spectrum DensityAnd inputting the speech enhancement signal to an MMSE-LSA estimator to obtain a final speech enhancement signal.
The effect of a microphone array speech enhancement method based on an optimized IMCRA of the present invention is illustrated by comparing with existing methods in the same simulation environment.
The simulation environment is built on an open source tool McRoomsim, which can generate a reverberation room impulse response of a user-defined rectangular room. The room properties are set to sound-deadening chambers with a sound absorption coefficient of 1, which means that there is no reverberation or other noise in the environment. The microphone array was a uniform circular microphone array (UCA) with a gap of 18 cm, n=7. The UCA is centered at the origin (0, 0) of the coordinate system with its center being the positive X-axis. The target signal is placed at 15 meters from the central axis. An interference source was established in the simulation, located at (7.5,7.5,7.5).
One male and one female are randomly selected from the TIMIT database 20. 5 noise signals were extracted from the noise-92 database 21, simulating different noise environments, namely pink noise, F16 (aircraft) noise, white noise, volwa (car) noise and M109 noise, respectively. The selected 2 clean speech and 5 noise signals are resampled at 16kHz and mixed with SNR at-10 dB, -5dB, 0dB, 5dB and 10dB to generate the target signal. The target signal and the interfering signal propagate to the microphone array in the simulated environment.
The microphone array voice enhancement method based on the optimized IMCRA, the fixed beam forming (DS) method and the voice enhancement method based on the MVDR method are used for carrying out comparison experiments, and the experimental results adopt a segmented signal-to-noise ratio (segSNR) and short-time objective intelligibility (STOI) to represent voice quality. The experimental results are shown in tables 1 to 5. Simulation experiment results show that the microphone array voice enhancement method based on the optimized IMCRA can improve voice enhancement performance, and the STOI and segSNR are respectively improved by 12% and 88%.
TABLE 1 average STOI score of original and enhanced signals (female)
TABLE 2 average STOI score of original and enhanced signals (Male)
TABLE 3 segSNR score of original and enhanced signals (female)
TABLE 4 segSNR score of original and enhanced signals (Male)
Table 5.M-sex expression of OIMCRA
From tables 1 to 4, it can be seen that the microphone array speech enhancement method based on the optimized IMCRA of the present invention can effectively improve the STOI and segSNR scores of the noise speech under various noise conditions. Experimental results show that the method of the invention improves STOI and segSNR by 12% and 88%, respectively.
Research shows that the male voice has more middle and low frequency components and the female voice has more middle and high frequency components. From table 5 it can be seen that the method of the invention has a noise reduction performance for women that is better than for men for different types of noise, and a noise reduction effect for women that is 20% better than for men.
Claims (1)
1. A microphone array speech enhancement method based on optimized IMCRA, comprising the steps of:
1) Respectively calculating the signal x collected by each microphone n (t) forming a microphone array; comprising the following steps:
there are J signals s in the set space j (t), J e 1,2,3,..j, n+1 microphones, the nth microphone receiving a signal x at time t n (t):
in the formula ,hn,j Is the channel impulse response from the jth sound source to the nth microphone, v n (t) represents an additive noise,represents convolution, where N represents the microphones in the array, N e 1,2, N; at signal s j In (t), j=1 represents a desired speech signal, j=2, 3, & gt, and J represents an interference signalA number;
2) Calculating the output signal y of the microphone array a (t); comprising the following steps:
signal X received at time t for microphone array N (t) calculating the output signal y of the array using beamforming a (t):
In which the input signal of the array is X N (t),X N (t)=[x 1 (t),x 2 (t),...,x N (t)]The weight of beam forming is w a (t),w a (t)=[w a,1 (t),w a,2 (t),K,w a,N (t)];
3) Calculating average noise signal B in blocking matrix output signals AV (t); comprising the following steps:
signal X to be received at time t based on microphone array N (t) obtaining a noise signal by blocking matrix B as follows:
then the noise signal is averaged by the following formula to obtain an average noise signal B AV (t):
Wherein N represents the total number of microphones;
4) Noise power spectral density estimated using optimized IMCRA methodIs to output signal y of microphone array a (t) average noise Signal B AV (t) obtaining a corresponding frequency domain signal y after Fourier transformation a (k, l) and B AV (k, l) estimating the power spectral density of the noise by using the optimized IMCRA method on the frequency domain signal>Wherein k represents a frequency;
the optimized IMCRA method is that an average noise signal B is added into an updating formula of noise power spectrum density in the IMCRA method AV (t) the specific formula is as follows:
wherein ,is the estimated noise power spectral density at frequency k and frame i +.>Is the estimated noise power spectral density of the 1 st+1 frame; alpha p Is a smoothing factor; p (k, l) is the probability of signal presence;
5) Output signal y of microphone array a (t) and noise Power Spectrum DensityAnd inputting the speech enhancement signal to an MMSE-LSA estimator to obtain a final speech enhancement signal. />
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010719382.9A CN111968659B (en) | 2020-07-23 | 2020-07-23 | Microphone array voice enhancement method based on optimized IMCRA |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010719382.9A CN111968659B (en) | 2020-07-23 | 2020-07-23 | Microphone array voice enhancement method based on optimized IMCRA |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111968659A CN111968659A (en) | 2020-11-20 |
CN111968659B true CN111968659B (en) | 2023-10-31 |
Family
ID=73362804
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010719382.9A Active CN111968659B (en) | 2020-07-23 | 2020-07-23 | Microphone array voice enhancement method based on optimized IMCRA |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111968659B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113329288B (en) * | 2021-04-29 | 2022-07-19 | 开放智能技术(南京)有限公司 | Bluetooth headset noise reduction method based on notch technology |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103456310A (en) * | 2013-08-28 | 2013-12-18 | 大连理工大学 | Transient noise suppression method based on spectrum estimation |
CN103559887A (en) * | 2013-11-04 | 2014-02-05 | 深港产学研基地 | Background noise estimation method used for speech enhancement system |
CN107301869A (en) * | 2017-08-17 | 2017-10-27 | 珠海全志科技股份有限公司 | Microphone array sound pick-up method, processor and its storage medium |
CN108735225A (en) * | 2018-04-28 | 2018-11-02 | 南京邮电大学 | It is a kind of based on human ear masking effect and Bayesian Estimation improvement spectrum subtract method |
CN108922554A (en) * | 2018-06-04 | 2018-11-30 | 南京信息工程大学 | The constant Wave beam forming voice enhancement algorithm of LCMV frequency based on logarithm Power estimation |
CN109658949A (en) * | 2018-12-29 | 2019-04-19 | 重庆邮电大学 | A kind of sound enhancement method based on deep neural network |
CN111128215A (en) * | 2019-12-24 | 2020-05-08 | 声耕智能科技(西安)研究院有限公司 | Single-channel real-time noise reduction method and system |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10573301B2 (en) * | 2018-05-18 | 2020-02-25 | Intel Corporation | Neural network based time-frequency mask estimation and beamforming for speech pre-processing |
-
2020
- 2020-07-23 CN CN202010719382.9A patent/CN111968659B/en active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103456310A (en) * | 2013-08-28 | 2013-12-18 | 大连理工大学 | Transient noise suppression method based on spectrum estimation |
CN103559887A (en) * | 2013-11-04 | 2014-02-05 | 深港产学研基地 | Background noise estimation method used for speech enhancement system |
CN107301869A (en) * | 2017-08-17 | 2017-10-27 | 珠海全志科技股份有限公司 | Microphone array sound pick-up method, processor and its storage medium |
CN108735225A (en) * | 2018-04-28 | 2018-11-02 | 南京邮电大学 | It is a kind of based on human ear masking effect and Bayesian Estimation improvement spectrum subtract method |
CN108922554A (en) * | 2018-06-04 | 2018-11-30 | 南京信息工程大学 | The constant Wave beam forming voice enhancement algorithm of LCMV frequency based on logarithm Power estimation |
CN109658949A (en) * | 2018-12-29 | 2019-04-19 | 重庆邮电大学 | A kind of sound enhancement method based on deep neural network |
CN111128215A (en) * | 2019-12-24 | 2020-05-08 | 声耕智能科技(西安)研究院有限公司 | Single-channel real-time noise reduction method and system |
Non-Patent Citations (2)
Title |
---|
Speech Enhancement Based on IMCRA Incorporating noise classification algorithm;Ji-Hyun Song 等;《The Transactions of the Korean Institute of Electrical Engineers》;第第61卷卷(第第12期期);第1920-1925页 * |
基于噪声估计的麦克风阵列语音增强算法研究;李秋颖;《中国优秀硕士论文全文数据库 电子与通信工程辑》(第第02期期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN111968659A (en) | 2020-11-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106782590B (en) | Microphone array beam forming method based on reverberation environment | |
CN108172231B (en) | Dereverberation method and system based on Kalman filtering | |
CN106875938B (en) | Improved nonlinear self-adaptive voice endpoint detection method | |
CN105280193B (en) | Priori signal-to-noise ratio estimation method based on MMSE error criterion | |
Niwa et al. | Post-filter design for speech enhancement in various noisy environments | |
Sehr et al. | Towards a better understanding of the effect of reverberation on speech recognition performance | |
CN115424627A (en) | Voice enhancement hybrid processing method based on convolution cycle network and WPE algorithm | |
CN111968659B (en) | Microphone array voice enhancement method based on optimized IMCRA | |
CN103929704B (en) | The method and system that a kind of adaptive acoustic feedback based on transform domain is eliminated | |
Dmour et al. | A new framework for underdetermined speech extraction using mixture of beamformers | |
Kinoshita et al. | Multi-step linear prediction based speech dereverberation in noisy reverberant environment. | |
Heitkaemper et al. | Smoothing along frequency in online neural network supported acoustic beamforming | |
Miyazaki et al. | Theoretical analysis of parametric blind spatial subtraction array and its application to speech recognition performance prediction | |
Priyanka et al. | GSC adaptive beamforming using fast NLMS algorithm for speech enhancement | |
Pfeifenberger et al. | Blind source extraction based on a direction-dependent a-priori SNR. | |
Shen et al. | A modified cross power-spectrum phase method based on microphone array for acoustic source localization | |
Kawase et al. | Automatic parameter switching of noise reduction for speech recognition | |
KR100875264B1 (en) | Post-processing method for blind signal separation | |
Chen et al. | Research on Novel Normal Fuzzy Kalman Filter for Speech Enhancement in Noisy Environment | |
Wang et al. | Multi-channel speech enhancement based on the MVDR beamformer and postfilter | |
Hu et al. | A robust adaptive speech enhancement system for vehicular applications | |
Mowlaee et al. | Target speaker separation in a multisource environment using speaker-dependent postfilter and noise estimation | |
JP2020141160A (en) | Sound information processing device and programs | |
CN116320947B (en) | Frequency domain double-channel voice enhancement method applied to hearing aid | |
Chan et al. | A decision-directed adaptive gain equalizer for assistive hearing instruments |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |