CN111968659B - Microphone array voice enhancement method based on optimized IMCRA - Google Patents

Microphone array voice enhancement method based on optimized IMCRA Download PDF

Info

Publication number
CN111968659B
CN111968659B CN202010719382.9A CN202010719382A CN111968659B CN 111968659 B CN111968659 B CN 111968659B CN 202010719382 A CN202010719382 A CN 202010719382A CN 111968659 B CN111968659 B CN 111968659B
Authority
CN
China
Prior art keywords
signal
noise
microphone array
imcra
optimized
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010719382.9A
Other languages
Chinese (zh)
Other versions
CN111968659A (en
Inventor
李秋颖
张涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN202010719382.9A priority Critical patent/CN111968659B/en
Publication of CN111968659A publication Critical patent/CN111968659A/en
Application granted granted Critical
Publication of CN111968659B publication Critical patent/CN111968659B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

A microphone array speech enhancement method based on optimized IMCRA, comprising: respectively calculating signals collected by each microphone to form a microphone array; calculating output signals of the microphone array; calculating an average noise signal in the blocking matrix output signal; noise power spectral density estimated using an optimized IMCRA method; the output signal of the microphone array and the noise power spectral density are input to an MMSE-LSA estimator to obtain a final speech enhancement signal. The invention not only improves the precision of noise estimation, but also reduces the calculation amount of voice enhancement; the spatial characteristics of microphone array voice enhancement are utilized, and noise residues of traditional microphone array voice enhancement are further removed; more accurate noise power spectral density can be estimated, thereby enabling the MMSE-LSA estimator to output enhanced speech signals of higher quality.

Description

Microphone array voice enhancement method based on optimized IMCRA
Technical Field
The invention relates to a voice enhancement method. And more particularly to a microphone array speech enhancement method based on optimized IMCRA.
Background
The voice is used as the most basic communication means of human beings, and is the most convenient and direct information interaction tool between people. With the rapid development of science and technology, voice is also a main tool for people to communicate with machines. But in daily life, the speech signal is often disturbed by noise. Therefore, how to reduce noise, or how to improve speech quality, especially speech intelligibility, is a hot spot for many scholars to study. The goal of speech enhancement is to suppress noise as much as possible. In recent years, in order to reduce noise, many speech enhancement methods have been proposed.
The number of microphones can be classified into single-channel speech enhancement and microphone array speech enhancement. Among them, the single-channel speech enhancement algorithm is the earliest one. Classical single-channel speech enhancement algorithms include spectral subtraction, wiener filtering, and kalman filtering. However, during the processing, the spectral subtraction may generate musical noise, the wiener filtering method may have poor performance in a non-stationary environment, and the kalman filtering method may cause damage to the voice.
The microphone array speech enhancement method has more advantages than the conventional single channel speech enhancement method. The method not only utilizes time domain information among samples, but also utilizes space information among channels, thereby improving the voice enhancement performance. Currently, there are many well-established microphone array speech enhancement algorithms. Such as fixed beam forming algorithms, adaptive beam forming algorithms, and generalized sidelobe canceling methods. Fixed beam forming algorithms are easy to implement but require more microphones to effectively enhance speech. Adaptive beamforming algorithms were developed based on fixed beamforming. The key change in comparison to fixed beam forming is the weighting coefficients. The weighting coefficients of adaptive beamforming are no longer fixed values but vary with the input. The flexibility of the algorithm is improved, and the algorithm can be applied to more acoustic environments. The generalized sidelobe cancellation method can eliminate strong correlation noise, but has poor suppression capability on weak correlation noise and incoherent noise, and is more convenient and flexible to calculate.
Estimating the power spectral density of noise is an important step in speech enhancement. Noise environments can be classified into stationary noise environments and non-stationary noise environments. In a stationary noise environment, the noise distribution is uniform and varies slowly. It only uses noise segments of the noisy speech signal to estimate the noise spectrum, which are typically identified by speech activity detection methods. In practical application, background noise is often non-stationary, so that it is of practical significance to study noise estimation algorithms in non-stationary noise environments. In a non-stationary noise environment, the noise is constantly changing. Common estimation methods are minimum statistics, minimum control recursive average and modified minimum control recursive average.
For single channel speech enhancement algorithms, the noise power spectrum estimator generally exploits the time spectral characteristics of the signal. In a non-stationary noise environment its estimation accuracy will decrease. Because these methods assume that noise is only updated when speech is not present and not updated when speech is present. In practice, noise is always present. This will result in non-stationary noise being not accurately estimated when the noise changes very rapidly.
However, microphone arrays can overcome the limitations of single channel methods by spatially separating the signals, which is relatively easy to extract speech or noise components. Therefore, the microphone array voice enhancement is combined with the noise estimation algorithm, so that the accuracy of noise estimation can be remarkably improved, and the calculated amount is reduced.
Disclosure of Invention
The invention aims to solve the technical problem of providing a microphone array voice enhancement method based on optimized IMCRA, which can improve the precision of noise estimation and voice enhancement performance.
The technical scheme adopted by the invention is as follows: a microphone array voice enhancement method based on an optimized IMCRA comprises the following steps:
1) Respectively calculating the signal x collected by each microphone n (t) forming a microphone array;
2) Calculating the output signal y of the microphone array a (t);
3) Calculating average noise signal B in blocking matrix output signals AV (t);
4) Noise power spectral density estimated using optimized IMCRA method
5) Output signal y of microphone array a (t) and noise Power Spectrum DensityAnd inputting the speech enhancement signal to an MMSE-LSA estimator to obtain a final speech enhancement signal.
Step 1) comprises:
there are J signals s in the set space j (t), J e 1,2,3,..j, n+1 microphones, the nth microphone receiving a signal x at time t n (t):
in the formula ,hn,j Is the channel impulse response from the jth sound source to the nth microphone, v n (t) represents an additive noise,represents convolution, where N represents the microphones in the array, N e 1,2, N; at signal s j In (t), j=1 represents a desired speech signal, j=2, 3, & gt, and J represents an interference signal.
Step 2) comprises:
signal X received at time t for microphone array N (t) calculating the output signal y of the array using beamforming a (t):
In which the input signal of the array is X N (t),X N (t)=[x 1 (t),x 2 (t),...,x N (t)]The weight of beam forming is w a (t),w a (t)=[w a,1 (t),w a,2 (t),K,w a,N (t)]。
The step 3) comprises the following steps:
signal X to be received at time t based on microphone array N (t) obtaining a noise signal by blocking matrix B as follows:
then the noise signal is averaged by the following formula to obtain an average noise signal B AV (t):
Where N represents the total number of microphones.
Step 4) is to output the signal y of the microphone array a (t) average noise Signal B AV (t) obtaining a corresponding frequency domain signal y after Fourier transformation a (k, l) and B AV (k, l) estimating the power spectral density of the noise by using the optimized IMCRA method to the frequency domain signalWhere k represents frequency.
The optimized IMCRA method is that an average noise signal B is added into an updating formula of noise power spectrum density in the IMCRA method AV (t) the specific formula is as follows:
wherein ,is the estimated noise power spectral density at frequency k and frame i +.>Is the estimated noise power spectral density of the 1 st+1 frame; alpha d Is a smoothing factor; p (k, l) is the probability of signal presence.
According to the microphone array voice enhancement method based on the optimized IMCRA, the noise estimation algorithm is combined with the microphone array voice enhancement, so that the accuracy of noise estimation is improved, and the calculated amount of voice enhancement is reduced. The method combines microphone array voice enhancement and single-channel voice enhancement, not only utilizes the airspace characteristic of microphone array voice enhancement, but also further removes the noise residue of the traditional microphone array voice enhancement. The invention firstly uses a wave beam forming method to process the noise-containing signals received by the microphone array, so that the signal-to-noise ratio of the noise-containing signals is improved. The optimized IMCRA method can estimate more accurate noise power spectral density, so that the MMSE-LSA estimator can output enhanced voice signals with higher quality.
Drawings
Fig. 1 is a block diagram of a microphone array speech enhancement method based on an optimized IMCRA of the present invention.
Detailed Description
A microphone array speech enhancement method based on an optimized IMCRA of the present invention is described in detail below with reference to the examples and figures.
As shown in fig. 1, the microphone array voice enhancement method based on the optimized IMCRA of the present invention includes the following steps:
1) Respectively calculating the signal x collected by each microphone n (t) forming a microphone array; comprising the following steps:
there are J signals s in the set space j (t), J e 1,2,3,..j, n+1 microphones, the nth microphone receiving a signal x at time t n (t):
in the formula ,hn,j Is the channel impulse response from the jth sound source to the nth microphone, v n (t) represents an additive noise,represents convolution, where N represents the microphones in the array, N e 1,2, N; at signal s j In (t), j=1 represents a desired speech signal, j=2, 3, & gt, and J represents an interference signal.
2) Calculating the output signal y of the microphone array a (t); comprising the following steps:
signal X received at time t for microphone array N (t) calculating the output signal y of the array using beamforming a (t):
In which the input signal of the array is X N (t),X N (t)=[x 1 (t),x 2 (t),…,x N (t)]The weight of beam forming is w a (t),w a (t)=[w a,1 (t),w a,2 (t),K,w a,N (t)]。
3) Calculating average noise signal B in blocking matrix output signals AV (t); comprising the following steps:
signal X to be received at time t based on microphone array N (t) obtaining a noise signal by blocking matrix B as follows:
then the noise signal is averaged by the following formula to obtain an average noise signal B AV (t):
Where N represents the total number of microphones.
4) Noise power spectral density estimated using optimized IMCRA method
In particular the output signal y of the microphone array a (t) average noise Signal B AV (t) obtaining a corresponding frequency domain signal y after Fourier transformation a (k, l) and B AV (k, l) estimating the power spectral density of the noise by using the optimized IMCRA method to the frequency domain signalWhere k represents frequency.
The optimized IMCRA method of the invention is to add a flat to an update formula of noise power spectrum density in the IMCRA methodNoise equalizing signal B AV (t) the specific formula is as follows:
wherein ,is the estimated noise power spectral density at frequency k and frame i +.>Is the estimated noise power spectral density of the 1 st+1 frame; alpha p Is a smoothing factor; p (k, l) is the probability of signal presence.
5) Output signal y of microphone array a (t) and noise Power Spectrum DensityAnd inputting the speech enhancement signal to an MMSE-LSA estimator to obtain a final speech enhancement signal.
The effect of a microphone array speech enhancement method based on an optimized IMCRA of the present invention is illustrated by comparing with existing methods in the same simulation environment.
The simulation environment is built on an open source tool McRoomsim, which can generate a reverberation room impulse response of a user-defined rectangular room. The room properties are set to sound-deadening chambers with a sound absorption coefficient of 1, which means that there is no reverberation or other noise in the environment. The microphone array was a uniform circular microphone array (UCA) with a gap of 18 cm, n=7. The UCA is centered at the origin (0, 0) of the coordinate system with its center being the positive X-axis. The target signal is placed at 15 meters from the central axis. An interference source was established in the simulation, located at (7.5,7.5,7.5).
One male and one female are randomly selected from the TIMIT database 20. 5 noise signals were extracted from the noise-92 database 21, simulating different noise environments, namely pink noise, F16 (aircraft) noise, white noise, volwa (car) noise and M109 noise, respectively. The selected 2 clean speech and 5 noise signals are resampled at 16kHz and mixed with SNR at-10 dB, -5dB, 0dB, 5dB and 10dB to generate the target signal. The target signal and the interfering signal propagate to the microphone array in the simulated environment.
The microphone array voice enhancement method based on the optimized IMCRA, the fixed beam forming (DS) method and the voice enhancement method based on the MVDR method are used for carrying out comparison experiments, and the experimental results adopt a segmented signal-to-noise ratio (segSNR) and short-time objective intelligibility (STOI) to represent voice quality. The experimental results are shown in tables 1 to 5. Simulation experiment results show that the microphone array voice enhancement method based on the optimized IMCRA can improve voice enhancement performance, and the STOI and segSNR are respectively improved by 12% and 88%.
TABLE 1 average STOI score of original and enhanced signals (female)
TABLE 2 average STOI score of original and enhanced signals (Male)
TABLE 3 segSNR score of original and enhanced signals (female)
TABLE 4 segSNR score of original and enhanced signals (Male)
Table 5.M-sex expression of OIMCRA
From tables 1 to 4, it can be seen that the microphone array speech enhancement method based on the optimized IMCRA of the present invention can effectively improve the STOI and segSNR scores of the noise speech under various noise conditions. Experimental results show that the method of the invention improves STOI and segSNR by 12% and 88%, respectively.
Research shows that the male voice has more middle and low frequency components and the female voice has more middle and high frequency components. From table 5 it can be seen that the method of the invention has a noise reduction performance for women that is better than for men for different types of noise, and a noise reduction effect for women that is 20% better than for men.

Claims (1)

1. A microphone array speech enhancement method based on optimized IMCRA, comprising the steps of:
1) Respectively calculating the signal x collected by each microphone n (t) forming a microphone array; comprising the following steps:
there are J signals s in the set space j (t), J e 1,2,3,..j, n+1 microphones, the nth microphone receiving a signal x at time t n (t):
in the formula ,hn,j Is the channel impulse response from the jth sound source to the nth microphone, v n (t) represents an additive noise,represents convolution, where N represents the microphones in the array, N e 1,2, N; at signal s j In (t), j=1 represents a desired speech signal, j=2, 3, & gt, and J represents an interference signalA number;
2) Calculating the output signal y of the microphone array a (t); comprising the following steps:
signal X received at time t for microphone array N (t) calculating the output signal y of the array using beamforming a (t):
In which the input signal of the array is X N (t),X N (t)=[x 1 (t),x 2 (t),...,x N (t)]The weight of beam forming is w a (t),w a (t)=[w a,1 (t),w a,2 (t),K,w a,N (t)];
3) Calculating average noise signal B in blocking matrix output signals AV (t); comprising the following steps:
signal X to be received at time t based on microphone array N (t) obtaining a noise signal by blocking matrix B as follows:
then the noise signal is averaged by the following formula to obtain an average noise signal B AV (t):
Wherein N represents the total number of microphones;
4) Noise power spectral density estimated using optimized IMCRA methodIs to output signal y of microphone array a (t) average noise Signal B AV (t) obtaining a corresponding frequency domain signal y after Fourier transformation a (k, l) and B AV (k, l) estimating the power spectral density of the noise by using the optimized IMCRA method on the frequency domain signal>Wherein k represents a frequency;
the optimized IMCRA method is that an average noise signal B is added into an updating formula of noise power spectrum density in the IMCRA method AV (t) the specific formula is as follows:
wherein ,is the estimated noise power spectral density at frequency k and frame i +.>Is the estimated noise power spectral density of the 1 st+1 frame; alpha p Is a smoothing factor; p (k, l) is the probability of signal presence;
5) Output signal y of microphone array a (t) and noise Power Spectrum DensityAnd inputting the speech enhancement signal to an MMSE-LSA estimator to obtain a final speech enhancement signal. />
CN202010719382.9A 2020-07-23 2020-07-23 Microphone array voice enhancement method based on optimized IMCRA Active CN111968659B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010719382.9A CN111968659B (en) 2020-07-23 2020-07-23 Microphone array voice enhancement method based on optimized IMCRA

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010719382.9A CN111968659B (en) 2020-07-23 2020-07-23 Microphone array voice enhancement method based on optimized IMCRA

Publications (2)

Publication Number Publication Date
CN111968659A CN111968659A (en) 2020-11-20
CN111968659B true CN111968659B (en) 2023-10-31

Family

ID=73362804

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010719382.9A Active CN111968659B (en) 2020-07-23 2020-07-23 Microphone array voice enhancement method based on optimized IMCRA

Country Status (1)

Country Link
CN (1) CN111968659B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113329288B (en) * 2021-04-29 2022-07-19 开放智能技术(南京)有限公司 Bluetooth headset noise reduction method based on notch technology

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103456310A (en) * 2013-08-28 2013-12-18 大连理工大学 Transient noise suppression method based on spectrum estimation
CN103559887A (en) * 2013-11-04 2014-02-05 深港产学研基地 Background noise estimation method used for speech enhancement system
CN107301869A (en) * 2017-08-17 2017-10-27 珠海全志科技股份有限公司 Microphone array sound pick-up method, processor and its storage medium
CN108735225A (en) * 2018-04-28 2018-11-02 南京邮电大学 It is a kind of based on human ear masking effect and Bayesian Estimation improvement spectrum subtract method
CN108922554A (en) * 2018-06-04 2018-11-30 南京信息工程大学 The constant Wave beam forming voice enhancement algorithm of LCMV frequency based on logarithm Power estimation
CN109658949A (en) * 2018-12-29 2019-04-19 重庆邮电大学 A kind of sound enhancement method based on deep neural network
CN111128215A (en) * 2019-12-24 2020-05-08 声耕智能科技(西安)研究院有限公司 Single-channel real-time noise reduction method and system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10573301B2 (en) * 2018-05-18 2020-02-25 Intel Corporation Neural network based time-frequency mask estimation and beamforming for speech pre-processing

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103456310A (en) * 2013-08-28 2013-12-18 大连理工大学 Transient noise suppression method based on spectrum estimation
CN103559887A (en) * 2013-11-04 2014-02-05 深港产学研基地 Background noise estimation method used for speech enhancement system
CN107301869A (en) * 2017-08-17 2017-10-27 珠海全志科技股份有限公司 Microphone array sound pick-up method, processor and its storage medium
CN108735225A (en) * 2018-04-28 2018-11-02 南京邮电大学 It is a kind of based on human ear masking effect and Bayesian Estimation improvement spectrum subtract method
CN108922554A (en) * 2018-06-04 2018-11-30 南京信息工程大学 The constant Wave beam forming voice enhancement algorithm of LCMV frequency based on logarithm Power estimation
CN109658949A (en) * 2018-12-29 2019-04-19 重庆邮电大学 A kind of sound enhancement method based on deep neural network
CN111128215A (en) * 2019-12-24 2020-05-08 声耕智能科技(西安)研究院有限公司 Single-channel real-time noise reduction method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Speech Enhancement Based on IMCRA Incorporating noise classification algorithm;Ji-Hyun Song 等;《The Transactions of the Korean Institute of Electrical Engineers》;第第61卷卷(第第12期期);第1920-1925页 *
基于噪声估计的麦克风阵列语音增强算法研究;李秋颖;《中国优秀硕士论文全文数据库 电子与通信工程辑》(第第02期期);全文 *

Also Published As

Publication number Publication date
CN111968659A (en) 2020-11-20

Similar Documents

Publication Publication Date Title
CN106782590B (en) Microphone array beam forming method based on reverberation environment
CN108172231B (en) Dereverberation method and system based on Kalman filtering
CN106875938B (en) Improved nonlinear self-adaptive voice endpoint detection method
CN105280193B (en) Priori signal-to-noise ratio estimation method based on MMSE error criterion
Niwa et al. Post-filter design for speech enhancement in various noisy environments
Sehr et al. Towards a better understanding of the effect of reverberation on speech recognition performance
CN115424627A (en) Voice enhancement hybrid processing method based on convolution cycle network and WPE algorithm
CN111968659B (en) Microphone array voice enhancement method based on optimized IMCRA
CN103929704B (en) The method and system that a kind of adaptive acoustic feedback based on transform domain is eliminated
Dmour et al. A new framework for underdetermined speech extraction using mixture of beamformers
Kinoshita et al. Multi-step linear prediction based speech dereverberation in noisy reverberant environment.
Heitkaemper et al. Smoothing along frequency in online neural network supported acoustic beamforming
Miyazaki et al. Theoretical analysis of parametric blind spatial subtraction array and its application to speech recognition performance prediction
Priyanka et al. GSC adaptive beamforming using fast NLMS algorithm for speech enhancement
Pfeifenberger et al. Blind source extraction based on a direction-dependent a-priori SNR.
Shen et al. A modified cross power-spectrum phase method based on microphone array for acoustic source localization
Kawase et al. Automatic parameter switching of noise reduction for speech recognition
KR100875264B1 (en) Post-processing method for blind signal separation
Chen et al. Research on Novel Normal Fuzzy Kalman Filter for Speech Enhancement in Noisy Environment
Wang et al. Multi-channel speech enhancement based on the MVDR beamformer and postfilter
Hu et al. A robust adaptive speech enhancement system for vehicular applications
Mowlaee et al. Target speaker separation in a multisource environment using speaker-dependent postfilter and noise estimation
JP2020141160A (en) Sound information processing device and programs
CN116320947B (en) Frequency domain double-channel voice enhancement method applied to hearing aid
Chan et al. A decision-directed adaptive gain equalizer for assistive hearing instruments

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant