CN102117618B - Method, device and system for eliminating music noise - Google Patents

Method, device and system for eliminating music noise Download PDF

Info

Publication number
CN102117618B
CN102117618B CN200910215212A CN200910215212A CN102117618B CN 102117618 B CN102117618 B CN 102117618B CN 200910215212 A CN200910215212 A CN 200910215212A CN 200910215212 A CN200910215212 A CN 200910215212A CN 102117618 B CN102117618 B CN 102117618B
Authority
CN
China
Prior art keywords
noise
speech signal
noisy speech
subband
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN200910215212A
Other languages
Chinese (zh)
Other versions
CN102117618A (en
Inventor
程荣
张崇岩
韦春妍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN200910215212A priority Critical patent/CN102117618B/en
Publication of CN102117618A publication Critical patent/CN102117618A/en
Application granted granted Critical
Publication of CN102117618B publication Critical patent/CN102117618B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention discloses a method for eliminating music noise, comprising the following steps of: calculating the signal to noise ratio of a noise-containing speech signal at a low frequency band; modifying the amplitude spectrum estimation parameter of the noise-containing speech signal when the signal to noise ratio meets a preset threshold; and carrying out noise suppression on the noise-containing speech signal by utilizing the modified amplitude spectrum estimation parameter. The embodiment of the invention further provides a device and system for eliminating music noise. Before the noise suppression is carried out, the noise-containing speech signal is detected, the noise-containing speech signal capable of generating music noise is detected, and the amplitude spectrum estimation parameter of the noise-containing speech signal is modified, thus the precision of the detection of the music noise is improved and the music noise is effectively eliminated.

Description

A kind of method, Apparatus and system of eliminating the music noise
Technical field
The embodiment of the invention relates to the voice communication technical field, relates in particular to a kind of method, Apparatus and system of eliminating the music noise.
Background technology
There are unusual fluctuations at random in the amplitude of nonstationary noise, and the noise suppression algorithm of estimating based on amplitude spectrum capable of using carries out squelch.In the process of noise being carried out the amplitude spectrum estimation; If the noise component of certain frequency is bigger; Then can cause estimated result inaccurate, a part of noise is remained, on frequency spectrum, present spike at random; Then form the residual noise of rhythmical fluctuating, similar music acoustically, so be called " music noise ".
Existing a kind of noise reduction techniques based on music noise post processing and filtering, the post-processing technology of this techniques make use squelch is eliminated the music noise that produces in the nonstationary noise process of inhibition.This technological implementation is to carry out the detection of music noise after to the nonstationary noise noise reduction, then the critical band of detected generation music noise is carried out aftertreatment, to eliminate the music noise.
The inventor finds that there are the following problems at least in the prior art in realizing process of the present invention:
Nonstationary noise through after the noise reduction process can lose original statistical property, makes to cause the remaining music noise characteristic and the feature similarity of voice signal music walkaway precision low, and then can't eliminate the music noise well.
Summary of the invention
Embodiments of the invention provide a kind of method, Apparatus and system of eliminating the music noise, thereby effectively eliminate the music noise.
The objective of the invention is to realize through following technical scheme:
A kind of method of eliminating the music noise comprises:
Calculate the signal to noise ratio (S/N ratio) of noisy speech signal in low-frequency band;
When said signal to noise ratio (S/N ratio) satisfies setting threshold, the amplitude spectrum estimated parameter of said noisy speech signal is revised;
Utilize revised amplitude spectrum estimated parameter said noisy speech signal to be carried out the squelch of estimating based on amplitude spectrum.
A kind of device of eliminating the music noise comprises:
The snr computation module is used to calculate the signal to noise ratio (S/N ratio) of noisy speech signal in low-frequency band;
The parameter correcting module is used for when signal to noise ratio (S/N ratio) that said computing module calculates satisfies setting threshold, the amplitude spectrum estimated parameter of said noisy speech signal being revised;
The parameter sending module is used for the amplitude spectrum estimated parameter after the said parameter correcting module processing is sent to the noise suppressor of estimating based on amplitude spectrum.
A kind of system of eliminating the music noise comprises music noise elimination apparatus and noise suppressor:
Said music noise elimination apparatus be used to calculate noisy speech signal in the signal to noise ratio (S/N ratio) of low-frequency band when said signal to noise ratio (S/N ratio) satisfies setting threshold; Amplitude spectrum estimated parameter to said noisy speech signal is revised, and revised amplitude spectrum estimated parameter is sent to said noise suppressor;
The revised amplitude spectrum estimated parameter that said noise suppressor is used to utilize said noise suppressor to send carries out the squelch based on the amplitude spectrum estimation to said noisy speech signal.
Technical scheme by the embodiment of the invention described above provides can be found out; In the embodiment of the invention; Because before carrying out squelch, noisy speech signal is detected, detect the noisy speech signal that possibly produce the music noise and it is carried out the correction of amplitude spectrum estimated parameter; Thereby improved the precision of music walkaway, and then effectively eliminated the music noise.
Description of drawings
In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art; To do one to the accompanying drawing of required use in embodiment or the description of the Prior Art below introduces simply; Obviously, the accompanying drawing in describing below only is some embodiments of the present invention, for those of ordinary skills; Under the prerequisite of not paying creative work property, can also obtain other accompanying drawing according to these accompanying drawings.
The processing procedure synoptic diagram that Fig. 1 provides for the embodiment of the invention;
The processing procedure synoptic diagram that Fig. 2 provides for the specific embodiment of the invention;
The apparatus structure synoptic diagram that Fig. 3 provides for the embodiment of the invention;
The system architecture synoptic diagram that Fig. 4 provides for the embodiment of the invention.
Embodiment
To combine the accompanying drawing in the embodiment of the invention below, the technical scheme in the embodiment of the invention is carried out clear, intactly description, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills are not making the every other embodiment that is obtained under the creative work prerequisite, all belong to the scope of the present invention's protection.
The inventor is through a large amount of analysis of experimentss, and find that the nonstationary noise that produces the music noise possesses following characteristic: on frequency domain, the nonstationary noise that produces the music noise is distributed in low-frequency band (the upper limiting frequency scope of low-frequency band is at 800 hertz~1100 hertz); On time domain, nonstationary noise can produce the music noise because amplitude spectrum estimates inaccurate when unusual fluctuations occurring.The sub-band division of the voice signal of 8000 hertz of samplings is an example in the following table 1:
125-250 1250-1437.5
250-375 1437.5-1687.5
375-500 1687.5-1937.5
500-625 1937.5-2250
625-750 2250-2625
750-875 2625-3062.5
875-1062.5 3062.5-3500
1062.5-1250 3500-4000
Wherein, noisy speech signal is distributed in 0~4000 hertz, and by being divided into 16 sub-frequency bands shown in the table 1, then preceding 7 sub-frequency bands are low-frequency band (being that the low-frequency band scope is at 0~1062.5 hertz) with noisy speech signal.
The embodiment of the invention provides a kind of music noise cancellation method; Characteristic based on above-mentioned nonstationary noise; Before carrying out squelch; Noisy speech signal to input is carried out feature detection, judging whether the producing music noise, and the noisy speech signal that possibly produce the music noise is carried out making squelch after the correction that amplitude spectrum estimates handle.As shown in Figure 1, this method comprises:
S101, calculate the signal to noise ratio (S/N ratio) of noisy speech signal in low-frequency band;
If this signal to noise ratio (S/N ratio) of S102 satisfies setting threshold, think that then above-mentioned noisy speech signal will produce the music noise, the amplitude spectrum estimated parameter of this noisy speech signal is revised;
S103, utilize revised amplitude spectrum estimated parameter this noisy speech signal to be carried out the squelch of estimating based on amplitude spectrum.
In the above-mentioned processing procedure,, think that then above-mentioned noisy speech signal can not produce the music noise, can directly carry out squelch and handle if this signal to noise ratio (S/N ratio) does not satisfy setting threshold.Wherein, setting threshold comprises lower threshold R L(span is between 0.5~1.5) and upper limit threshold R H(span is between 3.5~4.5), signal to noise ratio (S/N ratio) are satisfied setting threshold and are meant that this signal to noise ratio (S/N ratio) is greater than lower threshold R LAnd less than upper limit threshold R H
The method that the invention described above embodiment provides; Because before carrying out squelch; Noisy speech signal is detected; Detect the noisy speech signal that possibly produce the music noise and it is carried out the correction of amplitude spectrum estimated parameter, thereby improved the precision of music walkaway, and then effectively eliminate the music noise.
Among the invention described above embodiment, the amplitude spectrum estimated parameter of noisy speech signal is revised.As for example and non-limiting, in the application of the noise reduction techniques of estimating based on amplitude spectrum, above-mentioned amplitude spectrum estimated parameter specifically can comprise: the noise energy of noisy speech signal and the gain coefficient of noisy speech signal.Accordingly, can adopt the method for smothing filtering that the amplitude spectrum estimated parameter of noisy speech signal is revised, specifically comprise: the noise energy to above-mentioned noisy speech signal is carried out The disposal of gentle filter; Gain coefficient to above-mentioned noisy speech signal carries out The disposal of gentle filter.To eliminate issuable music noise in the noise suppression process.Wherein, revising preceding amplitude spectrum estimated parameter specifically obtains from noise suppressor.
To carry out detailed explanation to the concrete implementation of the embodiment of the invention in actual application below.
The specific embodiment of the invention provides in a kind of squelch application based on the amplitude spectrum estimation, eliminates the method for music noise, and its concrete processing procedure is as shown in Figure 2, comprises following operation:
S201, to the input noisy speech signal calculate in the signal to noise ratio (S/N ratio) of low-frequency band;
Signal to noise ratio (S/N ratio) is meant the signal energy that comprises in the above-mentioned noisy speech signal and the ratio of noise energy; Wherein, Signal energy and noise energy are obtained from noise suppressor; Non-limiting as giving an example; In the application of the subband noise suppressor that comprises sub-band division module, subband signal energy estimation block, voice VAD judging module, noise energy estimation module, subband snr computation module, subband gain calculation module, signal energy is from the subband signal energy estimation block of subband noise suppressor, and noise energy obtains from the noise energy estimation module of subband noise suppressor.I frame noisy speech signal is following in the computing formula of the signal to noise ratio (S/N ratio) r of low-frequency band (i):
r ( i ) = Σ m N L E ch ( m , i ) Σ m N L E n ( m , i ) 0≤m<N L
Wherein, m is the sub-band serial number of i frame noisy speech signal, and in above-mentioned subband noise suppressor, noisy speech signal is divided into 16 subbands, and the sub-band division method is seen table 1; E Ch(m i) is signal energy in m subband of i frame noisy speech signal; E n(m i) is noise energy in m subband of i frame noisy speech signal; N LBe the higher limit of low frequency sub-band, the inventor is through analyzing the optimal N that obtains LValue is 7 (with table 1 are example, and then corresponding frequency is 1062.5 hertz), and the m span is more than or equal to 0 and less than N LInteger.
S202, judge whether the low-frequency band signal to noise ratio (S/N ratio) that calculates among the S201 satisfies setting threshold,, carry out S203,, then carry out S205 if do not satisfy if satisfy;
Setting threshold comprises lower threshold R LWith upper limit threshold R H, specifically can judge whether the low-frequency band signal to noise ratio (S/N ratio) of above-mentioned i frame noisy speech signal satisfies setting threshold through following discriminant:
Figure G2009102152120D00052
Wherein, M_flag (i) value is 1 o'clock, is illustrated in through after the noise reduction process of preceding 1~7 subband, and i frame noisy speech signal will produce the music noise; M_flag (i) value is 0 o'clock, representes that i frame noisy speech signal can not produce the music noise.Condition (r L<r (i)<r H) && (m<N L) (the low frequency sub-band higher limit is N to be illustrated in low-frequency band L) and r (i) greater than lower threshold R LAnd less than upper limit threshold R HThrough a large amount of preferred lower limit threshold value R that obtain that test of inventor LBe 1, preferred upper limit threshold R HBe 4.
S203, be that the amplitude spectrum estimated parameter of 1 i frame noisy speech signal is revised to M_flag (i), and carry out S204;
Wherein, in the above-mentioned application based on the subband noise suppressor, the amplitude spectrum estimated parameter specifically comprises: the subband noise energy of i frame noisy speech signal and the subband gain coefficient of i frame noisy speech signal.
The amplitude spectrum estimated parameter revised comprise: 1) the noise energy estimation module from the subband noise suppressor is obtained the subband noise energy, and it is carried out The disposal of gentle filter, and non-limiting, the computing formula of The disposal of gentle filter is following as for example:
E N(m, i)=(1-α 0) E n(m, i-1)+α 0E n(m, i) (and if only if M_flag (i)==1)
E N(m, i) expression is through the noise energy of m the subband of i frame noisy speech signal of correction; E n(m, i-1) noise energy of m the subband of i-1 frame noisy speech signal that provide of expression noise energy estimation module; E n(m, i) noise energy of m the subband of i frame noisy speech signal that provide of expression noise energy estimation module; α 0Be constant, the acquiescence value is 0.8; 0≤m<N L, N LValue is 15.
2) obtain the subband gain coefficient from the subband gain calculation module of subband noise suppressor, and it is carried out The disposal of gentle filter, and non-limiting, the computing formula of The disposal of gentle filter is following as for example:
g CH(m,i)=min{g ch(m,i-1),g ch(m,i)}
g CH(m, i) expression is through the gain coefficient of m the subband of i frame of correction; g Ch(m, i-1) gain coefficient of m the subband of i-1 frame that provide of expression subband gain calculation module; g Ch(m, i) gain coefficient of m the subband of i frame that provide of expression subband gain calculation module.
S204, the squelch of utilizing revised amplitude spectrum estimated parameter that i frame noisy speech signal is carried out estimating based on amplitude spectrum are handled, and this operation is carried out by above-mentioned subband noise suppressor;
S205, the squelch that the i frame noisy speech signal of M_flag (i)=0 is carried out estimating based on amplitude spectrum are handled, and this operation is carried out by above-mentioned subband noise suppressor.
In the above-mentioned processing procedure, can the music noise be set to noisy speech signal and produce sign, and this music noise generation sign is provided with based on the value of M_flag (i).Then in the process of carrying out S203~S204 and S205, can also produce the identical speech signal segments (forming) of sign to the music noise and carry out batch processing by some frame noisy speech signal.
The operation of above-mentioned S201~S203 can be accomplished by the music noise elimination apparatus of independent setting; Also can in the noise suppressor of estimating based on amplitude spectrum, set up the music noise cancellation module and accomplish, can also carry out software upgrading to the system of the existing noise suppressor of estimating based on amplitude spectrum and accomplish.Through the music noise cancellation method that the invention described above embodiment provides, effectively improved the accuracy of detection of music noise, effectively eliminate the music noise.Especially the implementation through noise suppressor is carried out software upgrading, implementation complexity is low, and need not to increase extra hardware.
The all or part of step of realization said method embodiment can be accomplished through the relevant hardware of programmed instruction, and aforesaid program can be stored in the computer read/write memory medium, and this program the step that comprises said method embodiment when carrying out; And aforesaid storage medium comprises: various media that can be program code stored such as ROM, RAM, magnetic disc or CD.
The embodiment of the invention also provides a kind of device of eliminating the music noise; Based on the characteristic of above-mentioned nonstationary noise, before carrying out squelch, the noisy speech signal of input is carried out feature detection; To judge whether the to produce music noise; And the noisy speech signal that possibly produce the music noise is carried out making squelch after the correction that amplitude spectrum estimates handle, this apparatus structure is as shown in Figure 3, and concrete implementation structure comprises:
Snr computation module 301; Be used to calculate the signal to noise ratio (S/N ratio) of noisy speech signal in low-frequency band; Wherein, Signal to noise ratio (S/N ratio) is meant the signal energy that comprises in the above-mentioned noisy speech signal and the ratio of noise energy, and signal energy and noise energy are obtained from noise suppressor, and noise energy obtains from the noise energy estimation module of subband noise suppressor; Said snr computation module 301 comprises the snr computation submodule, is used to adopt following formula to calculate i frame noisy speech signal at the signal to noise ratio (S/N ratio) r of low-frequency band (i):
r ( i ) = Σ m N L E Ch ( m , i ) Σ m N L E n ( m , i ) 0≤m<N LM is the sub-band serial number of i frame noisy speech signal, and in above-mentioned subband noise suppressor, noisy speech signal is divided into 16 subbands, and the sub-band division method is seen table 1; E Ch(m i) is signal energy in m subband of i frame noisy speech signal; E n(m i) is noise energy in m subband of i frame noisy speech signal; N LBe the higher limit of low frequency sub-band, the inventor is through analyzing the optimal N that obtains LValue is 7 (with table 1 are example, and then corresponding frequency is 1062.5 hertz), and the m span is more than or equal to 0 and less than N LInteger;
Parameter correcting module 302; When the signal to noise ratio (S/N ratio) that calculates when aforementioned calculation module 301 satisfies setting threshold; Think that then above-mentioned noisy speech signal will produce the music noise, parameter correcting module 302 is used for the amplitude spectrum estimated parameter of this noisy speech signal is revised;
Parameter sending module 303 is used for the amplitude spectrum estimated parameter after said parameter correcting module 302 processing is sent to the noise suppressor of estimating based on amplitude spectrum, so that carry out noise reduction process.
The device that the invention described above embodiment provides can also comprise judge module (figure does not show); Be used to judge whether the signal to noise ratio (S/N ratio) that snr computation module 301 calculates satisfies setting threshold; If this signal to noise ratio (S/N ratio) satisfies setting threshold; Then trigger parameter correcting module 302 work if this signal to noise ratio (S/N ratio) does not satisfy setting threshold, think that then above-mentioned noisy speech signal can not produce the music noise; 303 work of trigger parameter sending module send to noise suppressor by parameter sending module 303 with this noisy speech signal and directly carry out the squelch processing.Wherein, setting threshold comprises lower threshold R L(span is between 0.5~1.5) and upper limit threshold R H(span is between 3.5~4.5), signal to noise ratio (S/N ratio) are satisfied setting threshold and are meant that this signal to noise ratio (S/N ratio) is greater than lower threshold R LAnd less than upper limit threshold R H
The device that the invention described above embodiment provides; Because before carrying out squelch; Noisy speech signal is detected; Detect the noisy speech signal that possibly produce the music noise and it is carried out the correction of amplitude spectrum estimated parameter, thereby improved the precision of music walkaway, and then effectively eliminate the music noise.
Among the invention described above embodiment, as for example and non-limiting, in the application of the noise reduction techniques of estimating based on amplitude spectrum, above-mentioned amplitude spectrum estimated parameter specifically can comprise: the noise energy of noisy speech signal and the gain coefficient of noisy speech signal.Accordingly, parameter correcting module 302 can comprise:
The first parameter correcting module 3021; Be used for the noise energy of said noisy speech signal is revised; Specifically be used for the noise energy of said noisy speech signal is carried out The disposal of gentle filter, wherein, the noise energy estimation module of noise energy from the subband noise suppressor obtained; The said first parameter correcting module 3021 comprises the first parameter correction submodule, is used to utilize following formula that the subband noise energy of i frame noisy speech signal is carried out The disposal of gentle filter: E N(m, i)=(1-α 0) E n(m, i-1)+α 0E n(m, i) (and if only if M_flag (i)==1) E N(m, i) expression is through the noise energy of m the subband of i frame noisy speech signal of correction; E n(m, i-1) noise energy of m the subband of i-1 frame noisy speech signal that provide of expression noise energy estimation module; E n(m, i) noise energy of m the subband of i frame noisy speech signal that provide of expression noise energy estimation module; α 0Be constant, the acquiescence value is 0.8; 0≤m<N L, N LValue is 15;
The second parameter correcting module 3022; Be used for the gain coefficient of said noisy speech signal is revised; Specifically be used for the gain coefficient of said noisy speech signal is carried out The disposal of gentle filter, wherein, gain coefficient obtains from the subband gain calculation module of subband noise suppressor; The said second parameter correcting module 3022 comprises the second parameter correction submodule, is used to utilize following formula that the subband gain coefficient of i frame noisy speech signal is carried out The disposal of gentle filter: g CH(m, i)=min{g Ch(m, i-1), g Ch(m, i) } g CH(m, i) expression is through the gain coefficient of m the subband of i frame of correction; g Ch(m, i-1) gain coefficient of m the subband of i-1 frame that provide of expression subband gain calculation module; g Ch(m, i) gain coefficient of m the subband of i frame that provide of expression subband gain calculation module.
Eliminate issuable music noise in the noise suppression process through the mode of smothing filtering.Wherein, revising preceding amplitude spectrum estimated parameter specifically obtains from noise suppressor.
The embodiment of the invention also provides a kind of system of eliminating the music noise; Based on the characteristic of above-mentioned nonstationary noise, before carrying out squelch, the noisy speech signal of input is carried out feature detection; To judge whether the to produce music noise; And the noisy speech signal that possibly produce the music noise is carried out making squelch after the correction that amplitude spectrum estimates handle, the implementation structure of this system is as shown in Figure 4, and concrete implementation structure comprises music noise elimination apparatus 401 and noise suppressor 402:
Music noise elimination apparatus 401 be used to calculate noisy speech signal in the signal to noise ratio (S/N ratio) of low-frequency band when said signal to noise ratio (S/N ratio) satisfies setting threshold; Amplitude spectrum estimated parameter to said noisy speech signal is revised, and revised amplitude spectrum estimated parameter is sent to said noise suppressor;
The revised amplitude spectrum estimated parameter that noise suppressor 402 is used to utilize said noise suppressor to send carries out the squelch based on the amplitude spectrum estimation to said noisy speech signal.
Wherein, the device of the concrete implementation structure of the music noise elimination apparatus 401 elimination music noise that can provide with reference to the invention described above embodiment.Noise suppressor 402 can adopt the existing noise suppressor of estimating based on amplitude spectrum.
If above-mentioned signal to noise ratio (S/N ratio) does not satisfy setting threshold, think that then above-mentioned noisy speech signal can not produce the music noise, music noise elimination apparatus 401 sends to noise suppressor 402 with this noisy speech signal and directly carries out the squelch processing.Wherein, setting threshold comprises lower threshold R LWith upper limit threshold R H, signal to noise ratio (S/N ratio) satisfies setting threshold and is meant that this signal to noise ratio (S/N ratio) is greater than lower threshold R LAnd less than upper limit threshold R H
Among the invention described above embodiment; Said noise suppressor 402 also is used for sending the amplitude spectrum estimated parameter of unmodified to said music noise elimination apparatus 401; Said amplitude spectrum estimated parameter comprises: the noise energy of above-mentioned noisy speech signal and the gain coefficient of above-mentioned noisy speech signal.
The system that the invention described above embodiment provides; Because before carrying out squelch; Noisy speech signal is detected; Detect the noisy speech signal that possibly produce the music noise and it is carried out the correction of amplitude spectrum estimated parameter, thereby improved the precision of music walkaway, and then effectively eliminate the music noise.
The above; Be merely the preferable embodiment of the present invention, but protection scope of the present invention is not limited thereto, any technician who is familiar with the present technique field is in the technical scope that the present invention discloses; The variation that can expect easily or replacement all should be encompassed within protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion with the protection domain of claim.

Claims (10)

1. a method of eliminating the music noise is characterized in that, comprising:
Calculate the signal to noise ratio (S/N ratio) of noisy speech signal in low-frequency band;
Said signal to noise ratio (S/N ratio) is meant the signal energy that comprises in the said noisy speech signal and the ratio of noise energy, and when said noisy speech signal was unit with the frame, i frame noisy speech signal was following at the snr computation formula of low-frequency band:
0≤m<N L, m is the sub-band serial number of said i frame noisy speech signal; E Ch(m i) is signal energy in m subband of said i frame noisy speech signal; E n(m i) is noise energy in m subband of said i frame noisy speech signal; N LHigher limit for said low-frequency band;
When said signal to noise ratio (S/N ratio) satisfies setting threshold, the amplitude spectrum estimated parameter of said noisy speech signal is revised;
This method also comprises: whether the low-frequency band signal to noise ratio (S/N ratio) of judging said i frame noisy speech signal through following discriminant satisfies setting threshold:
Figure FDA00001756348300012
Wherein, M_flag (i) value is 1 o'clock, is illustrated in through after the noise reduction process of preceding 1~7 subband, and i frame noisy speech signal will produce the music noise; M_flag (i) value is 0 o'clock, representes that i frame noisy speech signal can not produce the music noise; Condition (r L<r (i)<r H) && (m<N L) be illustrated in low-frequency band and r (i) greater than lower threshold R LAnd less than upper limit threshold R H
Utilize revised amplitude spectrum estimated parameter said noisy speech signal to be carried out the squelch of estimating based on amplitude spectrum.
2. method according to claim 1 is characterized in that the nonstationary noise of said noisy speech signal is distributed in low-frequency band, and the upper limiting frequency scope of said low-frequency band is 800~1100 hertz.
3. method according to claim 1; It is characterized in that; Said signal to noise ratio (S/N ratio) satisfies setting threshold and is meant: said signal to noise ratio (S/N ratio) is greater than lower threshold and less than upper limit threshold, and the span of said lower threshold is for O.5~1.5, and the span of said upper limit threshold is 3.5~4.5.
4. according to any described method of claim 1~3, it is characterized in that said amplitude spectrum estimated parameter to said noisy speech signal is revised and comprised:
Noise energy to said noisy speech signal is carried out The disposal of gentle filter;
Gain coefficient to said noisy speech signal carries out The disposal of gentle filter.
5. method according to claim 4 is characterized in that, the noise energy of said noisy speech signal is carried out The disposal of gentle filter comprise: utilize following formula that the subband noise energy of i frame noisy speech signal is carried out The disposal of gentle filter:
E N(m, i)=(1-α 0) E n(m, i-1)+α 0E n(m, i) and if only if M_flag (i)==1,
E N(m, i) expression is through the noise energy of m the subband of i frame noisy speech signal of correction; E n(m, i-1) noise energy of m the subband of i-1 frame noisy speech signal that provide of expression noise energy estimation module; E n(m, i) noise energy of m the subband of i frame noisy speech signal that provide of expression noise energy estimation module; α 0Be constant; 0≤m<N L
The gain coefficient of said noisy speech signal is carried out The disposal of gentle filter to be comprised: utilize following formula that the subband gain coefficient of i frame noisy speech signal is carried out The disposal of gentle filter:
g CH(m,i)=min{g ch(m,i-1),g ch(m,i)}
g CH(m, i) expression is through the gain coefficient of m the subband of i frame of correction; g Ch(m, i-1) gain coefficient of m the subband of i-1 frame that provide of expression subband gain calculation module; g Ch(m, i) gain coefficient of m the subband of i frame that provide of expression subband gain calculation module.
6. a device of eliminating the music noise is characterized in that, comprising:
The snr computation module is used to calculate the signal to noise ratio (S/N ratio) of noisy speech signal in low-frequency band;
Said snr computation module comprises:
The snr computation submodule is used to adopt following formula to calculate i frame noisy speech signal at the signal to noise ratio (S/N ratio) r of low-frequency band (i):
Figure FDA00001756348300031
0≤m<N LM is the sub-band serial number of i frame noisy speech signal, E Ch(m i) is signal energy in m subband of i frame noisy speech signal; E n(m i) is noise energy in m subband of i frame noisy speech signal; N LHigher limit for said low-frequency band;
Judge module; Be used to judge whether the signal to noise ratio (S/N ratio) that snr computation module 301 calculates satisfies setting threshold, if this signal to noise ratio (S/N ratio) satisfies setting threshold, then trigger parameter correcting module work; If this signal to noise ratio (S/N ratio) does not satisfy setting threshold, then trigger parameter sending module work; Wherein, setting threshold comprises lower threshold R LWith upper limit threshold R H, judge through following discriminant whether the low-frequency band signal to noise ratio (S/N ratio) of above-mentioned i frame noisy speech signal satisfies setting threshold:
Wherein, M_flag (i) value is 1 o'clock, is illustrated in through after the noise reduction process of preceding 1~7 subband, and i frame noisy speech signal will produce the music noise; M_flag (i) value is 0 o'clock, representes that i frame noisy speech signal can not produce the music noise; Condition (r L<r (i)<r H) && (m<N L) be illustrated in low-frequency band and r (i) greater than lower threshold R LAnd less than upper limit threshold R H, said low-frequency band higher limit is N L
The parameter correcting module is used for when signal to noise ratio (S/N ratio) that said computing module calculates satisfies setting threshold, the amplitude spectrum estimated parameter of said noisy speech signal being revised;
The parameter sending module is used for the amplitude spectrum estimated parameter after the said parameter correcting module processing is sent to the noise suppressor of estimating based on amplitude spectrum.
7. device according to claim 6 is characterized in that, said parameter correcting module comprises:
The first parameter correcting module is used for the noise energy of said noisy speech signal is carried out The disposal of gentle filter;
The second parameter correcting module is used for the gain coefficient of said noisy speech signal is carried out The disposal of gentle filter.
8. device according to claim 7 is characterized in that, the said first parameter correcting module comprises: the first parameter correction submodule is used to utilize following formula that the subband noise energy of i frame noisy speech signal is carried out The disposal of gentle filter:
E N(m, i)=(1-α 0) E n(m, i-1)+α 0E n(m, i) and if only if M_flag (i)==1,
Wherein, E N(m, i) expression is through the noise energy of m the subband of i frame noisy speech signal of correction; E n(m, i-1) noise energy of m the subband of i-1 frame noisy speech signal that provide of expression noise energy estimation module; E n(m, i) noise energy of m the subband of i frame noisy speech signal that provide of expression noise energy estimation module; α 0Be constant; 0≤m<N L
The said second parameter correcting module comprises: the second parameter correction submodule is used to utilize following formula that the subband gain coefficient of i frame noisy speech signal is carried out The disposal of gentle filter:
g CH(m,i)=min{g ch(m,i-1),g ch(m,i)},
Wherein, g CH(m, i) expression is through the gain coefficient of m the subband of i frame of correction; g Ch(m, i-1) gain coefficient of m the subband of i-1 frame that provide of expression subband gain calculation module; g Ch(m, i) gain coefficient of m the subband of i frame that provide of expression subband gain calculation module.
9. a system of eliminating the music noise is characterized in that, comprises music noise elimination apparatus and noise suppressor:
Said music noise elimination apparatus adopts each described device of claim 6~8;
The revised amplitude spectrum estimated parameter that said noise suppressor is used to utilize said music noise elimination apparatus to send carries out the squelch based on the amplitude spectrum estimation to said noisy speech signal.
10. system according to claim 9; It is characterized in that; Said noise suppressor also is used for sending to said music noise elimination apparatus the amplitude spectrum estimated parameter of unmodified; Said amplitude spectrum estimated parameter comprises: the noise energy of said noisy speech signal and the gain coefficient of said noisy speech signal.
CN200910215212A 2009-12-30 2009-12-30 Method, device and system for eliminating music noise Active CN102117618B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN200910215212A CN102117618B (en) 2009-12-30 2009-12-30 Method, device and system for eliminating music noise

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN200910215212A CN102117618B (en) 2009-12-30 2009-12-30 Method, device and system for eliminating music noise

Publications (2)

Publication Number Publication Date
CN102117618A CN102117618A (en) 2011-07-06
CN102117618B true CN102117618B (en) 2012-09-05

Family

ID=44216349

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200910215212A Active CN102117618B (en) 2009-12-30 2009-12-30 Method, device and system for eliminating music noise

Country Status (1)

Country Link
CN (1) CN102117618B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107123419A (en) * 2017-05-18 2017-09-01 北京大生在线科技有限公司 The optimization method of background noise reduction in the identification of Sphinx word speeds

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6182895B2 (en) * 2012-05-01 2017-08-23 株式会社リコー Processing apparatus, processing method, program, and processing system
CN103581447B (en) * 2012-08-02 2016-09-07 中兴通讯股份有限公司 A kind of method of signal transacting, device and electronic equipment
CN103745729B (en) * 2013-12-16 2017-01-04 深圳百科信息技术有限公司 A kind of audio frequency denoising method and system
CN108269567B (en) * 2018-01-23 2021-02-05 北京百度网讯科技有限公司 Method, apparatus, computing device, and computer-readable storage medium for generating far-field speech data
CN111192573B (en) * 2018-10-29 2023-08-18 宁波方太厨具有限公司 Intelligent control method for equipment based on voice recognition
CN109671309A (en) * 2018-12-12 2019-04-23 广东小天才科技有限公司 A kind of mistake pronunciation recognition methods and electronic equipment
CN110431625B (en) * 2019-06-21 2023-06-23 深圳市汇顶科技股份有限公司 Voice detection method, voice detection device, voice processing chip and electronic equipment
CN113902634A (en) * 2019-08-15 2022-01-07 深圳市瑞立视多媒体科技有限公司 Three-dimensional track smoothing method, device, equipment and storage medium
CN111031186A (en) * 2019-12-03 2020-04-17 苏宁云计算有限公司 Noise processing method, server and client

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1312938A (en) * 1997-09-02 2001-09-12 夸尔柯姆股份有限公司 System and method for reducing noise
CN1430778A (en) * 2001-03-28 2003-07-16 三菱电机株式会社 Noise suppressor
CN1520069A (en) * 1998-06-22 2004-08-11 Dspc�������޹�˾ Noise suppressor
CN101142800A (en) * 2004-04-23 2008-03-12 声学技术公司 Noise suppression based on bark band weiner filtering and modified doblinger noise estimate
CN101379548A (en) * 2006-02-10 2009-03-04 艾利森电话股份有限公司 A voice detector and a method for suppressing sub-bands in a voice detector

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1312938A (en) * 1997-09-02 2001-09-12 夸尔柯姆股份有限公司 System and method for reducing noise
CN1520069A (en) * 1998-06-22 2004-08-11 Dspc�������޹�˾ Noise suppressor
CN1430778A (en) * 2001-03-28 2003-07-16 三菱电机株式会社 Noise suppressor
CN101142800A (en) * 2004-04-23 2008-03-12 声学技术公司 Noise suppression based on bark band weiner filtering and modified doblinger noise estimate
CN101379548A (en) * 2006-02-10 2009-03-04 艾利森电话股份有限公司 A voice detector and a method for suppressing sub-bands in a voice detector

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107123419A (en) * 2017-05-18 2017-09-01 北京大生在线科技有限公司 The optimization method of background noise reduction in the identification of Sphinx word speeds

Also Published As

Publication number Publication date
CN102117618A (en) 2011-07-06

Similar Documents

Publication Publication Date Title
CN102117618B (en) Method, device and system for eliminating music noise
CN101770775B (en) Signal processing method and device
CN102074245B (en) Dual-microphone-based speech enhancement device and speech enhancement method
CN101599269B (en) Phonetic end point detection method and device therefor
US9978398B2 (en) Voice activity detection method and device
CN101770779B (en) Noise spectrum tracking in noisy acoustical signals
US9570072B2 (en) System and method for noise reduction in processing speech signals by targeting speech and disregarding noise
WO2018213565A2 (en) Artificial intelligence-based text-to-speech system and method
Ganapathy et al. Robust feature extraction using modulation filtering of autoregressive models
CN102216984B (en) Noise gate, sound collection device, and noise removal method
JP4740609B2 (en) Voiced and unvoiced sound detection apparatus and method
CN103325380A (en) Gain post-processing for signal enhancement
CN102629472B (en) Noise rejection apparatus and noise rejection method
EP3349213B1 (en) System and method for noise estimation with music detection
CN103380628A (en) Audio processing device, audio processing method, and program
KR20200043548A (en) Method and device for processing audio signal
US20080189100A1 (en) Method and System for Improving Speech Quality
US9349383B2 (en) Audio bandwidth dependent noise suppression
Joy et al. Deep Scattering Power Spectrum Features for Robust Speech Recognition.
Sharma et al. Speech denoising using different types of filters
Mallidi et al. Robust speaker recognition using spectro-temporal autoregressive models.
Kim et al. End-to-end multi-task denoising for the joint optimization of perceptual speech metrics
Nouza et al. Adding controlled amount of noise to improve recognition of compressed and spectrally distorted speech
CN113205826B (en) LC3 audio noise elimination method, device and storage medium
KR102238429B1 (en) Sporadic noise detecting apparatus

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant