CN106816156A - Method and device for enhancing audio quality - Google Patents

Method and device for enhancing audio quality

Info

Publication number
CN106816156A
Authority
CN
China
Prior art keywords
audio signal
signal
audio
treatment
carried out
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710064271.7A
Other languages
Chinese (zh)
Other versions
CN106816156B (en
Inventor
张晨
张兴涛
孙学京
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Tuoling Inc
Original Assignee
Beijing Tuoling Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Tuoling Inc
Priority to CN201710064271.7A (patent/CN106816156B/en)
Publication of CN106816156A (patent/CN106816156A/en)
Application granted
Publication of CN106816156B (patent/CN106816156B/en)
Legal status: Active (current)
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Stereophonic System (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)

Abstract

The application relates to a method and device for enhancing audio quality. The method includes: acquiring an audio signal in a preset format; preprocessing the audio signal, where the preprocessing includes calculating the average signal of the individual channels of the audio signal and/or performing beamforming on the audio signal; and, based on the signal obtained by the preprocessing, performing noise suppression on the audio signal to obtain a quality-enhanced audio signal. The method and device provided by the application can effectively improve the audio quality of a three-dimensional microphone array.

Description

Method and device for enhancing audio quality
Technical field
The application relates to the field of audio signal processing, and in particular to a method and device for enhancing audio quality.
Background
With the development of science and technology, the demand for audio quality keeps rising in every field, and the focus of audio research has gradually moved from the original single channel (mono) to stereo, surround sound, and 3D (three-dimensional) audio. Unlike mono audio, multichannel audio is usually captured with a microphone array. For 3D audio, a three-dimensional microphone array is generally used so that sound from all directions can be picked up. Such an array can capture three-dimensional information about the signal: the horizontal azimuth, the vertical azimuth, and the distance between the sound source and the reference point of the microphone array.
In the prior art, audio enhancement techniques for linear and planar microphone arrays already work well. For three-dimensional microphone arrays, however, the prior art cannot yet achieve effective audio enhancement.
Summary of the invention
The purpose of the application is to provide a method and device for enhancing audio quality that can effectively improve the audio quality of a three-dimensional microphone array.
To achieve the above purpose, one aspect of the application provides a method for enhancing audio quality, the method including: acquiring an audio signal in a preset format; preprocessing the audio signal, where the preprocessing includes calculating the average signal of the individual channels of the audio signal and/or performing beamforming on the audio signal; and, based on the signal obtained by the preprocessing, performing noise suppression on the audio signal to obtain a quality-enhanced audio signal.
Further, when the preprocessing consists of calculating the average signal of the individual channels of the audio signal, the step of performing noise suppression on the audio signal based on the signal obtained by the preprocessing specifically includes: determining, from the average signal, the noise energy spectrum and the signal energy spectrum corresponding to the audio signal; and performing noise suppression on the audio signal according to the noise energy spectrum and the signal energy spectrum to obtain a quality-enhanced audio signal.
Further, when the preprocessing consists of performing beamforming on the audio signal, the step of performing noise suppression on the audio signal based on the signal obtained by the preprocessing specifically includes: applying an inner product to the audio signal with a first steering vector and with a second steering vector pointing in the opposite direction to the first steering vector, to obtain a first channel audio signal and a second channel audio signal respectively, where the first steering vector extracts the audio signal of a preset direction from the audio signal; determining, from the first channel audio signal and the second channel audio signal, the noise energy spectrum and the signal energy spectrum corresponding to the audio signal; and performing noise suppression on the first channel audio signal according to the noise energy spectrum and the signal energy spectrum to obtain a quality-enhanced audio signal.
Further, when the preprocessing consists of both calculating the average signal of the individual channels of the audio signal and performing beamforming on the audio signal, the step of performing noise suppression on the audio signal based on the signal obtained by the preprocessing specifically includes: applying an inner product to the audio signal with a first steering vector to obtain the inner-product-processed audio signal, where the first steering vector extracts the audio signal of a preset direction from the audio signal; determining, from the average signal, the noise energy spectrum and the signal energy spectrum corresponding to the audio signal; and performing noise suppression on the inner-product-processed audio signal according to the noise energy spectrum and the signal energy spectrum to obtain a quality-enhanced audio signal.
Further, when the preprocessing consists of both calculating the average signal of the individual channels of the audio signal and performing beamforming on the audio signal, the step of performing noise suppression on the audio signal based on the signal obtained by the preprocessing may alternatively include: applying an inner product to the audio signal with a first steering vector and with a second steering vector pointing in the opposite direction to the first steering vector, to obtain a first channel audio signal and a second channel audio signal, where the first steering vector extracts the audio signal of a preset direction from the audio signal; determining, from the average signal, the first channel audio signal, and the second channel audio signal, the noise suppression factor corresponding to the audio signal; and performing noise suppression on the first channel audio signal according to the noise suppression factor to obtain a quality-enhanced audio signal.
Further, before the audio signal is preprocessed, the method also includes: acquiring sound field parameters of the audio signal, the sound field parameters including at least one of sound source direction, sound source power, and sound source diffuseness.
Further, estimating the noise energy spectrum corresponding to the audio signal specifically includes: comparing the sound source power of the Z signal in the sound field parameters with a first threshold; when the power of the Z signal is greater than the first threshold, estimating the noise energy spectrum with a smoothing factor whose value is less than a second threshold; and when the power of the Z signal is less than or equal to the first threshold, estimating the noise energy spectrum with a smoothing factor whose value is greater than or equal to the second threshold.
Further, performing beamforming on the audio signal specifically includes: determining a target steering vector from the sound source direction in the sound field parameters; and applying an inner product of the target steering vector and the audio signal to obtain the beamformed audio signal.
Further, performing noise suppression on the audio signal based on the signal obtained by the preprocessing specifically includes: determining, from the sound source diffuseness in the sound field parameters, an adjustment factor for the noise suppression; and performing noise suppression on the audio signal according to the determined adjustment factor.
Further, determining the adjustment factor for the noise suppression from the sound source diffuseness in the sound field parameters specifically includes: comparing the sound source diffuseness with a third threshold; when the sound source diffuseness is greater than the third threshold, choosing an adjustment factor whose value is greater than a fourth threshold; and when the sound source diffuseness is less than or equal to the third threshold, choosing an adjustment factor whose value is less than or equal to the fourth threshold.
To achieve the above purpose, another aspect of the application provides a method for enhancing audio quality, the method including: acquiring an audio signal in a preset format; and performing beamforming on the audio signal to obtain a quality-enhanced audio signal.
Further, the beamforming specifically includes: applying an inner product of the steering vector of a preset direction and the audio signal to obtain the audio signal enhanced in the preset direction.
Further, before the audio signal is preprocessed, the method also includes: acquiring sound field parameters of the audio signal, the sound field parameters including at least one of sound source direction, sound source power, and sound source diffuseness.
Further, performing beamforming on the audio signal specifically includes: determining a target steering vector from the sound source direction in the sound field parameters; and applying an inner product of the target steering vector and the audio signal to obtain the audio signal enhanced in the target direction.
To achieve the above purpose, another aspect of the application provides a device for enhancing audio quality, the device including: an audio signal acquiring unit for acquiring an audio signal in a preset format; a preprocessing unit for preprocessing the audio signal, where the preprocessing includes calculating the average signal of the individual channels of the audio signal and/or performing beamforming on the audio signal; and a noise suppression unit for performing noise suppression on the audio signal, based on the signal obtained by the preprocessing, to obtain a quality-enhanced audio signal.
The method and device for enhancing audio quality proposed in the embodiments of the invention can apply audio enhancement to a signal in a preset format, and can further combine sound field parameters (sound source direction, sound source power, and sound source diffuseness) with the noise suppression and the beamforming, which effectively improves audio quality and achieves the expected effect.
Brief description of the drawings
Fig. 1 is a flowchart of a method for enhancing audio quality in one embodiment of the application;
Fig. 2 is a schematic diagram of the four channel audio signals in one embodiment of the application;
Fig. 3 is a flowchart of a method for enhancing audio quality in another embodiment of the application;
Fig. 4 is a flowchart of a method for enhancing audio quality in another embodiment of the application;
Fig. 5 is a flowchart of a method for enhancing audio quality in another embodiment of the application;
Fig. 6 is a flowchart of a method for enhancing audio quality in another embodiment of the application;
Fig. 7 is a flowchart of a method for enhancing audio quality in another embodiment of the application;
Fig. 8 is a functional block diagram of a device for enhancing audio quality in one embodiment of the application.
Detailed description
To help those skilled in the art better understand the technical solutions of the application, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the application. All other embodiments obtained by a person of ordinary skill in the art on the basis of these embodiments without creative effort shall fall within the scope of protection of the application.
Referring to Fig. 1, an embodiment of the application provides a method for enhancing audio quality, the method including the following steps.
S1: Acquire an audio signal in a preset format.
In this embodiment, the audio signal in the preset format may be an Ambisonic A-format audio signal, which consists of four channels (LFU, RFD, LBD, RBU). The four channel signals may be as shown in Fig. 2.
S2: Preprocess the audio signal, where the preprocessing includes calculating the average signal of the individual channels of the audio signal and/or performing beamforming on the audio signal.
In this embodiment, the Ambisonic A-format audio signal may be preprocessed; the purpose of the preprocessing is to enhance the audio signal. Specifically, the preprocessing may include calculating the average signal of the individual channels of the audio signal and/or performing beamforming on the audio signal.
The average signal $x_{ave}(n)$ of the individual channels of the audio signal is:
$$x_{ave}(n) = \frac{1}{4}\sum_{i=1}^{4} x_i(n), \quad n = 0, 1, \ldots, L-1$$
where $n$ is the index of the sample in the time-domain audio signal, $L$ is the frame length used in audio signal processing, and $x_i(n)$ is the time-domain signal of the $i$-th audio channel.
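The averaging step can be sketched in a few lines of Python. This is a minimal illustration that assumes each channel of a frame is a plain list of `L` floating-point samples; the function name `average_signal` is hypothetical and does not come from the patent.

```python
def average_signal(channels):
    """Return the sample-wise mean of equally long channel frames."""
    if not channels:
        raise ValueError("at least one channel is required")
    frame_len = len(channels[0])
    if any(len(ch) != frame_len for ch in channels):
        raise ValueError("all channels must share the same frame length L")
    n_ch = len(channels)
    # x_ave(n) = (1 / n_ch) * sum_i x_i(n) for n = 0 .. L-1
    return [sum(ch[n] for ch in channels) / n_ch for n in range(frame_len)]


# Four A-format channels (LFU, RFD, LBD, RBU) with frame length L = 3.
frame = [
    [1.0, 2.0, 3.0],
    [3.0, 2.0, 1.0],
    [0.0, 0.0, 0.0],
    [4.0, 4.0, 4.0],
]
x_ave = average_signal(frame)
```

In a real system the same computation would run once per frame on the incoming sample buffers.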
The beamformed signal $x_{bf}(n)$ is:
$$x_{bf}(n) = \sum_{i=1}^{4} p_i(\theta)\, x_i(n)$$
where $\theta$ is the azimuth in the range [0, 360] and $p_i(\theta)$ is the steering vector for direction $\theta$.
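The inner product between a steering vector and the four channels might look like the sketch below. The patent does not specify how $p_i(\theta)$ is computed, so the capsule azimuths and the cardioid-style weighting used here are assumptions made purely for illustration, as are the names `steering_vector` and `beamform`.

```python
import math

# Assumed capsule azimuths in degrees for LFU, RFD, LBD, RBU (illustrative only).
CAPSULE_AZIMUTHS = (45.0, 315.0, 135.0, 225.0)


def steering_vector(theta_deg):
    """Hypothetical cardioid-style weights p_i(theta), one per capsule."""
    return [
        0.25 * (1.0 + math.cos(math.radians(theta_deg - az)))
        for az in CAPSULE_AZIMUTHS
    ]


def beamform(channels, theta_deg):
    """Compute x_bf(n) = sum_i p_i(theta) * x_i(n) for every sample n."""
    p = steering_vector(theta_deg)
    frame_len = len(channels[0])
    return [
        sum(p_i * ch[n] for p_i, ch in zip(p, channels))
        for n in range(frame_len)
    ]


# A unit impulse on the LFU capsule only, steered toward that capsule.
impulse = [[1.0], [0.0], [0.0], [0.0]]
toward = beamform(impulse, 45.0)
away = beamform(impulse, 225.0)
```

With these assumed weights the beam passes the LFU impulse at half amplitude when steered toward it and rejects it when steered in the opposite direction.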
When estimating the noise energy spectrum corresponding to the audio signal, the average signal of the individual channels of the audio signal can first be calculated, and a smoothing factor for estimating the noise energy spectrum can then be determined from the average signal. The smoothing factor can, for example, be expressed by the following formula:
$$\alpha_s(\lambda, k) = \alpha_d + (1 - \alpha_d)\, p(\lambda, k)$$
where $\lambda$ is the index of the audio frame within the audio signal, $k$ is the index of the frequency bin, $\alpha_s(\lambda, k)$ is the smoothing factor at the given audio frame and frequency bin, $\alpha_d$ is a fixed smoothing factor with a value of 0.85, and $p(\lambda, k)$ is the average signal at the given audio frame and frequency bin. Different audio frames and frequency bins can thus have different smoothing factors, each determined by the average signal.
In this embodiment, the noise energy spectrum corresponding to the audio signal can be estimated according to the smoothing factor. Specifically, the formula for estimating the noise energy spectrum can be as follows:
$$D(\lambda, k) = \alpha_s(\lambda, k)\, D(\lambda - 1, k) + \bigl(1 - \alpha_s(\lambda, k)\bigr)\, |Y(\lambda, k)|^2$$
where $D(\lambda, k)$ is the estimated noise energy spectrum at the given audio frame and frequency bin, and $Y(\lambda, k)$ is the audio amplitude at the given audio frame and frequency bin.
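The two formulas above amount to a first-order recursive average per frequency bin. The following sketch applies one frame of that update. Clamping `p` to the interval [0, 1] is an assumption added here to keep the smoothing factor between `ALPHA_D` and 1; the function names are hypothetical.

```python
ALPHA_D = 0.85  # fixed smoothing factor stated in the text


def smoothing_factor(p, alpha_d=ALPHA_D):
    """alpha_s = alpha_d + (1 - alpha_d) * p, with p clamped to [0, 1] (assumption)."""
    p = min(max(p, 0.0), 1.0)
    return alpha_d + (1.0 - alpha_d) * p


def update_noise_spectrum(prev_noise, frame_spectrum, p_values):
    """Apply D(l, k) = a * D(l-1, k) + (1 - a) * |Y(l, k)|^2 to every bin k."""
    updated = []
    for d_prev, y, p in zip(prev_noise, frame_spectrum, p_values):
        a = smoothing_factor(p)
        updated.append(a * d_prev + (1.0 - a) * abs(y) ** 2)
    return updated


# Two bins with the same previous estimate and the same new observation,
# but different values of p.
noise = update_noise_spectrum([1.0, 1.0], [2 + 0j, 2 + 0j], [0.0, 1.0])
```

With `p = 0` the estimate moves toward the new power value, and with `p = 1` the previous estimate is kept unchanged.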
In this embodiment, referring to Fig. 3, beamforming can also be performed on the audio signal. Specifically, an inner product of the steering vector of a preset direction and the audio signal can be computed, which enhances the audio signal in the preset direction. In this way the sound source in a specific direction can be effectively enhanced.
In one embodiment of the application, referring to Fig. 4, the noise energy spectrum can be estimated with the help of sound field parameters. Specifically, the sound field parameters of the audio signal can be acquired; they include at least one of sound source direction (sound location), sound source power (sound power), and sound source diffuseness (sound diffusivity). The sound field parameters can be obtained by a direction of arrival (DOA) method.
In this embodiment, the smoothing factor can take different values for different audio frames and frequency bins, so the smoothing factor actually used can be determined by comparing the sound source power of the Z signal in the sound field parameters with a first threshold. Specifically, when the power of the Z signal is greater than the first threshold, the noise energy spectrum corresponding to the audio signal is estimated with a smoothing factor whose value is less than a second threshold; when the power of the Z signal is less than or equal to the first threshold, the noise energy spectrum is estimated with a smoothing factor whose value is greater than or equal to the second threshold. If several smoothing factors are less than the second threshold, any one of them can be used for the estimate. Likewise, if several smoothing factors are greater than or equal to the second threshold, any one of them can be used. Specifically, the first threshold lies in the range [0.3, 0.6] and the second threshold lies in the range [0.05, 0.4].
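The Z signal is obtained from the four channel signals by means of a transformation matrix $A$:
$$Z(n) = A\,[x_1(n),\ x_2(n),\ x_3(n),\ x_4(n)]^T = \sum_{i=1}^{4} a_{1i}\, x_i(n)$$
where the transformation matrix is $A = [a_{11}\ a_{12}\ a_{13}\ a_{14}]$, and its elements $a_{11}, a_{12}, \ldots, a_{14}$ are constants determined by the sound source scenario.
The energy of the Z signal is: (formula not reproduced in the source text)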
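The Z-signal computation and the threshold test can be sketched together as follows. Several things here are assumptions rather than details from the patent: the row of `A` follows the common A-format to B-format convention (up-facing capsules minus down-facing capsules), the power is taken as the mean square of the frame, and the two threshold values are arbitrary picks from the stated ranges.

```python
# Assumed Z row of an A-to-B conversion for the channel order LFU, RFD, LBD, RBU.
# The patent only says the elements of A are scenario-dependent constants.
A_ROW = (0.5, -0.5, -0.5, 0.5)

FIRST_THRESHOLD = 0.45   # assumed value inside the stated range [0.3, 0.6]
SECOND_THRESHOLD = 0.2   # assumed value inside the stated range [0.05, 0.4]


def z_signal(channels, a_row=A_ROW):
    """Z(n) = sum_i a_1i * x_i(n) for every sample n of the frame."""
    frame_len = len(channels[0])
    return [
        sum(a * ch[n] for a, ch in zip(a_row, channels))
        for n in range(frame_len)
    ]


def z_power(z):
    """Mean-square value of the frame (assumed definition of the Z-signal power)."""
    return sum(v * v for v in z) / len(z)


def pick_smoothing_factor(power, small=0.1, large=0.3):
    """Use a factor below SECOND_THRESHOLD when the Z power exceeds
    FIRST_THRESHOLD, otherwise a factor at or above SECOND_THRESHOLD."""
    return small if power > FIRST_THRESHOLD else large


loud = [[1.0, 1.0], [0.0, 0.0], [0.0, 0.0], [1.0, 1.0]]
z = z_signal(loud)
chosen = pick_smoothing_factor(z_power(z))
```

The default values `small` and `large` are placeholders; any values on the correct side of the second threshold would satisfy the rule described in the text.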
In this embodiment, referring to Fig. 5, beamforming can also be performed with the help of the sound field parameters. Specifically, a target steering vector can be determined adaptively from the sound source direction in the sound field parameters, and an inner product of the target steering vector and the audio signal then yields the beamformed audio signal.
S3: Based on the signal obtained by the preprocessing, perform noise suppression on the audio signal to obtain a quality-enhanced audio signal.
In this embodiment, after the audio signal has been preprocessed, noise suppression can be applied to it to obtain a quality-enhanced audio signal. Specifically, noise suppression can be performed by spectral subtraction or by Wiener filtering, both of which can be implemented in the frequency domain. The noise suppression can be carried out over the full band or in subbands.
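Both suppression methods can be expressed as a real-valued gain applied to each frequency bin. The sketch below uses the textbook power spectral subtraction gain and a Wiener-style gain computed from the observed power and the noise estimate. The gain floor and the function names are assumptions for illustration; the patent does not give these details.

```python
import math


def spectral_subtraction_gain(signal_power, noise_power, floor=0.05):
    """Power spectral subtraction: G = sqrt(max(1 - D / |Y|^2, 0)), floored."""
    if signal_power <= 0.0:
        return floor
    gain_sq = max(1.0 - noise_power / signal_power, 0.0)
    return max(math.sqrt(gain_sq), floor)


def wiener_gain(signal_power, noise_power, floor=0.05):
    """Wiener-style gain: G = max(1 - D / |Y|^2, floor)."""
    if signal_power <= 0.0:
        return floor
    return max(1.0 - noise_power / signal_power, floor)


def suppress(spectrum, noise_spectrum, gain_fn):
    """Scale every bin Y(k) by the gain computed from |Y(k)|^2 and D(k)."""
    return [
        gain_fn(abs(y) ** 2, d) * y
        for y, d in zip(spectrum, noise_spectrum)
    ]


cleaned = suppress([2 + 0j, 1 + 0j], [1.0, 4.0], wiener_gain)
```

The floor keeps a small residual of the original bin instead of zeroing it, a common way to reduce musical-noise artifacts. The same `suppress` helper works per subband if the spectrum is split beforehand.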
In this embodiment, referring to Fig. 6, noise suppression can be performed after the audio signal has been beamformed. Specifically, inner products of the audio signal with a first steering vector and with a second steering vector pointing in the opposite direction to the first steering vector can be computed, yielding a first channel audio signal and a second channel audio signal respectively, where the first steering vector extracts the audio signal of a preset direction from the audio signal. The first channel audio signal and the second channel audio signal can then each be transformed into a frequency-domain signal, and noise suppression can be performed in the frequency domain.
Specifically, the beamforming is:
$$x_{bf}(n) = \sum_{i=1}^{4} p_i(\theta)\, x_i(n)$$
where $\theta$ is the azimuth in the range [0, 360], $p_i(\theta)$ is the steering vector for direction $\theta$, and $x_i(n)$ is the time-domain signal of the $i$-th audio channel.
The time-domain signal can be transformed into a frequency-domain signal using the discrete Fourier transform (DFT), the fast Fourier transform (FFT), or the modified discrete cosine transform (MDCT).
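As a rough illustration of the frequency-domain step, the sketch below transforms two already-beamformed frames with a naive DFT and derives a per-bin gain. Treating the power of the opposite-direction beam as the noise estimate is an assumption made for this example; the patent only states that the noise and signal energy spectra are determined from the two beams. A production system would use an FFT routine rather than this O(N²) DFT.

```python
import cmath


def dft(frame):
    """Naive discrete Fourier transform of one real or complex frame."""
    n_len = len(frame)
    return [
        sum(
            frame[n] * cmath.exp(-2j * cmath.pi * k * n / n_len)
            for n in range(n_len)
        )
        for k in range(n_len)
    ]


def opposite_beam_gains(front, rear, floor=0.1):
    """Per-bin gain that treats the rear-beam power as the noise estimate
    (assumption) and the front-beam power as the observed signal power."""
    gains = []
    for y_front, y_rear in zip(dft(front), dft(rear)):
        sig_pow = abs(y_front) ** 2
        noise_pow = abs(y_rear) ** 2
        if sig_pow <= 0.0:
            gains.append(floor)
        else:
            gains.append(max(1.0 - noise_pow / sig_pow, floor))
    return gains


# Front beam: unit impulse. Rear beam: the same impulse at half amplitude.
gains = opposite_beam_gains([1.0, 0.0, 0.0, 0.0], [0.5, 0.0, 0.0, 0.0])
flat = dft([1.0, 1.0, 1.0, 1.0])
```

The resulting gains would be multiplied onto the front-beam spectrum before transforming back to the time domain.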
It should be noted that an embodiment of the application may also apply only beamforming to the audio signal. Specifically, an embodiment of the application provides a method for enhancing audio quality, the method including:
acquiring an audio signal in a preset format;
performing beamforming on the audio signal, where the beamforming specifically includes:
applying an inner product of the steering vector of a preset direction and the audio signal, so as to enhance the audio signal in the preset direction.
Referring to Fig. 7, noise suppression can of course also be performed with the help of the sound field parameters. Specifically, inner products of the audio signal with a first steering vector and with a second steering vector pointing in the opposite direction to the first steering vector can be computed, yielding a first channel audio signal and a second channel audio signal respectively, where the first steering vector extracts the audio signal of a preset direction from the audio signal. The first channel audio signal and the second channel audio signal can then each be transformed into a frequency-domain signal, an adjustment factor for the noise suppression can be determined from the sound source diffuseness in the sound field parameters, and noise suppression can finally be performed on the audio signal according to the determined adjustment factor. Specifically, in the step of determining the adjustment factor from the sound source diffuseness, the sound source diffuseness can be compared with a third threshold: when the sound source diffuseness is greater than the third threshold, an adjustment factor whose value is greater than a fourth threshold is chosen; when the sound source diffuseness is less than or equal to the third threshold, an adjustment factor whose value is less than or equal to the fourth threshold is chosen. Specifically, the third threshold lies in the range [0.3, 0.5] and the fourth threshold lies in the range [0.05, 0.5].
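The threshold rule for the adjustment factor can be sketched as below. The threshold values are arbitrary picks from the stated ranges, and the way the factor enters the gain (scaling the noise estimate before subtraction) is a hypothetical choice, since the patent does not say how the adjustment factor is applied.

```python
THIRD_THRESHOLD = 0.4    # assumed value inside the stated range [0.3, 0.5]
FOURTH_THRESHOLD = 0.3   # assumed value inside the stated range [0.05, 0.5]


def adjustment_factor(diffuseness, above=0.45, at_or_below=0.2):
    """Pick a factor above FOURTH_THRESHOLD for diffuse sound fields and a
    factor at or below it otherwise."""
    return above if diffuseness > THIRD_THRESHOLD else at_or_below


def suppress_bin(y, noise_power, factor, floor=0.0):
    """Hypothetical use of the factor: scale the noise estimate in the gain."""
    sig_pow = abs(y) ** 2
    if sig_pow == 0.0:
        return y
    gain = max(1.0 - factor * noise_power / sig_pow, floor)
    return gain * y


diffuse = adjustment_factor(0.8)
directional = adjustment_factor(0.4)
out = suppress_bin(2 + 0j, 1.0, directional)
```

Under this assumed usage, a more diffuse sound field leads to a larger factor and therefore stronger suppression of the bin.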
Referring to Fig. 8, an embodiment of the application also provides a device for enhancing audio quality, the device including:
an audio signal acquiring unit 100 for acquiring an audio signal in a preset format;
a preprocessing unit 200 for preprocessing the audio signal, where the preprocessing includes calculating the average signal of the individual channels of the audio signal and/or performing beamforming on the audio signal;
a noise suppression unit 300 for performing noise suppression on the audio signal, based on the signal obtained by the preprocessing, to obtain a quality-enhanced audio signal.
In one embodiment of the application, the preprocessing unit 200 specifically includes:
an average signal calculation module for calculating the average signal of the individual channels of the audio signal;
a smoothing factor determination module for determining, from the average signal, the smoothing factor used to estimate the noise energy spectrum;
an estimation module for estimating the noise energy spectrum corresponding to the audio signal according to the smoothing factor.
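The three units of the device can be pictured as a simple pipeline. The sketch below is only a structural illustration: the class name `AudioEnhancer`, its field names, and the stand-in callables are all hypothetical, and the stand-in suppression step is not the patent's algorithm.

```python
from dataclasses import dataclass
from typing import Callable, List

Frame = List[List[float]]  # one list of samples per channel


@dataclass
class AudioEnhancer:
    acquire: Callable[[], Frame]                           # unit 100
    preprocess: Callable[[Frame], List[float]]             # unit 200
    suppress: Callable[[Frame, List[float]], List[float]]  # unit 300

    def run(self) -> List[float]:
        """Acquire a frame, preprocess it, then apply noise suppression."""
        frame = self.acquire()
        reference = self.preprocess(frame)
        return self.suppress(frame, reference)


def mean_of_channels(frame: Frame) -> List[float]:
    """Stand-in preprocessing: sample-wise average over all channels."""
    return [sum(samples) / len(frame) for samples in zip(*frame)]


def subtract_half_reference(frame: Frame, reference: List[float]) -> List[float]:
    """Stand-in suppression: first channel minus half of the reference."""
    return [x - 0.5 * r for x, r in zip(frame[0], reference)]


enhancer = AudioEnhancer(
    acquire=lambda: [[1.0, 3.0], [3.0, 1.0]],
    preprocess=mean_of_channels,
    suppress=subtract_half_reference,
)
result = enhancer.run()
```

Keeping the units as interchangeable callables mirrors the "and/or" structure of the preprocessing: an averaging step, a beamforming step, or both can be plugged in without changing the pipeline.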
The method and device for enhancing audio quality proposed in the embodiments of the invention can apply audio enhancement to a signal in a preset format, and can further combine sound field parameters (sound source direction, sound source power, and sound source diffuseness) with the noise suppression and the beamforming, which effectively improves audio quality and achieves the expected effect.
The above description of the various embodiments of the application is provided to those skilled in the art for the purpose of illustration. It is not intended to be exhaustive or to limit the invention to a single disclosed embodiment. As described above, various alternatives and variations of the application will be apparent to those of ordinary skill in the art. Therefore, although some alternative embodiments have been discussed specifically, other embodiments will be apparent to, or can be derived relatively easily by, those skilled in the art. The application is intended to cover all alternatives, modifications, and variations of the invention discussed here, as well as other embodiments that fall within the spirit and scope of the application described above.

Claims (10)

1. A method for enhancing audio quality, characterized in that the method includes:
acquiring an audio signal in a preset format;
preprocessing the audio signal, where the preprocessing includes calculating the average signal of the individual channels of the audio signal and/or performing beamforming on the audio signal;
based on the signal obtained by the preprocessing, performing noise suppression on the audio signal to obtain a quality-enhanced audio signal.
2. The method for enhancing audio quality according to claim 1, characterized in that, when the preprocessing consists of calculating the average signal of the individual channels of the audio signal, the step of performing noise suppression on the audio signal based on the signal obtained by the preprocessing specifically includes:
determining, from the average signal, the noise energy spectrum and the signal energy spectrum corresponding to the audio signal;
performing noise suppression on the audio signal according to the noise energy spectrum and the signal energy spectrum to obtain a quality-enhanced audio signal;
and when the preprocessing consists of performing beamforming on the audio signal, the step of performing noise suppression on the audio signal based on the signal obtained by the preprocessing specifically includes:
applying an inner product to the audio signal with a first steering vector and with a second steering vector pointing in the opposite direction to the first steering vector, to obtain a first channel audio signal and a second channel audio signal respectively, where the first steering vector extracts the audio signal of a preset direction from the audio signal;
determining, from the first channel audio signal and the second channel audio signal, the noise energy spectrum and the signal energy spectrum corresponding to the audio signal;
performing noise suppression on the first channel audio signal according to the noise energy spectrum and the signal energy spectrum to obtain a quality-enhanced audio signal.
3. The method for enhancing audio quality according to claim 1, characterized in that, when the preprocessing is calculating the average signal of the channels of the audio signal and performing beamforming on the audio signal, the step of performing noise suppression on the audio signal based on the signal obtained by the preprocessing specifically comprises:
performing inner-product processing on the audio signal using a first steering vector, to obtain the audio signal after the inner-product processing; wherein the audio signal of a preset direction within the audio signal is obtainable according to the first steering vector;
determining, according to the average signal, a noise energy spectrum and a signal energy spectrum corresponding to the audio signal;
performing noise suppression on the audio signal after the inner-product processing according to the noise energy spectrum and the signal energy spectrum, to obtain the quality-enhanced audio signal.
4. The method for enhancing audio quality according to claim 1, characterized in that the step of performing noise suppression on the audio signal based on the signal obtained by the preprocessing specifically comprises:
performing inner-product processing on the audio signal using a first steering vector and a second steering vector pointing in the direction opposite to the first steering vector, to obtain a first audio signal and a second audio signal after the inner-product processing; wherein the audio signal of a preset direction within the audio signal is obtainable according to the first steering vector;
determining, according to the average signal and the first audio signal and the second audio signal after the inner-product processing, a noise suppression factor corresponding to the audio signal;
performing noise suppression on the first audio signal after the inner-product processing according to the noise suppression factor, to obtain the quality-enhanced audio signal.
5. The method for enhancing audio quality according to claim 1, characterized in that, before the audio signal is preprocessed, the method further comprises:
obtaining sound field parameters of the audio signal, the sound field parameters comprising at least one of a sound source direction, a sound source power, and a sound source divergence;
correspondingly, estimating the noise energy spectrum corresponding to the audio signal specifically comprises:
comparing the sound source power of the Z signal in the sound field parameters with a first threshold, and, when the sound source power of the Z signal in the sound field parameters is greater than the first threshold, estimating the noise energy spectrum corresponding to the audio signal using a smoothing factor whose value is less than a second threshold;
when the sound source power of the Z signal in the sound field parameters is less than or equal to the first threshold, estimating the noise energy spectrum corresponding to the audio signal using a smoothing factor whose value is greater than or equal to the second threshold;
correspondingly, performing beamforming on the audio signal specifically comprises:
determining a target steering vector according to the sound source direction in the sound field parameters;
performing inner-product processing on the target steering vector and the audio signal, to obtain a beamformed audio signal;
correspondingly, performing noise suppression on the audio signal based on the signal obtained by the preprocessing specifically comprises:
determining, according to the sound source divergence in the sound field parameters, an adjustment factor for the noise suppression;
performing noise suppression on the audio signal according to the determined adjustment factor.
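The threshold-dependent choice of smoothing factor for the noise energy spectrum can be sketched as below. The claim fixes only the comparison logic; the recursive averaging form (in which the smoothing factor weights the newest frame), the concrete factor values, and the names `pick_smoothing_factor` and `update_noise_spectrum` are assumptions for illustration, and the second threshold is assumed to be positive.

```python
def pick_smoothing_factor(z_power, first_threshold, second_threshold):
    """Choose a smoothing factor from the Z-signal sound source power."""
    if z_power > first_threshold:
        # Any value strictly below the second threshold satisfies the
        # claim; half of it is an arbitrary illustrative choice.
        return 0.5 * second_threshold
    # Any value at or above the second threshold satisfies the claim.
    return second_threshold


def update_noise_spectrum(previous, frame_energy, smoothing):
    """Recursive average (assumed form): smoothing weights the new frame."""
    return [
        (1.0 - smoothing) * old + smoothing * new
        for old, new in zip(previous, frame_energy)
    ]


# Hypothetical thresholds and two-bin energy spectra.
FIRST_THRESHOLD = 1.0
SECOND_THRESHOLD = 0.5
previous_noise = [1.0, 1.0]
frame_energy = [5.0, 3.0]

strong = pick_smoothing_factor(2.0, FIRST_THRESHOLD, SECOND_THRESHOLD)
weak = pick_smoothing_factor(0.5, FIRST_THRESHOLD, SECOND_THRESHOLD)
noise_when_strong = update_noise_spectrum(previous_noise, frame_energy, strong)
noise_when_weak = update_noise_spectrum(previous_noise, frame_energy, weak)
```

Under this assumed update form, a high Z-signal power yields a smaller factor, so the noise estimate follows the current frame more slowly than it does when the power is low.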
6. The method for enhancing audio quality according to claim 5, characterized in that determining, according to the sound source divergence in the sound field parameters, the adjustment factor for the noise suppression specifically comprises:
comparing the sound source divergence in the sound field parameters with a third threshold, and, when the sound source divergence is greater than the third threshold, determining an adjustment factor whose value is greater than a fourth threshold;
when the sound source divergence in the sound field parameters is less than or equal to the third threshold, determining an adjustment factor whose value is less than or equal to the fourth threshold.
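A minimal sketch of this selection follows. The claim specifies only which side of the fourth threshold the adjustment factor falls on; how the factor enters the suppression rule is not stated, so its use here as an over-subtraction weight in a floored spectral subtraction gain is an assumption, as are the names `pick_adjustment_factor` and `suppression_gain` and all numeric values. The fourth threshold is assumed to be positive.

```python
def pick_adjustment_factor(divergence, third_threshold, fourth_threshold):
    """Choose the adjustment factor from the sound source divergence."""
    if divergence > third_threshold:
        # Any value strictly above the fourth threshold satisfies the
        # claim; doubling it is an arbitrary illustrative choice.
        return 2.0 * fourth_threshold
    # Any value at or below the fourth threshold satisfies the claim.
    return fourth_threshold


def suppression_gain(signal_energy, noise_energy, factor, gain_floor=0.1):
    """Floored spectral subtraction gain with the factor as a weight."""
    if signal_energy <= 0.0:
        return gain_floor
    return max(1.0 - factor * noise_energy / signal_energy, gain_floor)


# Hypothetical thresholds and energies for a single frequency bin.
THIRD_THRESHOLD = 0.5
FOURTH_THRESHOLD = 1.0

diffuse_factor = pick_adjustment_factor(0.8, THIRD_THRESHOLD, FOURTH_THRESHOLD)
focused_factor = pick_adjustment_factor(0.2, THIRD_THRESHOLD, FOURTH_THRESHOLD)
diffuse_gain = suppression_gain(4.0, 1.0, diffuse_factor)
focused_gain = suppression_gain(4.0, 1.0, focused_factor)
```

With this assumed rule, a more divergent sound field receives a larger factor and therefore stronger attenuation for the same signal and noise energies.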
7. A method for enhancing audio quality, characterized in that the method comprises:
obtaining an audio signal in a preset format;
performing beamforming on the audio signal, to obtain a quality-enhanced audio signal.
8. The method for enhancing audio quality according to claim 7, characterized in that the beamforming specifically comprises:
performing inner-product processing on a steering vector of a preset direction and the audio signal, to obtain an audio signal enhanced in the preset direction.
9. The method for enhancing audio quality according to claim 7, characterized in that, before beamforming is performed on the audio signal, the method further comprises:
obtaining sound field parameters of the audio signal, the sound field parameters comprising at least one of a sound source direction, a sound source power, and a sound source divergence;
correspondingly, performing beamforming on the audio signal specifically comprises:
determining a target steering vector according to the sound source direction in the sound field parameters;
performing inner-product processing on the target steering vector and the audio signal, to obtain an audio signal enhanced in the target direction.
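The beamforming step can be illustrated with the sketch below. It assumes a four-channel, first-order signal ordered as (W, X, Y, Z) and a steering vector built from azimuth and elevation angles; neither the channel layout nor the weighting is stated in the claims, and the names `steering_vector` and `beamform` are hypothetical.

```python
import math


def steering_vector(azimuth, elevation=0.0):
    """Assumed first-order steering vector for channels (W, X, Y, Z).

    Angles are in radians; this particular weighting is an illustrative
    choice, not one taken from the claims.
    """
    return [
        1.0,
        math.cos(azimuth) * math.cos(elevation),
        math.sin(azimuth) * math.cos(elevation),
        math.sin(elevation),
    ]


def beamform(channels, vector):
    """Inner product of the steering vector with each multichannel sample."""
    return [
        sum(weight * sample for weight, sample in zip(vector, samples))
        for samples in zip(*channels)
    ]


# Hypothetical two-sample frame; the target direction is azimuth 0.
frame = [
    [1.0, 2.0],  # W
    [0.5, 0.5],  # X
    [9.0, 9.0],  # Y
    [7.0, 7.0],  # Z
]
target_vector = steering_vector(0.0)
beam = beamform(frame, target_vector)
```

With the target at azimuth 0 and elevation 0, the Y and Z channels receive zero weight in this sketch, so only W and X contribute to the output.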
10. A device for enhancing audio quality, characterized in that the device comprises:
an audio signal acquiring unit, configured to obtain an audio signal in a preset format;
a preprocessing unit, configured to preprocess the audio signal, wherein the preprocessing comprises calculating an average signal of the channels of the audio signal and/or performing beamforming on the audio signal;
a noise suppression processing unit, configured to perform noise suppression on the audio signal based on the signal obtained by the preprocessing, to obtain a quality-enhanced audio signal.
CN201710064271.7A 2017-02-04 2017-02-04 Method and device for enhancing audio quality Active CN106816156B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710064271.7A CN106816156B (en) 2017-02-04 2017-02-04 Method and device for enhancing audio quality

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710064271.7A CN106816156B (en) 2017-02-04 2017-02-04 Method and device for enhancing audio quality

Publications (2)

Publication Number Publication Date
CN106816156A true CN106816156A (en) 2017-06-09
CN106816156B CN106816156B (en) 2020-06-30

Family

ID=59111991

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710064271.7A Active CN106816156B (en) 2017-02-04 2017-02-04 Method and device for enhancing audio quality

Country Status (1)

Country Link
CN (1) CN106816156B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107920303A (en) * 2017-11-21 2018-04-17 北京时代拓灵科技有限公司 A kind of method and device of audio collection
CN108520756A (en) * 2018-03-20 2018-09-11 北京时代拓灵科技有限公司 A kind of method and device of speaker's speech Separation
CN113077787A (en) * 2020-12-22 2021-07-06 珠海市杰理科技股份有限公司 Voice data identification method, device, chip and readable storage medium
CN113170270A (en) * 2018-10-08 2021-07-23 诺基亚技术有限公司 Spatial audio enhancement and reproduction

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1809105A (en) * 2006-01-13 2006-07-26 北京中星微电子有限公司 Dual-microphone speech enhancement method and system applicable to mini-type mobile communication devices
CN1953059A (en) * 2006-11-24 2007-04-25 北京中星微电子有限公司 A method and device for noise elimination
CN102227768A (en) * 2009-01-06 2011-10-26 三菱电机株式会社 Noise cancellation device and noise cancellation program
CN102801861A (en) * 2012-08-07 2012-11-28 歌尔声学股份有限公司 Voice enhancing method and device applied to cell phone
CN104065798A (en) * 2013-03-21 2014-09-24 华为技术有限公司 Sound signal processing method and device
WO2016147020A1 (en) * 2015-03-19 2016-09-22 Intel Corporation Microphone array speech enhancement

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107920303A (en) * 2017-11-21 2018-04-17 北京时代拓灵科技有限公司 A kind of method and device of audio collection
CN107920303B (en) * 2017-11-21 2019-12-24 北京时代拓灵科技有限公司 Audio acquisition method and device
CN108520756A (en) * 2018-03-20 2018-09-11 北京时代拓灵科技有限公司 A kind of method and device of speaker's speech Separation
CN108520756B (en) * 2018-03-20 2020-09-01 北京时代拓灵科技有限公司 Method and device for separating speaker voice
CN113170270A (en) * 2018-10-08 2021-07-23 诺基亚技术有限公司 Spatial audio enhancement and reproduction
US11363403B2 (en) 2018-10-08 2022-06-14 Nokia Technologies Oy Spatial audio augmentation and reproduction
US11729574B2 (en) 2018-10-08 2023-08-15 Nokia Technologies Oy Spatial audio augmentation and reproduction
CN113077787A (en) * 2020-12-22 2021-07-06 珠海市杰理科技股份有限公司 Voice data identification method, device, chip and readable storage medium

Also Published As

Publication number Publication date
CN106816156B (en) 2020-06-30

Similar Documents

Publication Publication Date Title
JP7011075B2 (en) Target voice acquisition method and device based on microphone array
CN106816156A (en) A kind of enhanced method and device of audio quality
CN104103277B (en) A target speech enhancement method for a single acoustic vector sensor based on time-frequency masking
US10856094B2 (en) Method and device for sound source localization
CN107221336A (en) A device and method for enhancing target speech
US8908883B2 (en) Microphone array structure able to reduce noise and improve speech quality and method thereof
CN100524465C (en) A method and device for noise elimination
US8947978B2 (en) System and method for estimating the direction of arrival of a sound
US9552828B2 (en) Audio signal processing device
CN102402987A (en) Noise suppression device, noise suppression method, and program
CN105467364A (en) Method and apparatus for localizing target sound source
CN112242148B (en) Headset-based wind noise suppression method and device
US20100111329A1 (en) Sound Processing Apparatus, Sound Processing Method and Program
CN111081267B (en) Multi-channel far-field speech enhancement method
CN107346664A (en) A kind of ears speech separating method based on critical band
CN103632677A (en) Method and device for processing voice signal with noise, and server
CN103680512B (en) System and method for improving the speech recognition level of a vehicle array microphone
CN107742521A (en) The coding method of multi-channel signal and encoder
CN105702262A (en) Headset double-microphone voice enhancement method
CN105590630A (en) Directional noise suppression method based on assigned bandwidth
CN105845150A (en) Voice enhancement method and system adopting cepstrum to correct
CN111951818B (en) Dual-microphone voice enhancement method based on improved power difference noise estimation algorithm
CN108520756A (en) A kind of method and device of speaker's speech Separation
CN103824563A (en) Hearing aid denoising device and method based on module multiplexing
CN114189781A (en) Noise reduction method and system for double-microphone neural network noise reduction earphone

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant