CN105845152A - Method for detecting audio signal echoes - Google Patents


Info

Publication number
CN105845152A
Authority
CN
China
Prior art keywords
audio signal
frame
echo
power
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610169131.1A
Other languages
Chinese (zh)
Inventor
杜鹢
杨俊�
郑泽俊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CHENGDU PTLC COMMUNICATIONS TECHNOLOGY Co Ltd
Original Assignee
CHENGDU PTLC COMMUNICATIONS TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by CHENGDU PTLC COMMUNICATIONS TECHNOLOGY Co Ltd
Priority to CN201610169131.1A
Publication of CN105845152A
Legal status: Pending

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)

Abstract

The invention discloses a method for detecting echoes in an audio signal. The method comprises the following steps: the power cepstrum of the audio signal is divided into segments and a kurtosis value is calculated for each segment; the kurtosis values are compared with a preset threshold, and whether the audio signal contains echo is determined from the comparison result. Echo can thus be detected in real time during audio signal transmission without a reference signal, with high detection accuracy. This solves the problem that traditional detection methods depend on a reference source, which makes their detection accuracy difficult to guarantee.

Description

An audio signal echo detection method
Technical field
The present invention relates to the technical field of audio transmission systems, and in particular to an audio signal echo detection method.
Background
With the rapid development of communication technology, people's requirements for communication quality keep rising, and the presence of echo during audio transmission makes communication quality hard to guarantee. To eliminate echo, and thereby improve communication quality and user experience, it is essential to detect in real time whether echo is present during audio transmission. A traditional echo detection system uses an echo-free audio signal as a reference signal and compares it with the currently transmitted audio signal to judge whether the transmitted signal contains echo. However, this reference-dependent detection method is limited by the reference source, and its accuracy is hard to guarantee.
Summary of the invention
The technical problem to be solved by the present invention is to detect in real time, without requiring a reference signal, whether echo is present during audio signal transmission. The aim is to provide an audio signal echo detection method that solves the problem that current detection methods depend on a reference source and detect inaccurately when the reference source is unstable.
The present invention is achieved through the following technical solutions:
An audio signal echo detection method, comprising the following steps:
A. dividing the continuous audio signal into frames;
B. calculating the power cepstrum of each frame of the audio signal;
C. segmenting the power cepstrum and calculating the kurtosis value of each segment;
D. comparing the kurtosis values with a set threshold to judge whether the audio signal contains echo.
In particular, step B specifically includes:
B1. applying a Fourier transform to each frame of the audio signal under test to obtain the power spectrum of that frame;
B2. taking the logarithm of the power spectrum;
B3. applying an inverse Fourier transform to the logarithm of the power spectrum to obtain the power cepstrum.
In particular, step C specifically includes:
C1. segmenting the power cepstrum of each frame of the audio signal;
C2. calculating the kurtosis value of each segment of the power cepstrum;
C3. obtaining the maximum kurtosis value of each frame of the audio signal.
In particular, step D specifically includes:
D1. comparing the maximum kurtosis value of each frame of the audio signal with the set threshold;
D2. when the maximum kurtosis values of N consecutive frames of the audio signal are all greater than the set threshold, judging that the audio signal contains echo, where N is a positive integer.
Compared with the prior art, the present invention has the following advantages and beneficial effects:
The audio signal echo detection method of the present invention can detect in real time, without requiring a reference signal, whether echo is present during audio signal transmission, with high detection accuracy. It overcomes the problem that traditional detection methods are limited by the reference source, which makes detection accuracy hard to guarantee.
Brief description of the drawings
The accompanying drawing described here provides a further understanding of the embodiments of the present invention and forms part of this application; it does not limit the embodiments of the present invention. In the drawing:
Fig. 1 is a flow chart of the audio signal echo detection method provided by Embodiment 1 of the present invention.
Detailed description of the embodiments
To make the objects, technical solutions, and advantages of the present invention clearer, the present invention is described in further detail below with reference to an embodiment and the accompanying drawing. The exemplary embodiment and its description serve only to explain the present invention and do not limit it.
Embodiment 1
Fig. 1 shows the flow chart of the audio signal echo detection method provided by Embodiment 1 of the present invention.
In this embodiment, the audio signal echo detection method specifically includes:
S101. Divide the continuous audio signal into frames.
An audio signal is quasi-stationary and cannot be processed directly as a whole. In this embodiment, the continuous signal is therefore divided into frames before processing, each frame being about 20-30 ms long, and each frame is then processed as a stationary signal.
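The framing step can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `frame_signal`, the use of non-overlapping frames, and the dropping of trailing samples that do not fill a frame are all assumptions, since the source only states a frame length of about 20-30 ms.

```python
import numpy as np

def frame_signal(x, sample_rate, frame_ms=20.0):
    """Split a 1-D signal into non-overlapping frames of frame_ms milliseconds.

    Assumption: no overlap and no windowing; samples that do not fill a
    complete final frame are discarded.
    """
    x = np.asarray(x, dtype=float)
    frame_len = int(round(sample_rate * frame_ms / 1000.0))
    n_frames = x.size // frame_len
    # Keep only whole frames, then reshape to (n_frames, frame_len).
    return x[: n_frames * frame_len].reshape(n_frames, frame_len)

# Usage: one second of audio at 8 kHz gives 50 frames of 160 samples each.
frames = frame_signal(np.arange(8000), sample_rate=8000, frame_ms=20.0)
```

Overlapping frames or a tapering window are common alternatives in speech processing; the source does not say whether either is used.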
S102. Apply a Fourier transform to each frame of the audio signal under test to obtain the power spectrum of that frame.
The power spectrum is short for the power spectral density function, defined as the signal power per unit frequency band. It describes how signal power varies with frequency, that is, how signal power is distributed in the frequency domain. In this embodiment, a Fourier transform is applied to a frame of audio signal x(n) under test, giving the spectrum F(x(n)); the power spectrum of the audio signal is its squared magnitude |F(x(n))|^2.
S103. Take the logarithm of the power spectrum.
Taking the natural logarithm of the power spectrum of audio signal x(n) gives ln(|F(x(n))|^2).
S104. Apply an inverse Fourier transform to the logarithm of the power spectrum to obtain the power cepstrum.
The power cepstrum is the inverse Fourier transform of the logarithm of the power spectrum of the audio signal. When the audio signal contains echo, a relatively sharp peak forms in the power cepstrum, and this embodiment uses that property as the criterion for judging echo. For audio signal x(n), the power cepstrum is PCx(n) = F^-1[ln(|F(x(n))|^2)], where F(.) denotes the Fourier transform, F^-1(.) denotes the inverse Fourier transform, and ln(.) denotes the natural logarithm.
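Steps S102 to S104 map directly onto three NumPy calls. The sketch below is illustrative only: the function name `power_cepstrum` and the small constant `eps`, added to avoid taking the logarithm of zero, are assumptions not found in the source.

```python
import numpy as np

def power_cepstrum(frame, eps=1e-12):
    """Power cepstrum of one frame: F^-1[ ln(|F(x)|^2) ]."""
    spectrum = np.fft.fft(frame)           # S102: Fourier transform
    power = np.abs(spectrum) ** 2          # power spectrum |F(x)|^2
    log_power = np.log(power + eps)        # S103: natural logarithm (eps is an assumption)
    # S104: inverse transform; the result is real for a real input frame,
    # so the negligible imaginary part is dropped.
    return np.fft.ifft(log_power).real

# Usage: a unit impulse plus a copy scaled by 0.5 and delayed by 32 samples.
frame = np.zeros(256)
frame[0] = 1.0
frame[32] = 0.5
pc = power_cepstrum(frame)
peak_index = int(np.argmax(pc[1:128])) + 1   # search away from index 0
```

For this idealized frame the largest cepstral value away from index 0 falls at index 32, the echo delay, with a height close to the echo gain of 0.5.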
S105. Segment the power cepstrum of each frame of the audio signal.
The power cepstrum of each frame is divided into segments, and a kurtosis value is calculated for each segment. If a segment contains an echo, its kurtosis value is relatively large, and this serves as the criterion for judging echo.
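Why a single segment should stand out can be seen from a simple single-echo model. The following is a sketch under assumptions that are not in the source: the frame contains a signal s(n) plus one echo with gain a (0 < a < 1) and delay d samples, and edge effects of the finite frame are ignored.

$$x(n) = s(n) + a\,s(n-d)$$

$$|F(x)|^2 = |F(s)|^2\,\bigl(1 + a^2 + 2a\cos(\omega d)\bigr)$$

$$\ln|F(x)|^2 = \ln|F(s)|^2 + 2\sum_{m=1}^{\infty}\frac{(-1)^{m+1}a^m}{m}\cos(m\omega d)$$

$$PC_x(n) = PC_s(n) + \sum_{m=1}^{\infty}\frac{(-1)^{m+1}a^m}{m}\bigl[\delta(n-md) + \delta(n+md)\bigr]$$

Under this model the echo adds isolated impulses to the power cepstrum at multiples of the delay d, the first with height a. A segment that contains such an impulse has a heavy-tailed value distribution and therefore a large kurtosis.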
S106. Calculate the kurtosis value of each segment of the power cepstrum.
Kurtosis is a statistic that characterizes the height of the peak of a probability density curve at the mean. The larger the kurtosis coefficient, the more extreme values the sample distribution contains; the remaining values must then be more concentrated around the mode, so the distribution is necessarily steeper.
The kurtosis of a random audio signal is defined as the ratio of the fourth central moment of the random variable to the square of its variance. Central moment: for a positive integer k, if E(X) exists and E[|X-E(X)|^k] < ∞, then E{[X-E(X)]^k} is called the k-th central moment of the random variable X. The variance of X is the second central moment of X, that is, D(X) = E{[X-E(X)]^2}. Variance: in probability theory and statistics, variance measures the dispersion of a random variable or a set of data. In probability theory, variance measures how far a random variable deviates from its mathematical expectation (that is, its mean). In statistics, the variance (sample variance) is the average of the squared differences between each data value and the mean.
In this embodiment, kurtosis is calculated as:

$$\mathrm{Kurtosis} = \frac{\sum_{i=1}^{N}\left(Y_i - \bar{Y}\right)^4}{(N-1)\,S^4}$$

where N is the number of power cepstrum data points, i is the data index, $Y_i$ is each data value, $\bar{Y}$ is the mean of the data, and $S^4$ is the fourth power of the standard deviation S, that is, the square of the variance.
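The kurtosis formula can be written as a short function. The expression defining S did not survive in the source, so this sketch assumes S is the sample standard deviation with an N-1 denominator, matching the N-1 in the formula; the function name `segment_kurtosis` and the zero return value for constant or single-value segments are also assumptions.

```python
import numpy as np

def segment_kurtosis(y):
    """Kurtosis = sum((Y_i - mean)^4) / ((N - 1) * S^4).

    Assumption: S^2 is the sample variance with an N-1 denominator.
    """
    y = np.asarray(y, dtype=float)
    n = y.size
    if n < 2:
        return 0.0                      # not enough data for a variance
    dev = y - y.mean()
    s2 = np.sum(dev ** 2) / (n - 1)     # sample variance S^2
    if s2 == 0.0:
        return 0.0                      # constant segment: no peak at all
    return float(np.sum(dev ** 4) / ((n - 1) * s2 ** 2))

# Usage: a flat segment with one spike scores far higher than an evenly
# spread segment.
flat = segment_kurtosis([-1.0, 1.0, -1.0, 1.0])
spiky = segment_kurtosis([0.0] * 9 + [10.0])
```

With this definition the evenly spread example evaluates to 0.75 and the spiky one to 7.3, which illustrates why a sharp cepstral peak raises the kurtosis of its segment.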
S107. Obtain the maximum kurtosis value of each frame of the audio signal.
From the kurtosis values calculated for the power cepstrum segments, the maximum kurtosis value of each frame is obtained, which identifies the segment with the largest kurtosis.
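Segmenting one frame's power cepstrum and taking the largest kurtosis might look like the sketch below. The source does not say how segments are chosen, so several choices here are assumptions: equal-length segments via `np.array_split`, use of only the first half of the cepstrum (it is symmetric for a real signal), and a `start` offset that skips index 0, whose large value would otherwise dominate the first segment.

```python
import numpy as np

def kurtosis(y):
    """Sum of fourth-power deviations over (N-1)*S^4, with S^2 the sample variance."""
    y = np.asarray(y, dtype=float)
    if y.size < 2:
        return 0.0
    dev = y - y.mean()
    s2 = np.sum(dev ** 2) / (y.size - 1)
    return float(np.sum(dev ** 4) / ((y.size - 1) * s2 ** 2)) if s2 > 0 else 0.0

def max_segment_kurtosis(cepstrum, n_segments=4, start=1):
    """Return (largest kurtosis, index of the segment that produced it)."""
    cepstrum = np.asarray(cepstrum, dtype=float)
    # Assumption: analyse indices start .. len/2 - 1 only.
    usable = cepstrum[start : cepstrum.size // 2]
    segments = np.array_split(usable, n_segments)   # near-equal lengths
    values = [kurtosis(seg) for seg in segments]
    best = int(np.argmax(values))
    return values[best], best

# Usage: a small alternating background with one sharp peak at index 40.
c = np.tile([0.01, -0.01], 64)   # length 128
c[40] = 1.0
k_max, seg_idx = max_segment_kurtosis(c, n_segments=4, start=1)
```

In this toy input the peak lies in the third segment (index 2), and that segment's kurtosis is more than ten times that of the background segments.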
S108. Compare the maximum kurtosis value of each frame of the audio signal with the set threshold.
A threshold of suitable size is preset, and the maximum kurtosis value calculated from the power cepstrum segments of each frame is compared with it.
S109. When the maximum kurtosis values of N consecutive frames of the audio signal are all greater than the set threshold, judge that the audio signal contains echo.
If the maximum kurtosis values of N consecutive frames all exceed the set threshold, the audio signal is judged to contain echo; otherwise, the audio signal is considered echo-free.
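The decision rule of S108 and S109 reduces to counting a run of consecutive frames above the threshold. In this sketch the function name `detect_echo` and the example threshold and N are hypothetical; the source gives no numeric values for either.

```python
def detect_echo(max_kurtosis_per_frame, threshold, n_consecutive):
    """Return True if n_consecutive frames in a row exceed the threshold."""
    run = 0
    for k in max_kurtosis_per_frame:
        # Extend the run on a frame above the threshold, otherwise reset it.
        run = run + 1 if k > threshold else 0
        if run >= n_consecutive:
            return True
    return False

# Usage with hypothetical per-frame maxima, threshold 5 and N = 3.
with_echo = detect_echo([1, 9, 9, 1, 9, 9, 9], threshold=5, n_consecutive=3)
without_echo = detect_echo([9, 9, 1, 9, 9], threshold=5, n_consecutive=3)
```

Requiring N consecutive frames trades detection latency for robustness: a single frame with an unusually peaky cepstrum does not trigger a detection on its own.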
The technical solution of the present invention calculates kurtosis values over segments of the power cepstrum of the audio signal, compares them with a preset threshold, and judges from the comparison result whether the audio signal contains echo. Echo can thus be detected in real time during audio signal transmission without a reference signal, with high detection accuracy, overcoming the problem that traditional detection methods are limited by the reference source, which makes detection accuracy hard to guarantee.
The specific embodiment described above further explains the objects, technical solutions, and beneficial effects of the present invention. It should be understood that the foregoing is only a specific embodiment of the present invention and is not intended to limit its scope of protection. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims (4)

1. An audio signal echo detection method, characterized by comprising the following steps:
A. dividing the continuous audio signal into frames;
B. calculating the power cepstrum of each frame of the audio signal;
C. segmenting the power cepstrum and calculating the kurtosis value of each segment;
D. comparing the kurtosis values with a set threshold to judge whether the audio signal contains echo.
2. The audio signal echo detection method as claimed in claim 1, characterized in that step B specifically includes:
B1. applying a Fourier transform to each frame of the audio signal under test to obtain the power spectrum of that frame;
B2. taking the logarithm of the power spectrum;
B3. applying an inverse Fourier transform to the logarithm of the power spectrum to obtain the power cepstrum.
3. The audio signal echo detection method as claimed in claim 1, characterized in that step C specifically includes:
C1. segmenting the power cepstrum of each frame of the audio signal;
C2. calculating the kurtosis value of each segment of the power cepstrum;
C3. obtaining the maximum kurtosis value of each frame of the audio signal.
4. The audio signal echo detection method as claimed in claim 1, characterized in that step D specifically includes:
D1. comparing the maximum kurtosis value of each frame of the audio signal with the set threshold;
D2. when the maximum kurtosis values of N consecutive frames of the audio signal are all greater than the set threshold, judging that the audio signal contains echo, where N is a positive integer.
CN201610169131.1A 2016-03-23 2016-03-23 Method for detecting audio signal echoes Pending CN105845152A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610169131.1A CN105845152A (en) 2016-03-23 2016-03-23 Method for detecting audio signal echoes

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610169131.1A CN105845152A (en) 2016-03-23 2016-03-23 Method for detecting audio signal echoes

Publications (1)

Publication Number Publication Date
CN105845152A true CN105845152A (en) 2016-08-10

Family

ID=56583106

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610169131.1A Pending CN105845152A (en) 2016-03-23 2016-03-23 Method for detecting audio signal echoes

Country Status (1)

Country Link
CN (1) CN105845152A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107360530A (en) * 2017-07-03 2017-11-17 苏州科达科技股份有限公司 The method of testing and device of a kind of echo cancellor

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009029076A1 (en) * 2007-08-31 2009-03-05 Tellabs Operations, Inc. Controlling echo in the coded domain
CN104052526A (en) * 2013-03-17 2014-09-17 德克萨斯仪器股份有限公司 Clipping Based On Cepstral Distance For Acoustic Echo Canceller

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009029076A1 (en) * 2007-08-31 2009-03-05 Tellabs Operations, Inc. Controlling echo in the coded domain
CN104052526A (en) * 2013-03-17 2014-09-17 德克萨斯仪器股份有限公司 Clipping Based On Cepstral Distance For Acoustic Echo Canceller

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
WEI ZENG等: "《An Algorithm of Echo Steganalysis Based on Power Cepstrum and Pattern Classification》", 《2008 INTERNATIONAL CONFERENCE ON AUDIO,LANGUAGE AND IMAGE PROCESSING》 *
周浩敏等: "《测试信号处理技术 第2版》", 31 May 2009, 北京航空航天大学出版社 *
汪云路等: "《基于统计特征的语音回声隐藏分析》", 《数据采集与处理》 *
管涛: "《数据分析基础及模型》", 31 July 2015, 合肥工业大学出版社 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107360530A (en) * 2017-07-03 2017-11-17 苏州科达科技股份有限公司 The method of testing and device of a kind of echo cancellor
CN107360530B (en) * 2017-07-03 2020-12-01 苏州科达科技股份有限公司 Echo cancellation testing method and device

Similar Documents

Publication Publication Date Title
US20170133040A1 (en) Abnormal Frame Detection Method and Apparatus
CN104464722B (en) Voice activity detection method and apparatus based on time domain and frequency domain
US11056130B2 (en) Speech enhancement method and apparatus, device and storage medium
EP3091534B1 (en) Method and apparatus for processing speech signal according to frequency domain energy
CN102959625B (en) Method and apparatus for adaptively detecting voice activity in input audio signal
CN104681038B (en) Audio signal quality detection method and device
US20150030164A1 (en) Microphone calibration
CN102148033B (en) Method for testing intelligibility of speech transmission index
CN103886865A (en) Sound Processing Device, Sound Processing Method, And Program
CN103067322A (en) Method for evaluating voice quality of audio frame in single channel audio signal
CN104807540A (en) Noise inspection method and system
CN106663450A (en) Method of and apparatus for evaluating quality of a degraded speech signal
CN103578479A (en) Speech intelligibility measuring method based on auditory masking effect
CN107729845A (en) A kind of frequency respond noise-reduction method decomposed based on sub-space feature value
EP3252756A1 (en) Method and device for determining inter-channel time difference parameter
CN104021791B (en) Detecting method based on digital audio waveform sudden changes
CN105845152A (en) Method for detecting audio signal echoes
Cabrera et al. Increasing robustness in the calculation of the speech transmission index from impulse responses
Alammar et al. Evaluation of the impact of thresholding and frequency/time resolution on signal area estimation methods
CN110944089A (en) Double-talk detection method and electronic equipment
CN111081269B (en) Noise detection method and system in call process
CN112014811B (en) Fine estimation method for radar carrier frequency
CN109460614A (en) Signal time based on instant bandwidth-frequency decomposition method
CN105187143B (en) A kind of fast spectrum perception method and device based on bi-distribution
CN106228995A (en) A kind of audio signal interrupts detection method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20160810
