CN102760441B - Background noise coding/decoding device and method as well as communication equipment - Google Patents

Background noise coding/decoding device and method as well as communication equipment Download PDF

Info

Publication number
CN102760441B
CN102760441B CN201210194567.8A CN201210194567A CN102760441B CN 102760441 B CN102760441 B CN 102760441B CN 201210194567 A CN201210194567 A CN 201210194567A CN 102760441 B CN102760441 B CN 102760441B
Authority
CN
China
Prior art keywords
unit
ground unrest
silence description
description frames
discontinuous transmission
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210194567.8A
Other languages
Chinese (zh)
Other versions
CN102760441A (en
Inventor
王喆
张立斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN201210194567.8A priority Critical patent/CN102760441B/en
Publication of CN102760441A publication Critical patent/CN102760441A/en
Application granted granted Critical
Publication of CN102760441B publication Critical patent/CN102760441B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The embodiment of the invention provides a background noise coding/decoding device and method, as well as communication equipment, and aims at realizing flexible regulation of an SID frame coding scheme and a non-continuous transmission mechanism according to the background noise of the communication environment. The technical scheme of the invention can flexibly adopt different SID frame coding schemes and non-continuous transmission mechanisms according to different types of background noises, and can self adapt to the coding accuracy and the temporal resolution required by different types of background noises under the condition of reasonably utilizing practical resources of a communication system, so as to achieve the purpose of providing various types of background noises at the decoding end with high quality on the premise of reasonably utilizing the communication channel resources.

Description

A kind of background noise encoding/decoding device, method and communication facilities
Technical field
The present invention relates to voice communication technology, particularly a kind of background noise encoding/decoding device, method and communication facilities.
Background technology
In voice communication course, the voice signal being transmitted can be divided into audible signal and without acoustical signal two classes, wherein: audible signal is the voice signal sending when efficient voice being detected, mainly comprises communication party's voice and the information such as ground unrest at that time; The ambient noise signal sending in the gap that efficient voice do not detected without acoustical signal, mainly comprises diversity of settings noise, such as white noise, background brouhaha and quiet etc.Concerning communicating pair, audible signal is useful signal, is the carrier of the main communication information, and is garbage signal without acoustical signal, does not comprise the main communication information.
In voice communication course, what mainly pay close attention to due to communicating pair is useful audible signal, therefore considers the pressure of bandwidth, does not wish to transmit useless without acoustical signal, to reduce transmission bandwidth, increases power system capacity.If but only pass audible signal, can cause the discontinuous of ground unrest, make a side who listens to feel very uncomfortable, especially more obvious in the situation that ground unrest is stronger, even can make voice indigestion, so can only reduce as far as possible transmit useless without acoustical signal, on the basis that guarantees user's susceptibility, reduce bandwidth as far as possible, CNG (Comfort Noise generation has arisen at the historic moment for this reason, comfort noise generates) technology, in CNG technology, transmitting terminal adopts silence compression coding techniques coding background noise frames in noiseless signal phase, and adopt DTX (Discontinuous Transmission, discontinuous transmission) mode is transmitted, receiving end adopts the mode of interpolation to produce continuous ambient noise signal, to increase the comfort of listener.
As shown in Figure 1, for adopting the communication system principle schematic diagram of silence compression coding techniques, wherein, transmitting terminal comprises:
VAD (Voice Activity Detector, speech detector), original signal is inputted VAD after certain pre-service, VAD is conventionally according to the eigenwert of each frame signal, signal to noise ratio (S/N ratio) for example, in detection signal, whether comprise efficient voice, and according to testing result, signal is sent into different scramblers and encode;
Speech coder, for the audible signal that comprises efficient voice being encoded to speech frame output, speech frame is transferred to receiving end by communication channel;
Non-voice scrambler, adopting silence compression technology is background noise frames by the noiseless Signal coding that does not comprise efficient voice;
Background noise frames transmitter, the DTX mode of adopt setting by background noise frames by traffic channel to receiving end, for example every several frames, send a background noise frames, or determine whether to send a background noise frames according to the characteristic parameter situation of change of ground unrest;
Accordingly, receiving end comprises:
Voice decoder, for the speech frame of decoding and receiving from communication channel, and exports after being reconstructed into audible signal;
Non-voice demoder, for the background noise frames of decoding and receiving from communication channel;
Reconstructor, for adopting the mode of interpolation that decoded data is reconstructed into continuous ambient noise signal, to increase the comfort of listener.
The non-voice scrambler of transmitting terminal and background noise frames transmitter form DTX unit, and the non-voice demoder of receiving end and reconstructor form CNG unit, and silence compression technology is by being positioned at VAD and the DTX of transmitting terminal, and the CNG realization that is positioned at receiving end.
With CELP (Code-Excited Linear Prediction, the Code Excited Linear Prediction) speech coding technology commonly used, be example, CELP is a kind of high efficiency speech coding technology.Owing to having adopted perceptual weighting, analyze the technology such as synthetic, vector quantization and rear filtering, CELP can provide high-quality synthetic speech in middle low rate.In CELP speech coding technology, at transmitting terminal, carry out speech signal analysis to obtain the characteristic parameter of voice, the characteristic parameter of voice is encoded in code stream and is sent; Receiving end carries out phonetic synthesis, reconstructed speech signal according to the characteristic parameter in code stream.Voice signal s (n) is divided into the short time period, is called frame, and the speech analysis of transmitting terminal and the phonetic synthesis of receiving end are all carried out frame by frame.
At transmitting terminal, original signal s (n) can regard the output that a pumping signal e (n) encourages a wave filter v (n) to produce as, be s (n)=e (n) * v (n), receiving end is used identical wave filter to synthesize voice, is called composite filter.Therefore the characteristic parameter transmitting in the encoding code stream of voice signal comprises composite filter parameter and excitation parameters, wherein: composite filter parameter is mainly line spectral frequencies LSF (Line Spectrum Frequency) quantization parameter, and pumping signal parameter comprises: adaptive codebook parameter (comprising pitch delay parameter and fundamental tone gain parameter) and fixed codebook parameters (position and symbol and the fixed codebook gain parameter that comprise non-zero pulses in fixed codebook).
Different from speech coding parameters, what ground unrest coding parameter was described is ground unrest characteristic, because the pumping signal of ground unrest can be thought simple noise random series, and these sequences all can simply produce with random noise generation device at encoding and decoding end, then with energy parameter, control the amplitude of these sequences, just can produce final pumping signal, therefore pumping signal characteristic parameter can simply represent with energy parameter, and do not need to further describe with other some characteristic parameters, so different from speech frame in ground unrest encoding code stream, be, its excitation parameters is the energy parameter of current background noise frame, and identical with speech frame in ground unrest encoding code stream, the composite filter parameter in encoding code stream is also line spectral frequencies LSF quantization parameter, is concrete quantization method difference to some extent.By above analysis, can think that to the coded system of ground unrest be exactly a kind of simple voice coding modes in essence.
Through DTX compressed encoding and by the background noise frames of discontinuous transmission, be commonly referred to as SID (Silence Descriptor, silence description frames) frame.In current various speech coding systems, each intrasystem ambient noise signal is all used SID frame format and discontinuous transmission mechanism.
Wherein, G.729B silence compression scheme is a kind of representative silence compression technology, G.729B based on CELP algorithm model, so its ground unrest parameter of transmitting also obtains based on CELP model, comprises for describing composite filter parameter and the excitation parameters of ground unrest.At coding side, for example the court verdict of VAD is 0 o'clock, represent that current signal is noise, through certain pretreated original signal, be admitted to ground unrest code device, at ground unrest code device, ground unrest parameter is extracted and quantized, form SID frame, then ground unrest code device determines whether sending a frame SID frame according to the parameter situation of change in SID frame, for example, when the filter parameter in SID frame and energy parameter change greatly, sends a frame SID frame.G.729B, in, the interval of two adjacent S ID frames is at least 2 frames.In decoding end, if enter the ground unrest decoding and reconstituting stage, in ground unrest decoding device, according to the SID frame receiving, synthesize the comfort noise of description encoding end ground unrest characteristic.G.729B the bit number that in SID frame is each parametric distribution is as shown in table 1 below:
Table 1.
Bit distributes (from a high position to low level) Parametric description
S1 Quantitative prediction device index
S2-S6 One-level LSF quantisation vector index
S7-S10 Secondary LSF quantisation vector index
S11-S15 Gain (energy)
AMR (Adaptive Multi-Rate speech codec, self-adaptation multi code Rate of Chinese character speech coder) and AMR-WB (Adaptive Multi-Rate Wide Band speech codec, wideband adaptive multi code Rate of Chinese character speech coder) DTX in silence compression scheme adopts fixing transmit mechanism, no matter how the characteristic of ground unrest changes, and every 8 frames send a SID frame.The bit number that in the SID frame of AMR is each parametric distribution is as shown in table 2 below, is that the bit number of each parametric distribution is as shown in table 3 below in the SID frame of AMR-WB.
Table 2.
Bit distributes (from a high position to low level) Parametric description
S1-S3 Quantitative prediction device index
S4-S11 One-level LSF quantisation vector index
S12-S20 Secondary LSF quantisation vector index
S21-S29 Three grades of LSF quantisation vector index
S30-S35 Logarithm frame energy index
Table 3.
Bit distributes (from a high position to low level) Parametric description
S1-S6 One-level LSF quantisation vector index
S7-S12 Secondary LSF quantisation vector index
S13-S18 Three grades of LSF quantisation vector index
S19-S23 Level Four LSF quantisation vector index
S24-S28 Pyatyi LSF quantisation vector index
S29-S34 Logarithm frame energy index
S35 Shake sign
Existing ground unrest coded system can be carried out simple code to ground unrest, and synthesizes simple comfortable ground unrest in decoding end, can better meet to a certain extent the requirement of answering quality.Yet along with the development of the communication technology, voice coding quality is more and more higher, corresponding, the coding quality of the ground unrest in sound stage is also more and more higher, in this case, the ground unrest when making voice communication is more continuous, also needs to improve the coding quality of silent period ground unrest.Existing comparatively single SID frame coding mode and discontinuous transmission mechanism obviously cannot meet the specific requirement of multiple communication environment.
Summary of the invention
The embodiment of the present invention provides a kind of background noise encoding/decoding device, method and communication facilities, to realize according to the ground unrest situation of communication environment, adjusts flexibly SID frame coding mode and discontinuous transmission mechanism.
A ground unrest code device, comprising:
Background noise analysis unit, at least one signal characteristic parameter value of background extraction noise signal;
Ground unrest taxon, for determining ground unrest type according to described at least one signal characteristic parameter value;
Coding switch unit and at least two discontinuous transmission unit, described coding switch unit is for inputting corresponding discontinuous transmission unit according to described ground unrest type by ambient noise signal, each discontinuous transmission unit is for being encoded to by ambient noise signal silence description frames the discontinuous transmission that comprises discontinuous transmission unit indication information, and wherein the silence description frames coded system of each discontinuous transmission unit and/or silence description frames discontinuous transmission mechanism are not identical.
A ground unrest decoding device, comprising:
The first recognition unit, for identifying the discontinuous transmission unit indication information of silence description frames;
Decoding switch unit and at least two comfort noise generation units, described decoding switch unit is used for according to the recognition result of described recognition unit, described silence description frames is inputted to corresponding comfort noise generation unit, each comfort noise generation unit is according to the corresponding decoding of the coded system of the silence description frames receiving silence description frames, and reconstruct ambient noise signal.
A communication facilities, comprising:
The first speech detector, for detection of whether comprising efficient voice classification output in the signal gathering;
Speech coder, for being speech frame continuous wave output by the Signal coding that comprises efficient voice;
Also comprise ground unrest code device, this ground unrest code device specifically comprises:
Background noise analysis unit, for obtaining at least one signal characteristic parameter value of the ambient noise signal that does not comprise efficient voice;
Ground unrest taxon, for determining ground unrest type according to described at least one signal characteristic parameter value;
Coding switch unit and at least two discontinuous transmission unit, described coding switch unit is for inputting corresponding discontinuous transmission unit according to described ground unrest type by ambient noise signal, each discontinuous transmission unit is for being encoded to by ambient noise signal silence description frames the discontinuous transmission that comprises discontinuous transmission unit indication information, and wherein the silence description frames coded system of each discontinuous transmission unit and/or silence description frames discontinuous transmission mechanism are not identical.
A communication facilities, comprises ground unrest decoding device, and described ground unrest decoding device comprises:
The first recognition unit, for identifying the discontinuous transmission unit indication information of silence description frames;
Decoding switch unit and at least two comfort noise generation units, described decoding switch unit is used for according to the recognition result of described recognition unit, described silence description frames is inputted to corresponding comfort noise generation unit, each comfort noise generation unit is according to the corresponding decoding of the coded system of the silence description frames receiving silence description frames, and reconstruct ambient noise signal.
A ground unrest coding method, comprising:
At least one signal characteristic parameter value of background extraction noise signal;
According to described at least one signal characteristic parameter value, determine ground unrest type;
According to a kind of discontinuous transmission mechanism of described ground unrest type selecting, ambient noise signal is encoded to silence description frames the discontinuous transmission that comprises discontinuous transmission mechanism indication information, wherein the silence description frames coded system in each discontinuous transmission mechanism and/or the discontinuous send mode of silence description frames are not identical.
A coding/decoding method, comprising:
Discontinuous transmission mechanism indication information in identification silence description frames, wherein the silence description frames coded system in each discontinuous transmission mechanism is not identical;
According to described discontinuous transmission mechanism indication information, adopt corresponding decoding process decoding silence description frames, and be reconstructed into ambient noise signal.
Application technical solution of the present invention, can be according to the different SID frame coding method of the dissimilar flexible employing of ground unrest and discontinuous transmission mechanism, in the situation that rationally utilizing communication system real resource, be adaptive to the desired encoding precision of dissimilar ground unrest and time sense, reach under the prerequisite of rationally utilizing channel resource in the high-quality object that all kinds ground unrest is provided of decoding end.
Accompanying drawing explanation
Fig. 1 is the communication system principle schematic diagram of existing employing silence compression coding techniques;
Fig. 2, Fig. 3, Fig. 5 and Fig. 6 are respectively a kind of ground unrest code device primary structure schematic diagram that the embodiment of the present invention provides;
The switch mode principle schematic that Fig. 4 provides for the embodiment of the present invention;
Fig. 7, Fig. 8 and Fig. 9 are respectively a kind of communication facilities primary structure schematic diagram that the embodiment of the present invention provides.
Embodiment
Further analyze existing ground unrest coding, in each coding/decoding system, all adopt single ground unrest frame coding mode, cannot describe targetedly the concrete ambient noise signal of the communication environment of various characteristics, thereby meet the multifarious demand of communication environment; Single discontinuous transmission mechanism also cannot be transmitted background noise frames targetedly to the background noise frames of different qualities, so that cannot produce targetedly good comfort noise in decoding end; And the unicity of discontinuous transmission mechanism also cannot be adaptive to the actual state of communication system, for example communication bandwidth, can not rationally be applied down to the resource of communication system.
Therefore, a kind of encoding and decoding technique scheme that the embodiment of the present invention provides, can according to ground unrest characteristic, carry out respective handling targetedly, at coding side, ground unrest is analyzed, according to the characteristic of ground unrest, adopt different DTX scheme codings to send SID frame, and adopt corresponding CNG scheme reconstructed background noise signal at receiving end.
As shown in Figure 2, be a kind of ground unrest code device 20 that the embodiment of the present invention provides, this ground unrest code device 20 is for silence compression navamander, and primary structure comprises:
At least two DTX unit 21, comprise 211,DTX unit, DTX unit 2 12......DTX unit 21n, each DTX unit 21 is not continuous wave output for ambient noise signal being encoded to SID frame according to different DTX mechanism, wherein, difference between different DTX mechanism comprises: SID frame coding mode is not identical, SID frame discontinuous transmission mechanism is not identical, or the two is not identical;
Background noise analysis unit 22, for calculating at least one signal characteristic parameter value according to ambient noise signal;
Ground unrest taxon 23, for determining ground unrest type according to described at least one signal characteristic parameter value;
Coding switch unit 24, for inputting corresponding DTX unit 21 according to described ground unrest type by the ambient noise signal of reception.
The function of background noise analysis unit 22 is to analyze the ambient noise signal of input to obtain the characteristic parameter of background signal, analytical approach is to calculate one or more signal characteristic parameter values that can characterize ambient noise signal, and these signal characteristic parameters can be one of following parameter or combination in any: when long, SNR (signal to Noise Ratio), ground unrest are estimated level, ground unrest energy hunting estimation, ground unrest spectrum fluctuation estimation, ground unrest vibration frequency etc.Other can characterization signal background the parameter of size, kind (stable state/unstable state, changes size, changes speed), SNR (when long, in short-term, peak value) also can be calculated for analyzing.Except ambient noise signal being inputted background noise analysis unit 22, VAD can also be adjudicated to current demand signal is the indicator signal input background noise analysis unit 22 of background signal, and background noise analysis unit 22 triggers and calculates signal characteristic parameter value according to ambient noise signal indication information.
The function of ground unrest taxon 23 is the signal characteristic parameter values that calculate according to background noise analysis unit 22, divides the type of current background noise signal.Ground unrest type at least defines two kinds, can define according to whether stablizing of ground unrest, for example: ground unrest is divided into stationary background noise signal and unstable state ambient noise signal two classes; Whether stablizing and the size definition of SNR when long according to ground unrest and for example: ground unrest is divided into the large SNR ambient noise signal of stable state, the little SNR ambient noise signal of stable state, the large SNR ambient noise signal of unstable state, little SNR ambient noise signal four classes of unstable state.Can also be according to the size definition ground unrest type of ground unrest institute inclusion information meaning, such as ambient noise signal can be divided into melodious ambient noise signal (as music, flowing water, bird's twitters etc.), environmental background noise signal is (as cocktail party, curbside, airport etc.) or other ambient noise signal three classes.In a word, the classified types of ground unrest can be set as required, does not limit the enforcement of technical solution of the present invention.
Coding is the pre-stored applicable DTX unit 21 of each ground unrest type in switch unit 24, the ground unrest type definite according to ground unrest taxon 23, between at least two DTX unit 21, switch, select the DTX unit 21 corresponding with current background noise type to process.
Generally each ground unrest type is distinguished corresponding a kind of different DTX scheme, but consider demand or the communication channel resources situation of change of concrete communication scenes, can also set different switch modes, the switching controls scheme that each switch mode is corresponding different, can the corresponding concrete switching controls scheme of each switch mode is pre-stored in coding switch unit 24, and by the required switch mode of switch mode steering order indication coding switch unit 24, switch mode steering order can be limited instruction set of the current channel resource situation of reflection, the corresponding a kind of switch mode of each specific switch mode steering order, it is the distribution method of a kind of DTX unit.Also can, when switch mode steering order each time, concrete switching controls scheme corresponding to required switch mode be sent to coding switch unit 24 simultaneously.
Coding switch unit 24 is determined corresponding switch mode according to the indication information of concrete reason.To those skilled in the art, concrete switch mode switching control method is a lot, will not enumerate here.
Therefore as shown in Figure 3, in the code device of ground unrest shown in Fig. 2 20, can also increase command reception unit 25 is set, receive switch mode steering order and be transmitted to coding switch unit 24, switch mode steering order can be operating personnel from outside input, can be also transmitting terminal according to the communication channel resource conditions generation detecting etc.
Coding switch unit 24 can be selected appropriate switch mode according to switch mode steering order, has at least DTX unit 21 corresponding to ground unrest type not identical between any two switch modes.The ground unrest code device 20 of take comprises that four DTX unit 21 are example, the setting example of switch mode as shown in Figure 4:
When command reception unit 25 is not set, coding switch unit 24 is only that ground unrest is selected appropriate DTX unit according to ground unrest type, and for example switch mode 1;
When being provided with command reception unit 25, coding switch unit 24 is determined the pattern of selecting DTX unit according to switch mode steering order.For example:
While receiving switch mode steering order 1, according to switch mode 1, switch;
While receiving switch mode steering order 2, according to switch mode 2, switch;
While receiving switch mode steering order 3, according to switch mode 3, switch.
Switch mode steering order can be sent according to current communication channel resource conditions, for example, when current channel resource is nervous, wherein the ground unrest of part ground unrest type may be used samely compared with the DTX unit of low rate, and even all ground unrest types are all used minimum speed limit DTX unit.Again for example, when there is no switch mode steering order, only have wherein part DTX unit to be used, but when switch mode steering order represents that current channel resource is abundant or nervous, another part DTX unit is used.
Example is as shown in Figure 4 visible:
Instruction 1 indication enters switch mode 1, each ground unrest type corresponding a kind of different DTX unit all now, and instruction 1 can mean that current channel resource is normal;
Instruction 2 indications enter switch mode 2, the DTX unit that now part ground unrest type can be corresponding identical, as ground unrest Class1 and the equal corresponding DTX of ground unrest type 2 unit 212, ground unrest type 3 and the equal corresponding DTX of ground unrest type 4 unit 214, instruction 2 can represent current channel resource, and abundant (212 compare DTX unit, DTX unit 211 code rates are high, take resource many; 214 compare DTX unit, DTX unit 213 code rates are high, take resource many), or can represent that more nervous (212 compare DTX unit, DTX unit 211 code rates low, takies resource few to current channel resource; 214 compare DTX unit, DTX unit 213 code rates are low, take resource few);
Instruction 3 indications enter switch mode 3, the equal corresponding DTX unit 214 only of all ground unrest types now, instruction 3 can represent current channel resource, and very abundant (DTX unit 214 code rates are the highest, take resource maximum), or can represent current channel resource very nervous (DTX unit 214 code rates are minimum, take resource minimum).
As shown in Figure 5, when very sufficient or ground unrest is melodious especially at current channel resource, can also speech coder be set in ground unrest code device 20 again, utilize speech coder to carry out full-rate codes to specific background noise.Satisfied the imposing a condition of signal characteristic parameter value of specific background noise, whether ground unrest taxon 23 meets and imposes a condition to judge specific background noise type according at least one signal characteristic parameter value, and indicate coding switch unit 24 when specific background noise, to be switched to speech coder, speech coder is encoded to speech frame continuous wave output by this specific background noise signal.Certainly, also can utilize the existing speech coder of transmitting terminal to bear this function.
Consult shown in Fig. 1, each DTX unit 21 comprises two parts, and one is non-voice scrambler, and for ambient noise signal being encoded to SID frame, it two is background noise frames transmitter, for sending background noise frames according to the discontinuous mode of setting.Difference between DTX unit can be the difference of SID frame coding mode, can be also the difference of discontinuous transmission mechanism, or the two is neither same.Wherein: the difference of SID frame coding mode can be embodied in the following aspects:
1, the difference of coding parameter
Different SID frames can adopt the parameter of different types of parameter and varying number;
2, the difference of identical parameters quantified precision
For the different SID frames of same parameters, can adopt different quantified precisions;
Therefore, the structure of the SID frame that different SID frame coding modes produce may be different with length, and for example: the coding parameter of stationary background noise signal noise can be relatively simple, and the quantified precision of parameter can be relatively low, the length of SID frame can be relatively short.Corresponding, the coding parameter of unstable state ambient noise signal can be relatively many, a kind of method is to increase the excitation parameters in speech coding parameters in existing SID frame, as adaptive codebook parameter (pitch delay and gain) and fixed codebook parameters (position of non-zero pulses and symbol and gain) etc., the quantified precision of parameter can be relatively high, as increasing quantified precision of existing SID frame median filter parameter and excitation parameters etc., the length of SID frame also can correspondingly increase.And for example, the SID frame of melodious ambient noise signal should be used the most complicated coded system (can be full-rate codes), the SID frame of environmental background noise signal is used time complicated coded system, and the SID frame of other ambient noise signal is used better simply coded system.
Different ground unrest types can also be used different discontinuous transmission mechanism, and the difference of the SID frame transmit mechanism of different background noise type is mainly reflected on the transmission intercal of adjacent two SID frames.For example, for stationary background noise signal, because the change of background is less, the Quality Down susceptibility after DTX/CNG is not high to such ground unrest for human auditory system, only needs with relatively long discontinuous transmission interval.For unstable state ambient noise signal, because background change is in time larger, in order to guarantee that such ground unrest can have a higher subjective quality after DTX/CNG, need to use relatively short discontinuous transmission interval or even continuous transmission.And for example, for melodious background, due to the non-Noise Characteristic of itself, the requirement of subjective quality is quite high, so DTX system just need to be carried out continuous transmission.For environmental background noise, due to itself, carried certain environmental information, in order to ensure not losing these information, DTX system also need to be upgraded SID frame with higher speed.For other ground unrest, due to true appearance the non-staple object of reduction background, so the renewal of carrying out SID frame that DTX system just can be relatively slow.
The transmission intercal of SID frame is divided into two kinds, constant transmissions interval and Adaptive Transmission interval.Under the mechanism at constant transmissions interval, the interval of adjacent two SID interframe is fixed, and the size at interval just depends on the type of ground unrest.Under the mechanism at Adaptive Transmission interval, although the transmission of SID frame is to be adaptive to the speed that noise changes, also there is the restriction of minimum time interval.In the embodiment of the present invention, according to different background noise type, can determine to adopt different minimum intervals.
As shown in Figure 6, the primary structure schematic diagram for the first communication facilities provided by the invention, comprising: VAD10, ground unrest code device 20, speech coder 30 and communication interface 40, wherein:
VAD10, for detection of efficient voice, sends audible signal into speech coder 30 and is encoded to speech frame, and communication interface sends speech frame by communication channel; Ambient noise signal is sent into ground unrest code device 20, ground unrest code device 20 output SID frames, communication interface sends SID frame by communication channel.
In ground unrest code device 20, coding switch unit 24 switches applicable DTX unit 21 according to the type of ambient noise signal.
As shown in Figure 7, the second communication facilities primary structure schematic diagram providing for the embodiment of the present invention, wherein: the coding switch unit 24 in ground unrest code device 20 is determined switch mode according to the instruction of command reception unit 25, and switch applicable DTX unit 21 according to the switch mode of the type of ambient noise signal and current use.
Shown in Figure 5, in the ground unrest code device 20 in the communication facilities shown in Fig. 6 and Fig. 7, speech coder can also be set, speech coder completes the full-rate codes of ground unrest.
As shown in Figure 8, if utilize existing speech coder 30 to complete the full-rate codes of ground unrest, the switch unit 24 of encoding is connected on speech coder 30, ambient noise signal is sent into speech frame the continuous wave output that is encoded to full rate in speech coder 30.
The technical scheme that adopts the embodiment of the present invention to provide, the coding send mode of corresponding each the SID frame of needs arranges corresponding DTX unit indication information in SID frame, and DTX unit indication information is determined corresponding decoding and reconstituting scheme for decoding end.For speech frame, by Voice decoder, decoded and continuous wave output.
Due to 22 pairs of background noise analysis unit, calculating the calculating of background characteristics parameter value calculates identical with the part of each DTX unit 21 when encoding SID frame, therefore background noise analysis unit 22 can also connect each DTX unit 21, analysis result is exported to each DTX unit 21, thereby part reduces the calculated amount of DTX unit 21.
As shown in Figure 9, a kind of communication apparatus construction schematic diagram providing for the embodiment of the present invention, the unit that completes receiving end function mainly comprises: ground unrest decoding device 50, recognition unit 60, decoding switch unit 70, Voice decoder 80 and communication interface, wherein:
This ground unrest decoding device 50 comprises at least two CNG unit (51,52...5n), and each CNG unit is according to a kind of decoding and reconstituting scheme decoding SID frame of setting and be reconstructed into ground unrest;
Recognition unit 60, for identifying speech frame and the SID frame receiving by communication interface, and further identifies the DTX unit indication information in SID frame;
Decoding switch unit 70, for switching according to the recognition result of recognition unit 60, sends speech frame into voice decipher 80, SID frame is sent into corresponding CNG unit and carry out decoding and reconstituting.
Wherein, consult shown in Fig. 1, each CNG unit comprises non-voice demoder, for the background noise frames of decoding and receiving from communication channel; Reconstructor, for adopting the mode of interpolation that decoded data is reconstructed into continuous ambient noise signal, to increase the comfort of listener.
If first recognition unit 60 identifies speech frame or SID frame and SID frame that the coded frame receiving is full-rate codes, further identify the DTX unit indication information in this SID frame., should have 1 or several for carrying the bit of DTX unit indication information in the code stream of SID frame here, bit bit position both can also can be at end in the beginning of SID frame, or for example, at other assigned address of SID frame, shown in following table 3 or table 4:
Table 3.
Bit distributes (from a high position to low level) Parametric description
S1 DTX unit indication information
S2 Quantitative prediction device index
S3-S7 One-level LSF quantisation vector index
S8-S11 Secondary LSF quantisation vector index
S12-S16 Gain (energy)
Table 4.
Bit distributes (from a high position to low level) Parametric description
S1 Quantitative prediction device index
S2-S6 One-level LSF quantisation vector index
S7-S10 Secondary LSF quantisation vector index
S11-S15 Gain (energy)
S16-S17 DTX unit indication information
In recognition unit 60, can comprise two recognin unit, a recognin unit is for identifying speech frame or the SID frame that the coded frame receiving is full-rate codes, if identify speech frame, notify decoding switch unit 70, decoding switch unit is sent speech frame into Voice decoder 80 decoding continuous wave outputs; If identifying is SID frame, SID frame is sent into another recognin unit and further identified the DTX unit indication information in SID frame.
Because different ground unrest types has different SID frame format or update mechanism, according to DTX unit indication information, select suitable DTX decoding scheme decoding SID frame, noise interpolating method in different decoding schemes is different, the comfort noise being finally restored out.
Communication facilities can possess transmitting terminal function, also concrete receiving end function, structure shown in Fig. 9 conventionally and the merging of structure shown in Fig. 7 or Fig. 8 be arranged in a communication facilities.
With specific embodiment, be elaborated again below.
Embodiment mono-
At coding side, ambient noise signal is inputted background noise analysis unit 22 frame by frame, background noise analysis unit 22 calculates the subband level level[k of each current demand signal frame, i], wherein k and i represent to be respectively the level of i frame k subband and to be kept in internal memory.The calculating of subband can be realized also and can be realized by the method for conversion by bank of filters.
When the enough somes of signal frame accumulation, background noise analysis unit 22 starts according to being kept at the subband level in internal memory, calculates ground unrest spectrum fluctuation parameters spec_var[i], and after upgrade result of calculation while receiving each frame background noise frames.
Ground unrest spectrum fluctuation parameters spec_var[i] computing method be:
spec _ var [ i ] = Σ m = i - L + 1 i ( Σ n = i - L + 1 , n ≠ m i ( Σ k = 1 N ( ( level [ k , m ] - level [ k , n ] ) / ( 1 L · N Σ m = i - L + 1 i Σ k = 1 N level [ k , i ] ) ) 2 ) )
The quantity that wherein N is subband, L is chosen as to do ambient noise signal number of frames average when long.Ground unrest spectrum fluctuation parameters spec_var[i] calculating can be also based on LSF coefficient, that is:
spec _ var [ i ] = Σ m = i - L + 1 i ( Σ n = i - L + 1 , n ≠ m i ( Σ k = 1 N ( ( lsf [ k , m ] - lsf [ k , n ] ) / ( 1 L · N Σ m = i - L + 1 i Σ k = 1 N lsf [ k , i ] ) ) 2 ) )
Lsf[k wherein, i] represent k lsf coefficient of i frame, N is lsf number of coefficients, L is chosen as to do ambient noise signal number of frames average when long.
Background noise analysis unit 22 exports the ground unrest spectrum fluctuation parameters calculating to ground unrest taxon 23.
Coding switch unit 24 is that current background noise frame is selected corresponding DTX unit according to the ground unrest type information of ground unrest taxon 23 outputs.
In the present embodiment, two kinds of background type have been defined: stationary background noise signal and unstable state ambient noise signal.Ground unrest taxon 23 compares and incorporates current background noise into in stable state or unstable state a kind of by ground unrest being composed to fluctuation parameters and thresholding spec_var_thr.If spec_var[i] < spec_var_thr, current background noise signal is classified as stationary background noise signal, otherwise is classified as unstable state ambient noise signal.
In the present embodiment, DTX system has two kinds of different programmes of work:
Scheme one adopts every 8 frames to send the discontinuous transmission mechanism of a SID frame, only comprises wave filter and energy parameter in SID frame;
Scheme two adopts every 3 frames to send the discontinuous transmission mechanism of a SID frame, and in SID frame, except comprising wave filter, energy parameter has also increased fixed codebook parameters (belonging to excitation parameters) outward.
When current background noise belongs to stable state class, coding switch unit 24 is selected programme of work one.When current background noise belongs to unstable state class, coding switch unit 24 is selected programme of work two.
During every kind of SID frame assembling, a DTX unit indication information is set in frame to state which kind of DTX unit this SID frame belongs to.In the present embodiment, DTX unit indication information can represent with 1 bit, and 0 and 1 represents respectively Yi HeDTX unit, DTX unit two.
In decoding end, first recognition unit 60 judges the DTX unit under this SID frame according to the DTX unit indication information in SID frame, and decoding switch unit 70 is inputted corresponding CNG unit 50 by SID frame and carried out decoding and reconstituting.Wherein:
If DTX unit indication information is 0, corresponding CNG unit obtains filter parameter and energy parameter and 8 ambient noise signal frames is thereafter carried out to interpolation according to the SID frame format decoding in above-mentioned DTX programme of work one;
If DTX unit indication information is 1, corresponding CNG unit obtains filter parameter, energy parameter and fixed codebook parameters and 3 ambient noise signal frames is thereafter carried out to interpolation according to the SID frame format decoding in above-mentioned DTX programme of work two.
Embodiment bis-
At coding side, ambient noise signal is inputed to background noise analysis unit 22 frame by frame, and background noise analysis unit 22 calculates the subband level level[k of current background noise signal frame, i], frame energy pow[i] and signal to noise ratio (S/N ratio) snr[i in short-term], and be kept in internal memory.Wherein:
Subband level level[k, i] calculating can realize also and can realize by the method for conversion by bank of filters;
The account form of frame energy is:
Figure BDA00001759760300171
wherein N represents total sub band number;
The account form of signal to noise ratio (S/N ratio) is in short-term:
Figure BDA00001759760300172
bckr_noise_pow[i wherein] ground unrest energy is estimated when long.
When the enough somes of ambient noise signal frame accumulation, background noise analysis unit 22 starts according to being kept at short-time characteristic parameter (the subband level in internal memory, frame energy, signal to noise ratio (S/N ratio) in short-term) calculate ground unrest energy hunting pow_var[i], signal to noise ratio (S/N ratio) snr_long[i when long], ground unrest energy is estimated bckr_noise_pow[i when long], and when receiving each ambient noise signal frame, upgrade result of calculation.
When long signal to noise ratio (S/N ratio), when the renewal of all the other calculation of parameter results occurs over just current demand signal frame and is ambient noise signal frame, when when long, the renewal of signal to noise ratio (S/N ratio) occurs over just current demand signal frame and is non-ambient noise signal frame.
Signal to noise ratio (S/N ratio) snr_long[i when long] computing method be:
Figure BDA00001759760300173
wherein L is chosen as to do non-ambient noise signal number of frames average when long.
Ground unrest energy hunting pow_var[i] computing method be:
pow _ var [ i ] = 1 L * &Sigma; m = i - L + 1 i ( pow [ m ] - 1 L * &Sigma; m = i - L + 1 i pow [ m ] ) 2 , Wherein L is chosen as to do ambient noise signal number of frames average when long.
When long, ground unrest energy is estimated bckr_noise_pow[i] computing method be:
Bckr_noise_pow[i]=(1-α) * bckr_noise_pow[i-1]+α * pow[i], wherein α is the scale factor of 0 to 1, in the present embodiment, value is 5%.
Background noise analysis unit 22 is by the ground unrest energy hunting pow_var[i calculating], signal to noise ratio (S/N ratio) snr_long[i when long] export ground unrest taxon 23 to.
In the present embodiment, four kinds of background type have been defined: the large SNR ambient noise signal of stable state, the little SNR ambient noise signal of stable state, the large SNR ambient noise signal of unstable state and the little SNR ambient noise signal of unstable state.
Ground unrest taxon 23 is by by pow_var[i] and snr_long[i] compare with an energy hunting thresholding pow_var_thr and a signal-noise ratio threshold snr_thr respectively, it is a class of 4 kinds of ground unrest type kinds that current background noise is incorporated into, wherein:
As pow_var[i] when > pow_var_thr and snr_long > snr_thr, current background is the large SNR ambient noise signal of unstable state;
As pow_var[i] when > pow_var_thr and snr_long < snr_thr, current background is the little SNR ambient noise signal of unstable state;
As pow_var[i] when < pow_var_thr and snr_long > snr_thr, current background is the large SNR ambient noise signal of stable state;
As pow_var[i] when < pow_var_thr and snr_long < snr_thr, current background is the little SNR ambient noise signal of stable state.
In the present embodiment, one has four kinds of different DTX unit, wherein:
DTX unit one adopts every 8 frames to send the discontinuous transmission mechanism of a SID frame, only comprises filter parameter and energy parameter in SID frame;
DTX unit two adopts every 8 frames to send the discontinuous transmission mechanism of a SID frame, also comprises fixed codebook parameters in SID frame except comprising filter parameter and energy parameter;
DTX unit three adopts every 3 frames to send the discontinuous transmission mechanism of a SID frame, also comprises fixed codebook parameters in SID frame except comprising filter parameter and energy parameter;
DTX unit four adopts the continuous transmission mechanism of full-rate codes.
Coding switch unit 24, except receiving the ground unrest type information of ground unrest taxon 23, is also connected with a switch mode steering order interface.Switch mode steering order has 5 kinds of indications, represent respectively that bandwidth is normal, bandwidth is more, bandwidth is less, bandwidth is a lot of and bandwidth seldom.
The switch mode relation of switch mode steering order and DTX system is as follows:
When switch mode steering order represents that bandwidth is normal, the switch mode of DTX system is: the large SNR class of stable state background is adopted to DTX unit one; The little SNR class of stable state background is adopted to DTX unit two; The large SNR class of unstable state background is adopted to DTX unit three; The little SNR class of unstable state background is adopted to DTX unit four.
When switch mode steering order represents that bandwidth is more, the switch mode of DTX system is: stable state large SNR class background and the little SNR class of stable state background are all adopted to DTX unit two; Unstable state large SNR class background and the little SNR class of unstable state background are all adopted to DTX unit four;
When switch mode steering order represents that bandwidth is less, the switch mode of DTX system is: stable state large SNR class background and the little SNR class of stable state background are all adopted to DTX unit one; Unstable state large SNR class background and the little SNR class of unstable state background are all adopted to DTX unit three;
When switch mode steering order represents that bandwidth is a lot, the switch mode of DTX system is: four kinds of ground unrest types are all adopted to DTX unit four;
When switch mode steering order represents that bandwidth seldom, the switch mode of DTX system is: four kinds of ground unrest types are all adopted to DTX unit one.
During every kind of SID frame assembling, a DTX unit indication information is set in frame to state which kind of DTX unit this SID frame belongs to.DTX unit indication information in the present embodiment can represent with 2 bits, wherein: 00,01 and 10 represent respectively Er HeDTX unit, Yi,DTX unit, DTX unit three, so scheme four due to be full-rate codes without sign out.
In decoding end, first recognition unit 60 judges the DTX unit under this SID frame according to the DTX unit indication information in SID frame, and indicates corresponding CNG unit decodes reconstruct, wherein:
If DTX unit indication information is 00, according to the SID frame format decoding in above-mentioned DTX programme of work one, obtains filter parameter and energy parameter and 8 ambient noise signal frames are thereafter carried out to interpolation;
If DTX unit indication information is 01, according to the SID frame format decoding in above-mentioned DTX programme of work two, obtain excitation parameters, filter parameter and energy parameter also carry out interpolation to 8 ambient noise signal frames thereafter.
If DTX unit indication information is 10, according to the SID frame format decoding in above-mentioned DTX programme of work three, obtains filter parameter, energy parameter and fixed codebook parameters and 3 ambient noise signal frames are thereafter carried out to interpolation.
Those skilled in the art can be flexibly according to signal characteristic parameter dividing background noise type, and sets switch mode, will not enumerate here.
In sum, the embodiment of the present invention provides a kind of coding method, comprising:
Obtain at least one signal characteristic parameter value of ambient noise signal to be sent;
According to described at least one signal characteristic parameter value, determine ground unrest type;
According to a kind of DTX mechanism of described ground unrest type selecting, ambient noise signal is encoded to SID frame the discontinuous transmission that comprises DTX mechanism indication information, wherein the SID frame coding mode in each DTX mechanism and/or the discontinuous send mode of SID frame are not identical.
Further, described coding method also comprises: set different DTX mechanism preference pattern, i.e. different switch mode, has at least DTX mechanism corresponding to ground unrest type not identical between any two DTX mechanism preference pattern; And according to selected DTX mechanism preference pattern, for each ground unrest type is determined corresponding DTX mechanism.
A nearlyer step, described method also comprises: when setting value that described at least one signal characteristic parameter value is, ambient noise signal is encoded to the speech frame of full rate and sends continuously.
Wherein, signal characteristic parameter value comprises one of following or combination in any: when long, SNR, ground unrest estimate that level, ground unrest energy hunting estimation, the fluctuation of ground unrest spectrum estimate and ground unrest vibration frequency.
The embodiment of the present invention also provides a kind of method that corresponding SID frame is decoded, and comprising:
DTX mechanism indication information in identification SID frame, wherein the SID frame coding mode in each DTX mechanism is not identical;
According to described DTX mechanism indication information, adopt corresponding decoding process decoding SID frame, and be reconstructed into ambient noise signal.
The technical scheme that the application embodiment of the present invention proposes, can be according to the actual state of the dissimilar and communication system of ground unrest, adopt different SID frame coding methods and discontinuous transmission mechanism, in the situation that rationally utilizing communication system real resource, be adaptive to the desired encoding precision of dissimilar ground unrest and time sense, reach under the prerequisite of rationally utilizing channel resource in the high-quality object that all kinds ground unrest is provided of decoding end.
Obviously, those skilled in the art can carry out various changes and modification and not depart from the spirit and scope of the present invention the present invention.Like this, if within of the present invention these are revised and modification belongs to the scope of the claims in the present invention and equivalent technologies thereof, the present invention is also intended to comprise these changes and modification interior.

Claims (31)

1. a ground unrest code device, is characterized in that, comprising:
Background noise analysis unit, at least one signal characteristic parameter value of background extraction noise signal; Described ambient noise signal comprises ground unrest, and described ground unrest comprises white noise, background brouhaha and quiet;
Ground unrest taxon, for determining ground unrest type according to described at least one signal characteristic parameter value;
Coding switch unit and at least two discontinuous transmission unit, described coding switch unit is for inputting corresponding discontinuous transmission unit according to described ground unrest type by ambient noise signal, each discontinuous transmission unit is for being encoded to by ambient noise signal silence description frames the discontinuous transmission that comprises discontinuous transmission unit indication information, wherein the silence description frames coded system of each discontinuous transmission unit and/or silence description frames discontinuous transmission mechanism are not identical, the silence description frames coded system of each discontinuous transmission unit is identical being specially not: the coding parameter of different silence description frames is not identical, or the quantified precision of the different silence description frames of same coding parameter is not identical, the silence description frames discontinuous transmission mechanism of each discontinuous transmission unit is identical being specially not: the transmission intercal of adjacent two silence description frames of transmission is not identical.
2. ground unrest code device as claimed in claim 1, it is characterized in that, also comprise: command reception unit, be used for receiving switch mode steering order and be transmitted to coding switch unit, described switch mode steering order is used to indicate coding switch unit and uses a kind of switch mode, has the discontinuous transmission unit that a ground unrest type is corresponding different between any two switch modes at least.
3. ground unrest code device as claimed in claim 1, it is characterized in that, also comprise: speech coder, wherein a kind of specific context noise type that described coding switch unit identifies according to ground unrest taxon, inputs speech coder by ambient noise signal and is encoded to speech frame continuous wave output.
4. the ground unrest code device as described in claim 1,2 or 3, is characterized in that, described discontinuous transmission unit specifically comprises:
Non-voice scrambler, for being encoded to silence description frames by ambient noise signal;
Silence description frames transmitter, for the described silence description frames of discontinuous transmission.
5. a ground unrest decoding device of the silence description frames of ground unrest code device output described in claim 1 being decoded, is characterized in that, comprising:
The first recognition unit, for identifying the discontinuous transmission unit indication information of silence description frames;
Decoding switch unit and at least two comfort noise generation units, described decoding switch unit is used for according to the recognition result of described recognition unit, described silence description frames is inputted to corresponding comfort noise generation unit, each comfort noise generation unit is according to the corresponding decoding of the coded system of the silence description frames receiving silence description frames, and reconstruct ambient noise signal.
6. ground unrest decoding device as claimed in claim 5, is characterized in that, also comprises:
The second recognition unit, described the second recognition unit is used for identifying speech frame and silence description frames, and described silence description frames is sent into the first recognition unit, or when identifying speech frame, notifies described decoding switch unit;
Voice decoder, the speech frame that described decoding switch unit identifies the second recognition unit is sent into Voice decoder decoding continuous wave output.
7. ground unrest decoding device as claimed in claim 6, is characterized in that, described the first recognition unit and the second recognition unit merge setting.
8. a communication facilities, comprising:
The first speech detector, for detection of whether comprising efficient voice classification output in the signal gathering;
The first speech coder, for being speech frame continuous wave output by the Signal coding that comprises efficient voice;
It is characterized in that, also comprise ground unrest code device, this ground unrest code device specifically comprises:
Background noise analysis unit, for obtaining at least one signal characteristic parameter value of the ambient noise signal that does not comprise efficient voice; Described ambient noise signal comprises ground unrest, and described ground unrest comprises white noise, background brouhaha and quiet;
Ground unrest taxon, for determining ground unrest type according to described at least one signal characteristic parameter value;
Coding switch unit and at least two discontinuous transmission unit, described coding switch unit is for inputting corresponding discontinuous transmission unit according to described ground unrest type by ambient noise signal, each discontinuous transmission unit is for being encoded to by ambient noise signal silence description frames the discontinuous transmission that comprises discontinuous transmission unit indication information, wherein the silence description frames coded system of each discontinuous transmission unit and/or silence description frames discontinuous transmission mechanism are not identical, the silence description frames coded system of each discontinuous transmission unit is identical being specially not: the coding parameter of different silence description frames is not identical, or the quantified precision of the different silence description frames of same coding parameter is not identical, the silence description frames discontinuous transmission mechanism of each discontinuous transmission unit is identical being specially not: the transmission intercal of adjacent two silence description frames of transmission is not identical.
9. communication facilities as claimed in claim 8, it is characterized in that, described ground unrest code device also comprises: command reception unit, be used for receiving switch mode steering order and be transmitted to coding switch unit, described switch mode steering order is used to indicate coding switch unit and uses a kind of switch mode, has the discontinuous transmission unit that a ground unrest type is corresponding different between any two switch modes at least.
10. communication facilities as claimed in claim 8, it is characterized in that, described coding switch unit also connects described the first speech coder, wherein a kind of specific context noise type that described coding switch unit identifies according to ground unrest taxon, inputs the first speech coder by ambient noise signal and is encoded to speech frame continuous wave output.
11. communication facilitiess as claimed in claim 8, it is characterized in that, described ground unrest code device also comprises: the second speech coder, wherein a kind of specific context noise type that described coding switch unit identifies according to ground unrest taxon, inputs the second speech coder by ambient noise signal and is encoded to speech frame continuous wave output.
12. communication facilitiess as described in claim 8,9,10 or 11, is characterized in that, described discontinuous transmission unit specifically comprises:
Non-voice scrambler, for being encoded to silence description frames according to the silence description frames coded system of setting by ambient noise signal;
Silence description frames transmitter, for sending described silence description frames according to the discontinuous transmission mechanism of setting.
13. communication facilitiess as claimed in claim 8, is characterized in that, also comprise:
The first recognition unit, for identifying the discontinuous transmission unit indication information of silence description frames;
Decoding switch unit and at least two comfort noise generation units, described decoding switch unit is used for according to the recognition result of described recognition unit, described silence description frames is inputted to corresponding comfort noise generation unit, each comfort noise generation unit is according to the corresponding decoding of the coded system of the silence description frames receiving silence description frames, and reconstruct ambient noise signal.
14. communication facilitiess as claimed in claim 13, is characterized in that, also comprise:
The second recognition unit, described the second recognition unit is used for identifying speech frame and silence description frames, and described silence description frames is sent into the first recognition unit, or when identifying speech frame, notifies described decoding switch unit;
Voice decoder, the speech frame that described decoding switch unit identifies the second recognition unit is sent into Voice decoder decoding continuous wave output.
15. communication facilitiess as claimed in claim 14, is characterized in that, described the first recognition unit and the second recognition unit merge setting.
16. 1 kinds of communication facilitiess, comprise ground unrest decoding device, it is characterized in that, described ground unrest decoding device comprises:
The first recognition unit, for identifying the discontinuous transmission unit indication information of silence description frames;
Decoding switch unit and at least two comfort noise generation units, described decoding switch unit is used for according to the recognition result of described recognition unit, described silence description frames is inputted to corresponding comfort noise generation unit, each comfort noise generation unit is according to the corresponding decoding of the coded system of the silence description frames receiving silence description frames, and reconstruct ambient noise signal; Described ambient noise signal comprises ground unrest, and described ground unrest comprises white noise, background brouhaha and quiet.
17. communication facilitiess as claimed in claim 16, is characterized in that, also comprise:
The second recognition unit, described the second recognition unit is used for identifying speech frame and silence description frames, and described silence description frames is sent into the first recognition unit, or when identifying silence description frames, notifies described decoding switch unit;
Voice decoder, the speech frame that described decoding switch unit identifies the second recognition unit is sent into Voice decoder and is decoded and export.
18. communication facilitiess as claimed in claim 17, is characterized in that, described the first recognition unit and the second recognition unit merge setting.
19. 1 kinds of ground unrest coding methods, is characterized in that, comprising:
At least one signal characteristic parameter value of background extraction noise signal; Described ambient noise signal comprises ground unrest, and described ground unrest comprises white noise, background brouhaha and quiet;
According to described at least one signal characteristic parameter value, determine ground unrest type;
According to a kind of discontinuous transmission mechanism of described ground unrest type selecting, ambient noise signal is encoded to silence description frames the discontinuous transmission that comprises discontinuous transmission mechanism indication information, wherein the silence description frames coded system in each discontinuous transmission mechanism and/or the discontinuous send mode of silence description frames are not identical, the silence description frames coded system of each discontinuous transmission unit is identical being specially not: the coding parameter of different silence description frames is not identical, or the quantified precision of the different silence description frames of same coding parameter is not identical, the silence description frames discontinuous transmission mechanism of each discontinuous transmission unit is identical being specially not: the transmission intercal of adjacent two silence description frames of transmission is not identical.
20. coding methods as claimed in claim 19, it is characterized in that, describedly according to a kind of discontinuous transmission mechanism of ground unrest type selecting, be: according to the corresponding relation between ground unrest type and discontinuous transmission mechanism in the discontinuous transmission mechanism preference pattern of setting, select, between any two switch modes, have at least discontinuous transmission mechanism corresponding to ground unrest type not identical.
21. coding methods as claimed in claim 19, is characterized in that, also comprise: when described at least one signal characteristic parameter value meets while imposing a condition, ambient noise signal is encoded to the speech frame of full rate and sends continuously.
22. coding methods as described in claim 19,20 or 21, it is characterized in that, described signal characteristic parameter comprises one of following or combination in any: when long, SNR, ground unrest estimate that level, ground unrest energy hunting estimation, the fluctuation of ground unrest spectrum estimate and ground unrest vibration frequency.
23. coding methods as claimed in claim 22, is characterized in that, according to the size of each signal characteristic parameter value, described ground unrest type comprises stationary background noise signal and unstable state ambient noise signal.
24. coding methods as claimed in claim 23, is characterized in that, when described signal characteristic parameter comprises when long SNR:
Described stationary background noise signal further comprises: the large SNR ambient noise signal of stable state and the little SNR ambient noise signal of stable state;
Described unstable state ambient noise signal further comprises: the large SNR ambient noise signal of unstable state and the little SNR ambient noise signal of unstable state.
25. coding methods as claimed in claim 19, is characterized in that, the difference between different silence description frames coded systems comprises: at least one silence description frames coding parameter is not identical, or the quantified precision of same silence description frames coding parameter is not identical.
26. coding methods as claimed in claim 25, is characterized in that, described silence description frames coding parameter comprises composite filter parameter and excitation parameters.
27. coding methods as claimed in claim 26, is characterized in that, described excitation parameters comprises adaptive codebook parameter and/or fixed codebook parameters.
28. coding methods as claimed in claim 19, is characterized in that, the difference between the discontinuous send mode of different silence description frames comprises: the transmission intercal between adjacent two silence description frames of transmission is not identical.
29. coding methods as claimed in claim 28, is characterized in that, described transmission intercal is determined according to the variation self-adaptation of background signal, and the minimum interval between the discontinuous send mode of different silence description frames is not identical.
30. coding methods as claimed in claim 28, is characterized in that, the transmission intercal of the discontinuous send mode of each silence description frames is not identical fixed value.
31. 1 kinds of methods that the silence description frames that method described in claim 19 is generated is decoded, is characterized in that, comprising:
Discontinuous transmission mechanism indication information in identification silence description frames, wherein the silence description frames coded system in each discontinuous transmission mechanism is not identical;
According to described discontinuous transmission mechanism indication information, adopt corresponding decoding process decoding silence description frames, and be reconstructed into ambient noise signal.
CN201210194567.8A 2007-06-05 2007-06-05 Background noise coding/decoding device and method as well as communication equipment Active CN102760441B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210194567.8A CN102760441B (en) 2007-06-05 2007-06-05 Background noise coding/decoding device and method as well as communication equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210194567.8A CN102760441B (en) 2007-06-05 2007-06-05 Background noise coding/decoding device and method as well as communication equipment

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN 200710110573 Division CN101320563B (en) 2007-06-05 2007-06-05 Background noise encoding/decoding device, method and communication equipment

Publications (2)

Publication Number Publication Date
CN102760441A CN102760441A (en) 2012-10-31
CN102760441B true CN102760441B (en) 2014-03-12

Family

ID=47054882

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210194567.8A Active CN102760441B (en) 2007-06-05 2007-06-05 Background noise coding/decoding device and method as well as communication equipment

Country Status (1)

Country Link
CN (1) CN102760441B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110010141B (en) * 2013-02-22 2023-12-26 瑞典爱立信有限公司 Method and apparatus for DTX smearing in audio coding
CN104978970B (en) 2014-04-08 2019-02-12 华为技术有限公司 A kind of processing and generation method, codec and coding/decoding system of noise signal
CN113593602B (en) * 2021-07-19 2023-12-05 深圳市雷鸟网络传媒有限公司 Audio processing method and device, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1168751A (en) * 1994-12-05 1997-12-24 诺基亚电信公司 Method for substituting bar speech frames in digital communication system
CN1327574A (en) * 1998-12-07 2001-12-19 三菱电机株式会社 Sound decoding device and sound decoding method
JP2003501925A (en) * 1999-06-07 2003-01-14 エリクソン インコーポレイテッド Comfort noise generation method and apparatus using parametric noise model statistics
CN101320563B (en) * 2007-06-05 2012-06-27 华为技术有限公司 Background noise encoding/decoding device, method and communication equipment

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1168751A (en) * 1994-12-05 1997-12-24 诺基亚电信公司 Method for substituting bar speech frames in digital communication system
CN1327574A (en) * 1998-12-07 2001-12-19 三菱电机株式会社 Sound decoding device and sound decoding method
JP2003501925A (en) * 1999-06-07 2003-01-14 エリクソン インコーポレイテッド Comfort noise generation method and apparatus using parametric noise model statistics
CN101320563B (en) * 2007-06-05 2012-06-27 华为技术有限公司 Background noise encoding/decoding device, method and communication equipment

Also Published As

Publication number Publication date
CN102760441A (en) 2012-10-31

Similar Documents

Publication Publication Date Title
CN101320563B (en) Background noise encoding/decoding device, method and communication equipment
CN1703737B (en) Method for interoperation between adaptive multi-rate wideband (AMR-WB) and multi-mode variable bit-rate wideband (VMR-WB) codecs
CN100508028C (en) Method and device for adding release delay frame to multi-frame coded by voder
CN1144180C (en) Method and apparatus for preforming reducer rate variable rate vocoding
CN1828722B (en) Complex signal activated detection for improved speech/noise classification of an audio signal
CN103620675B (en) To equipment, acoustic coding equipment, equipment linear forecast coding coefficient being carried out to inverse quantization, voice codec equipment and electronic installation thereof that linear forecast coding coefficient quantizes
EP1747554B1 (en) Audio encoding with different coding frame lengths
EP1719119B1 (en) Classification of audio signals
CN103620676B (en) To method, sound encoding system, the method for linear forecast coding coefficient being carried out to inverse quantization, voice codec method and recording medium that linear forecast coding coefficient quantizes
EP1747442B1 (en) Selection of coding models for encoding an audio signal
CN1969319B (en) Signal encoding
CN101131817A (en) Method and apparatus for robust speech classification
KR20080055026A (en) Method and apparatus for determining encoding mode of audio signal, and method and appartus for encoding/decoding audio signal using it
JP2004512562A (en) High frequency enhanced hierarchical coding in wideband speech codec decoder
CN103384900A (en) Low-delay sound-encoding alternating between predictive encoding and transform encoding
CN107293311A (en) Very short pitch determination and coding
CN104517612A (en) Variable-bit-rate encoder, variable-bit-rate decoder, variable-bit-rate encoding method and variable-bit-rate decoding method based on AMR (adaptive multi-rate)-NB (narrow band) voice signals
EP1617416B1 (en) Method and apparatus for subsampling phase spectrum information
CN102760441B (en) Background noise coding/decoding device and method as well as communication equipment
CN102254562B (en) Method for coding variable speed audio frequency switching between adjacent high/low speed coding modes
EP1035538B1 (en) Multimode quantizing of the prediction residual in a speech coder
US7295974B1 (en) Encoding in speech compression
Gournay et al. A 1200 bits/s HSX speech coder for very-low-bit-rate communications
KR20070017379A (en) Selection of coding models for encoding an audio signal
KR20080091305A (en) Audio encoding with different coding models

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant