CN101800050A - Audio fine scalable coding method and system based on perception self-adaption bit allocation - Google Patents

Audio fine scalable coding method and system based on perception self-adaption bit allocation Download PDF

Info

Publication number
CN101800050A
CN101800050A CN201010107402A CN201010107402A CN101800050A CN 101800050 A CN101800050 A CN 101800050A CN 201010107402 A CN201010107402 A CN 201010107402A CN 201010107402 A CN201010107402 A CN 201010107402A CN 101800050 A CN101800050 A CN 101800050A
Authority
CN
China
Prior art keywords
subband
perceptual
important degree
module
self
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201010107402A
Other languages
Chinese (zh)
Other versions
CN101800050B (en
Inventor
胡瑞敏
杨玉红
刘元元
陈冰
高丽
项慨
周超群
杭波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University WHU
Original Assignee
Wuhan University WHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University WHU filed Critical Wuhan University WHU
Priority to CN201010107402A priority Critical patent/CN101800050B/en
Publication of CN101800050A publication Critical patent/CN101800050A/en
Application granted granted Critical
Publication of CN101800050B publication Critical patent/CN101800050B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The invention relates to the technical field of audio coding, in particular to audio fine scalable coding method and system based on perception self-adaption bit allocation. The method comprises the following steps of: preprocessing input signals; carrying out subband division on frequency-domain signals; calculating the perception importance of each subband; uniformly sequencing the subband from small to large according to the perception importance; extracting the subband with the largest perception importance for scalable longitudinal vector quantization; and then carrying out self-adaption adjustment on the vector-quantized subband with the largest perception importance. The system comprises a preprocessing module, a subband division module, a perception importance calculating, sequencing and extracting module, a scalable quantizing and coding module, a self-adaption adjusting module and a scalable coding finish judging module. The invention realizes high-efficient fine scalable audio coding, preferably realizes the unification of quantization precision and quantization efficiency and also satisfies the requirement of high tone quality when improving coding efficiency.

Description

Audio fine scalable coding method and system based on the perception self-adaption bit distribution
Technical field
The present invention relates to technical field of audio, relate in particular to a kind of audio fine scalable coding method and system that distributes based on perception self-adaption bit.
Background technology
The scalable audio coding technology is divided into a core layer and a plurality of enhancement layer with code stream, wherein core layer guarantees the minimum reconstruction quality of signal, enhancement layer then improves reconstruction quality gradually by the mode that improves signal to noise ratio (S/N ratio) or extending bandwidth, and the enhancing number of plies that receives is many more, and decoding tonequality is high more.
Scalable encoding can be by directly abandoning enhancement layer bitstream to adapt to network bandwidth fluctuation, and the meticulous more network bandwidth that just can effectively adapt to more of partition size fluctuates; On the other hand, the objective criteria that scalable audio coding performance quality is estimated is the perception signal to noise ratio (S/N ratio) of each hierarchical layer, and the subjective assessment standard also is each hierarchical layer decoded signal perceived quality.Therefore determine the steady perception self-adaption bit allocative decision that promotes of perceived quality of each hierarchical layer that graduated encoding performance quality is played crucial effects.
The more representative method of existing fine and classified audio coding method is the optimal bit distribution method that Moving Picture Experts Group-1 in 1994 adopts, and the frequency domain subband gradable method of the encoding and decoding speech standard of new generation that proposes of ITU-T in 2006 in G.729EV.
The optimal bit distribution method evenly is divided into a plurality of subbands with frequency-region signal, sort according to subband perceptual important degree, adopt by the most important subband of 5 bit quantization method coding perceptibility, and carry out the adjustment of subband perceptual important degree, feedback is proceeded the ordering of subband perceptual important degree and is pursued bit quantization, finish up to Bit Allocation in Discrete end or whole sub-band coding, what wherein pursue the 5 bit quantization method employing is scalar quantization, and the subband most important information is carried out quantization encoding.The optimal bit distribution method has guaranteed the raising of coding quality, but because scalar quantization itself quantizes the compression defective, and this method has limited the raising of quantitative efficiency to a certain extent, low code check in can't being applicable to.
G.729EV the standard enhancement layer is 32 subbands with division of signal, adopt the criterion of estimating of perceptual importance, each subband is sorted by the perceptual important degree, result and distributable bit number according to ordering are determined the optimum bit allocative decision, each subband MDCT coefficient is divided the sphere vector quantization, G.729EV the Bit distribution method of standard enhancement layer coding employing is not optimum, the bit number of each subband of encoding is wasteful, under the few situation of bit number, this Bit distribution method can only instruct scrambler quantization encoding minority subband, and most of sub-band information will be lost fully, though this method has significant quantitative efficiency, but there is the phenomenon of Bit Allocation in Discrete inequality and waste bits, cause some subband bit serious waste, some subband bit famine, thus the raising of tonequality finally influenced.
From above technology, current fine and classified audio coding is in two extreme states, and quantitative efficiency and partition size can not have a Rational Unified Process preferably, and the method partition size that quantitative efficiency is high is just low, and the partition size that quantitative efficiency is low is just high.
Summary of the invention
The purpose of this invention is to provide a kind of audio fine scalable coding method and system that distributes based on perception self-adaption bit, with with perception self-adaption bit piece allocative decision and the combination of high-effective classifying vector quantization technology, realize fine and classified efficiently audio coding, realize the unification of quantified precision and quantitative efficiency preferably.
For achieving the above object, the present invention adopts following technical scheme:
A kind of audio fine scalable coding method that distributes based on perception self-adaption bit may further comprise the steps:
Step is 1.: input signal is carried out pre-service, and wherein pre-service comprises that input signal is carried out perceptual weighting to be handled and the time-frequency change process, obtains the signal frequency-domain representation after above-mentioned pre-service;
Step is 2.: carry out sub-band division to above-mentioned through the frequency-region signal that obtains after the pre-service, according to the method for even division whole frequency domain is divided into N subband, wherein N 〉=1;
Step is 3.: calculate the perceptual important degree of each subband, and unify the antithetical phrase tape sort according to the perceptual important degree according to order from big to small, extract the subband of perceptual important degree maximum;
Step is 4.: according to the subband of perceptual important degree maximum, carry out gradable vertical vector quantization;
Step is 5.: the maximum important perception importance degree subband behind the vector quantization is carried out the self-adaptation adjustment;
Step is 6.: judge whether gradable quantification number of times arrives maximum times in the whole quantizing process, if do not reach maximum times, then return step 3., if reach maximum times, then finish hierarchical coding.
Described step 3. in, if with the perceptual important degree criterion of sub belt energy as each subband, the spectrum energy that then calculates each subband and comprised; If with amplitude as perceptual important degree criterion, the spectrum amplitude that then calculates each subband and comprised.
Definition VQ_rank (k) is the quantification gradation of k subband, and to its initialization assignment is:
VQ_rank(0)=VQ_rank(1)...=VQ_rank(N-1)=0
K=0 wherein, 1 ... .N-1, the sub-band sum of N for dividing, N 〉=1;
The subband k of the perceptual important degree maximum that obtains is carried out the vector quantization of VQ_rank (k) level, give the frequency spectrum vector Y kDistribute the R bit, the vector after obtaining quantizing
Figure GSA00000029300100021
Wherein R value size is by the partition size S decision of scalable coder.
Definition QmaxBe maximum gradable number of times in the signal quantization process, its initialization Q=1 calculates
Figure GSA00000029300100022
The perceptual important degree And to Y k, VQ_rank (k) and ip (k) carry out following self-adaptation to be revised:
Y k = Y k - Y ^ k
VQ_rank(k)=VQ_rank(k)+1
ip ( k ) = ip ( k ) - ip ( k ) ^
Q=Q+1
Wherein, 0≤k≤N-1.
A kind of audio fine scalable coding system that distributes based on perception self-adaption bit comprises:
Pretreatment module is used for input signal is carried out pre-service, and wherein pre-service comprises that input signal is carried out perceptual weighting to be handled and the time-frequency change process, obtains the signal frequency-domain representation after above-mentioned pre-service;
The sub-band division module is used for the above-mentioned frequency-region signal that obtains after handling through pretreatment module is carried out sub-band division, according to the method for even division whole frequency domain is divided into N subband, wherein N 〉=1;
Subband perceptual important degree calculates ordering and extraction module, is used to calculate the perceptual important degree of each subband, and unifies the antithetical phrase tape sort according to the perceptual important degree according to order from big to small, extracts the subband of perceptual important degree maximum;
The scalar quantization coding module is used for the subband according to perceptual important degree maximum, carries out gradable vertical vector quantization;
The self-adaptation adjusting module is used for the maximum important perception importance degree subband behind the scalar quantization coding module vector quantization is carried out the self-adaptation adjustment;
Hierarchical coding finishes judge module, is used for judging whether the gradable quantification number of times of whole quantizing process arrives maximum times, and whether decision finishes hierarchical coding.
The perceptual weighting submodule is used for that input signal is carried out perceptual weighting and handles;
The time-frequency conversion submodule is used for that the signal after the perceptual weighting processing is carried out time-frequency conversion and handles.
Subband perceptual important degree calculates the ordering submodule, is used to calculate the perceptual important degree of each subband, and unifies the antithetical phrase tape sort according to the perceptual important degree according to order from big to small;
Perceptual important degree extraction module is used for the subband to the subband extraction perceptual important degree maximum after the ordering of subband perceptual important degree calculating ordering submodule.
The present invention has the following advantages and good effect:
1) with perception self-adaption bit piece allocative decision and the combination of high-effective classifying vector quantization technology, realizes fine and classified efficiently audio coding, realized the unification of quantified precision and quantitative efficiency preferably;
2) the present invention is that criterion antithetical phrase band carries out gradable vector quantification from people's ear apperceive characteristic with the perceptual important degree, has improved effectiveness of classification, has also satisfied the demand of high tone quality when improving code efficiency.
Description of drawings
Fig. 1 is the process flow diagram of the audio fine scalable coding method that distributes based on perception self-adaption bit provided by the invention.
Fig. 2 is sub-band division first synoptic diagram of the audio fine scalable coding method that distributes based on perception self-adaption bit provided by the invention.
Fig. 3 is sub-band division second synoptic diagram of the audio fine scalable coding method that distributes based on perception self-adaption bit provided by the invention.
Fig. 4 is the application synoptic diagram of the audio fine scalable coding system that distributes based on perception self-adaption bit provided by the invention.
Embodiment
The present invention mainly is that the perceptual important degree with subband is a criterion, the audio fine scalable coding method and the system that distribute based on perception self-adaption bit of proposition.
The present invention is relatively with the disposable the highest subband of perceptual important degree of distributing to of bit, increased effectiveness of classification, and relatively by the method for Bit Allocation in Discrete, improved code efficiency, from people's ear apperceive characteristic, be criterion with the perceptual important degree, the antithetical phrase band carries out gradable vector quantification, improve effectiveness of classification, described the present invention below respectively in conjunction with the accompanying drawings in detail.
The audio fine scalable coding method that distributes based on perception self-adaption bit provided by the invention specifically may further comprise the steps, and as shown in Figure 1, comprising:
Step 1: input signal is carried out pre-service, and wherein pre-service comprises that input signal is carried out perceptual weighting to be handled and the time-frequency change process, obtains the signal frequency-domain representation after above-mentioned pre-service;
Step 2: carry out sub-band division to above-mentioned through the frequency-region signal that obtains after the pre-service, whole frequency domain is divided into N subband, wherein N 〉=1 according to the method for even division;
Step 3: calculate the perceptual important degree of each subband, and unify the antithetical phrase tape sort according to order from big to small, extract the subband of perceptual important degree maximum according to the perceptual important degree;
The perceptual important degree criterion difference of concrete signal, if with the perceptual important degree criterion of sub belt energy as each subband, the spectrum energy that then calculates each subband and comprised; If with amplitude as perceptual important degree criterion, the spectrum amplitude that then calculates each subband and comprised;
The perceptual important degree that defines each subband is ip (k), k=0,1...N-1; According to the perceptual important degree size of calculating gained, the ordering of perceptual important degree is carried out in each subband unification, extract subband ip (k)=E (the k)=Max (ip (j)) of perceptual important degree maximum, wherein k=0,1 ... .N-1, j=0,1,2 ... N-1, the sub-band sum of N for dividing;
Step 4:, carry out gradable vertical vector quantization according to the subband of perceptual important degree maximum; This step further can comprise following substep:
1. defining VQ_rank (k) is the quantification gradation of k subband, and to its initialization assignment is:
VQ_rank(0)=VQ_rank(1)...=VQ_rank(N-1)=0
K=0 wherein, 1 ... .N-1, the sub-band sum of N for dividing, N 〉=1;
2. the subband k of the perceptual important degree maximum that obtains is carried out the vector quantization of VQ_rank (k) level, give the frequency spectrum vector Y kDistribute the R bit, the vector after obtaining quantizing
Figure GSA00000029300100051
Wherein R value size is by the partition size S decision of scalable coder;
Step 5: the maximum important perception importance degree subband behind the vector quantization is carried out the self-adaptation adjustment; These step concrete operations are as follows:
Definition Q MaxBe maximum gradable number of times in the signal quantization process, its initialization Q=1 calculates
Figure GSA00000029300100052
The perceptual important degree
Figure GSA00000029300100053
And to Y k, VQ_rank (k) and ip (k) carry out following self-adaptation to be revised:
Y k = Y k - Y ^ k
VQ_rank(k)=VQ_rank(k)+1
ip ( k ) = ip ( k ) - ip ( k ) ^
Q=Q+1
Wherein, 0≤k≤N-1;
Step 6: judge whether gradable quantification number of times arrives maximum times in the whole quantizing process,, then return step 3,, then finish hierarchical coding if reach maximum times if do not reach maximum times.
The audio fine scalable coding system that distributes based on perception self-adaption bit provided by the invention comprises with lower module:
1. pretreatment module is used for input signal is carried out pre-service, and wherein pre-service comprises that input signal is carried out perceptual weighting to be handled and the time-frequency change process, obtains the signal frequency-domain representation after above-mentioned pre-service;
Pretreatment module further comprises perceptual weighting submodule, time-frequency conversion submodule,
The perceptual weighting submodule is used for that input signal is carried out perceptual weighting and handles;
The time-frequency conversion submodule is used for that the signal after the perceptual weighting processing is carried out time-frequency conversion and handles;
2. sub-band division module is used for the above-mentioned frequency-region signal that obtains after handling through pretreatment module is carried out sub-band division, according to the method for even division whole frequency domain is divided into N subband, wherein N 〉=1;
3. subband perceptual important degree calculates ordering and extraction module, is used to calculate the perceptual important degree of each subband, and unifies the antithetical phrase tape sort according to the perceptual important degree according to order from big to small, extracts the subband of perceptual important degree maximum;
This module comprises that further subband perceptual important degree calculates ordering submodule, perceptual important degree extraction module:
Subband perceptual important degree calculates the ordering submodule, is used to calculate the perceptual important degree of each subband, and unifies the antithetical phrase tape sort according to the perceptual important degree according to order from big to small;
Perceptual important degree extraction module is used for the subband to the subband extraction perceptual important degree maximum after the ordering of subband perceptual important degree calculating ordering submodule;
4. the scalar quantization coding module is used for the subband according to perceptual important degree maximum, carries out gradable vertical vector quantization;
5. the self-adaptation adjusting module is used for the maximum important perception importance degree subband behind the scalar quantization coding module vector quantization is carried out the self-adaptation adjustment;
6. hierarchical coding finishes judge module, is used for judging whether the gradable quantification number of times of whole quantizing process arrives maximum times, and whether decision finishes hierarchical coding.
Further the invention will be further described in conjunction with the accompanying drawings with specific embodiment below:
Step 1: input signal is carried out pre-service, and pre-service specifically comprises perceptual weighting and two processes of time-frequency conversion;
1. input signal is sent into perceptual weighting filter M LB(z), while γ 1', γ 2' and γ 3' (0<γ 1', γ 2, ' γ 3'<1) three also corresponding adjustment of coefficient are to relax quantization noise spectrum:
W LB ( z ) = A ^ ( Z / γ 1 ′ ) A ^ ( z / γ 2 ′ ) ( 1 + Σ i = 1 2 a i γ 3 ′ i z - i )
γ wherein 1', γ 2', γ 3' for adjusting parameter, a iBe the linear prediction analysis coefficient, i is the exponent number of linear prediction, A ^ ( z ) = a ^ 0 + a ^ 1 z - 1 + · · · + a ^ 10 z - 10 .
2. time-frequency conversion is that time-domain signal is transformed into frequency domain, obtains the spectrum expression of sound signal, and present embodiment adopts the MDCT conversion.
Step 2: the frequency-region signal behind the time-frequency conversion is carried out spectral sub-bands divide, suppose entire spectrum evenly is divided into 64 subbands herein;
Fig. 2 is for evenly being divided into the synoptic diagram of 8 subbands, and transverse axis is represented subband frequency domain division scope, and the longitudinal axis is represented frequency domain energy amplitude, and its medium and low frequency core layer coding is basis of the present invention, not in limit of consideration of the present invention; The subband that comes out according to residual computations is used numeral " 1 " to arrive " 8 " in the drawings and is indicated respectively, and wherein subband 1, subband 2, subband 3 and subband 4 are low frequency audio sub-bands; Subband 5, subband 6, subband 7 and subband 8 are high-frequency audio subbands; The division of 64 subbands and 8 sub-band division are in like manner;
Step 3: suppose the measurement standard of the energy of each subband herein as subband perceptual important degree, calculate the energy that each subband comprised of 64 subbands, and sort from big to small according to the energy size, extract the subband of perceptual important degree maximum, embodiment is:
1. defining ip (k) is the perceptual important degree of k subband, and E (k) is k the spectrum energy that subband comprised, and calculates the energy of each subband with following formula:
ip ( k ) = E ( k ) = Y k 2
K=0 wherein, 1...63, Y kIt is the MDCT spectral coefficient that k subband comprises;
2. the energy size of calculating each subband of gained according to following formula is the measurement standard of perceptual important degree, and the ordering of perceptual important degree is carried out in each subband unification, and the subband of extraction perceptual important degree maximum is sent into step 4 and carried out vector quantization, specifically is expressed as:
ip(k)=Max(ip(j))
Wherein, 0≤k≤63, j=0 ..., 63;
Step 4: the subband of the perceptual important degree maximum that obtains according to step 3, according to this subband is carried out vertical vector quantization, suppose that here k subband is the subband of perceptual important degree maximum, concrete embodiment is:
1. defining VQ_rank (k) is the quantification gradation of k subband, and to its initialization assignment is:
VQ_rank(0)=VQ_rank(1)...=VQ_rank(63)=0
K=0 wherein, 1 ... .63, the sub-band sum of N for dividing;
2. the subband k of the perceptual important degree maximum that obtains is carried out VQ_rank (k)=0 grade vector quantization, give the quantization vector Y of this subband kDistribute R bit, wherein R partition size is as required adjusted, and weighs between quantitative efficiency and partition size, and to handle frame length 20ms, partition size 1kbps is an example, and then R is 20 bits, the vector after obtaining quantizing
Figure GSA00000029300100071
Step 5: the subband k behind step 4 vector quantization is carried out the self-adaptation finishing, specifically implement as follows:
Suppose Q Max=10 are maximum gradable number of times in the signal quantization process, its initialization Q=1;
Calculate
Figure GSA00000029300100072
The perceptual important degree And to Y k, VQ_rank (k) and ip (k) carry out following self-adaptation to be revised, that is:
Y k = Y k - Y ^ k
VQ_rank(k)=VQ_rank(k)+1
ip ( k ) = ip ( k ) - ip ( k ) ^
Q=Q+1
0≤k≤63 wherein;
Step 6: whether the gradable quantification number of times Q after judgement carry out step 5 is greater than Q MaxIf, greater than then finish hierarchical coding, if not greater than Q MaxThen proceed step 3.
Fig. 3 is 8 subband bit allocation amounts synoptic diagram, and transverse axis is represented subband frequency domain division scope, and the longitudinal axis is represented frequency domain energy amplitude, and its medium and low frequency core layer coding is basis of the present invention, not in limit of consideration of the present invention; Enhancement layer evenly is divided into 8 subbands, according to each sub belt energy amplitude relatively, finds the 6th sub belt energy maximum, this subband of encoding vector block 1, adjust the 6th sub belt energy; Rearrangement sub belt energy amplitude is found the 1st sub belt energy maximum, this subband of encoding vector block 2; By that analogy, the 1st to 18 vector block of encoding respectively.
The binaural signal imported among Fig. 4 through mix down, resume module such as pre-service, low pass and high-pass filtering obtain low strap residual signals and high band signal.Low strap residual signals and high band signal obtain output code flow output as the input of graduated encoding module through method scalar quantization provided by the invention.
Fig. 4 is the application of content of the present invention in whole audio coding framework, wherein graduated encoding vector quantization 30 is realized the position of fine granulation hierarchical coding for the present invention, with content application of the present invention in the gradable vector quantification of coding framework, instruct audio coding, improve quantitative efficiency and quantified precision.

Claims (7)

1. an audio fine scalable coding method that distributes based on perception self-adaption bit is characterized in that, may further comprise the steps:
Step is 1.: input signal is carried out pre-service, and wherein pre-service comprises that input signal is carried out perceptual weighting to be handled and the time-frequency change process, obtains the signal frequency-domain representation after above-mentioned pre-service;
Step: carry out sub-band division to above-mentioned through the frequency-region signal that obtains after the pre-service, whole frequency domain is divided into N subband, wherein N 〉=1 according to the method for even division;
Step is 3.: calculate the perceptual important degree of each subband, and unify the antithetical phrase tape sort according to the perceptual important degree according to order from big to small, extract the subband of perceptual important degree maximum;
Step is 4.: according to the subband of perceptual important degree maximum, carry out gradable vertical vector quantization;
Step is 5.: the maximum important perception importance degree subband behind the vector quantization is carried out the self-adaptation adjustment;
Step is 6.: judge whether gradable quantification number of times arrives maximum times in the whole quantizing process, if do not reach maximum times, then return step 3., if reach maximum times, then finish hierarchical coding.
2. the audio fine scalable coding method that distributes based on perception self-adaption bit according to claim 1 is characterized in that:
Described step 3. in, if with the perceptual important degree criterion of sub belt energy as each subband, the spectrum energy that then calculates each subband and comprised; If with amplitude as perceptual important degree criterion, the spectrum amplitude that then calculates each subband and comprised.
3. the audio fine scalable coding method that distributes based on perception self-adaption bit according to claim 1 is characterized in that,
4. described step further comprises following substep:
Definition VQ_rank (k) is the quantification gradation of k subband, and to its initialization assignment is:
VQ_rank(0)=VQ_rank(1)...=VQ_rank(N-1)=0
K=0 wherein, 1 ... .N-1, the sub-band sum of N for dividing, N 〉=1;
The subband k of the perceptual important degree maximum that obtains is carried out the vector quantization of VQ_rank (k) level, give the frequency spectrum vector Y kDistribute the R bit, the vector after obtaining quantizing
Figure FSA00000029290000011
Wherein R value size is by the partition size S decision of scalable coder.
4. the audio fine scalable coding method that distributes based on perception self-adaption bit according to claim 3 is characterized in that 5. described step further comprises following substep:
Definition Q MaxBe maximum gradable number of times in the signal quantization process, its initialization Q=1 calculates
Figure FSA00000029290000012
The perceptual important degree And to Y k, VQ_rank (k) and ip (k) carry out following self-adaptation to be revised:
Y k = Y k - Y k ^
VQ_rank(k)=VQ_rank(k)+1
ip ( k ) = ip ( k ) - ip ( k ) ^
Q=Q+1
Wherein, 0≤k≤N-1.
5. an audio fine scalable coding system that distributes based on perception self-adaption bit is characterized in that, comprising:
Pretreatment module is used for input signal is carried out pre-service, and wherein pre-service comprises that input signal is carried out perceptual weighting to be handled and the time-frequency change process, obtains the signal frequency-domain representation after above-mentioned pre-service;
The sub-band division module is used for the above-mentioned frequency-region signal that obtains after handling through pretreatment module is carried out sub-band division, according to the method for even division whole frequency domain is divided into N subband, wherein N 〉=1;
Subband perceptual important degree calculates ordering and extraction module, is used to calculate the perceptual important degree of each subband, and unifies the antithetical phrase tape sort according to the perceptual important degree according to order from big to small, extracts the subband of perceptual important degree maximum;
The scalar quantization coding module is used for the subband according to perceptual important degree maximum, carries out gradable vertical vector quantization;
The self-adaptation adjusting module is used for the maximum important perception importance degree subband behind the scalar quantization coding module vector quantization is carried out the self-adaptation adjustment;
Hierarchical coding finishes judge module, is used for judging whether the gradable quantification number of times of whole quantizing process arrives maximum times, and whether decision finishes hierarchical coding.
6. the audio fine scalable coding system that distributes based on perception self-adaption bit according to claim 5 is characterized in that,
Described pretreatment module further comprises:
The perceptual weighting submodule is used for that input signal is carried out perceptual weighting and handles;
The time-frequency conversion submodule is used for that the signal after the perceptual weighting processing is carried out time-frequency conversion and handles.
7. according to claim 5 or the 6 described audio fine scalable coding systems that distribute based on perception self-adaption bit, it is characterized in that described subband perceptual important degree calculates ordering and extraction module further comprises:
Subband perceptual important degree calculates the ordering submodule, is used to calculate the perceptual important degree of each subband, and unifies the antithetical phrase tape sort according to the perceptual important degree according to order from big to small;
Perceptual important degree extraction module is used for the subband to the subband extraction perceptual important degree maximum after the ordering of subband perceptual important degree calculating ordering submodule.
CN201010107402A 2010-02-03 2010-02-03 Audio fine scalable coding method and system based on perception self-adaption bit allocation Expired - Fee Related CN101800050B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010107402A CN101800050B (en) 2010-02-03 2010-02-03 Audio fine scalable coding method and system based on perception self-adaption bit allocation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010107402A CN101800050B (en) 2010-02-03 2010-02-03 Audio fine scalable coding method and system based on perception self-adaption bit allocation

Publications (2)

Publication Number Publication Date
CN101800050A true CN101800050A (en) 2010-08-11
CN101800050B CN101800050B (en) 2012-10-10

Family

ID=42595697

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010107402A Expired - Fee Related CN101800050B (en) 2010-02-03 2010-02-03 Audio fine scalable coding method and system based on perception self-adaption bit allocation

Country Status (1)

Country Link
CN (1) CN101800050B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101950562A (en) * 2010-11-03 2011-01-19 武汉大学 Hierarchical coding method and system based on audio attention
CN106796800A (en) * 2014-07-28 2017-05-31 弗劳恩霍夫应用研究促进协会 The audio coder and decoder of the cross processing device using frequency domain processor, Time Domain Processing device and for continuous initialization
CN107424622A (en) * 2014-06-24 2017-12-01 华为技术有限公司 Audio coding method and device
CN112037802A (en) * 2020-05-08 2020-12-04 珠海市杰理科技股份有限公司 Audio coding method and device based on voice endpoint detection, equipment and medium
CN112037803A (en) * 2020-05-08 2020-12-04 珠海市杰理科技股份有限公司 Audio encoding method and device, electronic equipment and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100261254B1 (en) * 1997-04-02 2000-07-01 윤종용 Scalable audio data encoding/decoding method and apparatus
ES2305852T3 (en) * 2003-10-10 2008-11-01 Agency For Science, Technology And Research PROCEDURE FOR CODING A DIGITAL SIGNAL IN A SCALABLE BINARY FLOW, PROCEDURE FOR DECODING A SCALABLE BINARY FLOW.
CN101609674B (en) * 2008-06-20 2011-12-28 华为技术有限公司 Method, device and system for coding and decoding
CN101499279B (en) * 2009-03-06 2011-11-02 武汉大学 Bit distribution method and apparatus with progressively fine spacing parameter

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101950562A (en) * 2010-11-03 2011-01-19 武汉大学 Hierarchical coding method and system based on audio attention
CN107424622A (en) * 2014-06-24 2017-12-01 华为技术有限公司 Audio coding method and device
CN107424622B (en) * 2014-06-24 2020-12-25 华为技术有限公司 Audio encoding method and apparatus
US11074922B2 (en) 2014-06-24 2021-07-27 Huawei Technologies Co., Ltd. Hybrid encoding method and apparatus for encoding speech or non-speech frames using different coding algorithms
CN106796800A (en) * 2014-07-28 2017-05-31 弗劳恩霍夫应用研究促进协会 The audio coder and decoder of the cross processing device using frequency domain processor, Time Domain Processing device and for continuous initialization
CN112037802A (en) * 2020-05-08 2020-12-04 珠海市杰理科技股份有限公司 Audio coding method and device based on voice endpoint detection, equipment and medium
CN112037803A (en) * 2020-05-08 2020-12-04 珠海市杰理科技股份有限公司 Audio encoding method and device, electronic equipment and storage medium
CN112037802B (en) * 2020-05-08 2022-04-01 珠海市杰理科技股份有限公司 Audio coding method and device based on voice endpoint detection, equipment and medium
CN112037803B (en) * 2020-05-08 2023-09-29 珠海市杰理科技股份有限公司 Audio encoding method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN101800050B (en) 2012-10-10

Similar Documents

Publication Publication Date Title
US7613603B2 (en) Audio coding device with fast algorithm for determining quantization step sizes based on psycho-acoustic model
EP2346030B1 (en) Audio encoder, method for encoding an audio signal and computer program
KR100949232B1 (en) Encoding device, decoding device and methods thereof
CN102081927B (en) Layering audio coding and decoding method and system
CN102341852B (en) Filtering speech
CN101283407B (en) Transform coder and transform coding method
CN101199005B (en) Post filter, decoder, and post filtering method
US9009036B2 (en) Methods and systems for bit allocation and partitioning in gain-shape vector quantization for audio coding
CN102511062B (en) Allocation of bits in an enhancement coding/decoding for improving a hierarchical coding/decoding of digital audio signals
US8099275B2 (en) Sound encoder and sound encoding method for generating a second layer decoded signal based on a degree of variation in a first layer decoded signal
JP6980871B2 (en) Signal coding method and its device, and signal decoding method and its device
JP7280306B2 (en) Apparatus and method for MDCT M/S stereo with comprehensive ILD with improved mid/side determination
CN101800050B (en) Audio fine scalable coding method and system based on perception self-adaption bit allocation
CN103928030A (en) Gradable audio coding system and method based on sub-band space attention measure
CN104025189A (en) Method for encoding voice signal, method for decoding voice signal, and apparatus using same
KR102386738B1 (en) Signal encoding method and apparatus, and signal decoding method and apparatus
CN101281748B (en) Method for filling opening son (sub) tape using encoding index as well as method for generating encoding index
CN101192410B (en) Method and device for regulating quantization quality in decoding and encoding
CN101950562A (en) Hierarchical coding method and system based on audio attention
CN102074243B (en) Bit plane based perceptual audio hierarchical coding system and method
CN101562015A (en) Audio-frequency processing method and device
CN105957533B (en) Voice compression method, voice decompression method, audio encoder and audio decoder
KR20150032220A (en) Signal encoding method and apparatus and signal decoding method and apparatus
JP2003233397A (en) Device, program, and data transmission device for audio encoding
CN103503065A (en) Method and a decoder for attenuation of signal regions reconstructed with low accuracy

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20121010

Termination date: 20150203

EXPY Termination of patent right or utility model