CN102420987A - Self-adaption bit distribution method based on code rate control of hierarchical B frame structure - Google Patents

Self-adaption bit distribution method based on code rate control of hierarchical B frame structure Download PDF

Info

Publication number
CN102420987A
CN102420987A CN2011103908715A CN201110390871A CN102420987A CN 102420987 A CN102420987 A CN 102420987A CN 2011103908715 A CN2011103908715 A CN 2011103908715A CN 201110390871 A CN201110390871 A CN 201110390871A CN 102420987 A CN102420987 A CN 102420987A
Authority
CN
China
Prior art keywords
picture frame
time horizon
gop
frame
bit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011103908715A
Other languages
Chinese (zh)
Inventor
沈礼权
姜益中
王向前
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Shanghai for Science and Technology
Original Assignee
University of Shanghai for Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Shanghai for Science and Technology filed Critical University of Shanghai for Science and Technology
Priority to CN2011103908715A priority Critical patent/CN102420987A/en
Publication of CN102420987A publication Critical patent/CN102420987A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention discloses a self-adaption bit distribution method based on code rate control of a hierarchical B frame structure. The self-adaption bit distribution method comprises the steps of: 1, figuring out a target bit number distributed to each GOP (Ground Observation Post) on the basis of the length requirement of a video sequence and the GOP in a configuration file attached by the video sequence and the speed requirement of the target bit; 2, figuring a target bit number distributed to each time layer in the hierarchical B frame structure on the basis of the target bit number of each GOP, obtained in the step 1; 3, figuring out a target bit number distributed to each image frame in the same time layer on the basis of the target bit number of each time layer, obtained in the step 2; and 4, updating a weight value of each time layer through iteration calculation according to coding information of a coding image. On the premise of no increase of the bit numbers consumed by the video coding actually, the coding complexity of each image frame in the video sequence is accurately predicted, each time layer bit distribution weight value in the hierarchical B frame structure is updated through iteration calculation, and the target bit numbers are distributed for each frame of image reasonably, thus coding efficiency of the video sequence and decoding quality of each frame of image are increased.

Description

Self-adaptive bit allocation method based on the Rate Control of hierarchical B-frame structure
Technical field
The present invention relates to the Rate Control technical field in the video coding, particularly a kind of self-adaptive bit allocation method of the Rate Control based on the hierarchical B-frame structure.
Background technology
In recent years along with Development of Multimedia Technology; People require increasingly high to the definition of Internet video, fluency etc.; Yet the size of the network bandwidth has determined the quantity of transmitted bit in the unit interval, and the various network terminal is not quite similar to the demand of video quality height, network bandwidth size, just because of this demand; (SVC is the abbreviation of English Scalable Video Coding to SVC; Its Chinese translation is a scalable video) technology becomes and solves one of adaptive key technology of video, and H.264 SVC is keeping having adopted the forecast coding structure based on hierarchical B-frame that is proposed by FraunhoferHHI (Heinrich-Hertz-Institute) on the basis of high efficient coding; Obtain higher code efficiency, also increased the computation complexity of prediction simultaneously.
The Rate Control technology is mainly adjusted through the output bit to video encoder in real time; To obtain the best equilibrium between video quality and the bandwidth usage, the Rate Control technology comprises that (CBR is the abbreviation of English Constant Bit Rate for VBR (VBR is the abbreviation of English Variable Bit Rate, and its Chinese translation is variable bit control) and CBR; Its Chinese translation is fixed bit control) two kinds of patterns; Wherein, the CBR pattern is a kind of closed-loop process pattern, and it is input as video sequence and target bit rate; According to the estimation of the complexity of video sequence, the situation that takies and the channel width of decoder buffer; QP (QP is the abbreviation of English quantization parameter, and its Chinese translation is a quantization step) is dynamically adjusted, obtain satisfactory code check.The self-adaptive bit allocation method of Rate Control of the present invention is to the CBR pattern in the Rate Control technology.
The hierarchical B-frame structure is divided into a plurality of time horizons with video sequence according to time orientation encodes, and has comprised a basic layer and a plurality of enhancement layer.The image of basic layer need not rely on other image layers and carry out absolute coding when coding, but the image of enhancement layer must depend on the image of other image layers.Document 1 has proposed a kind of current comparatively popular Bit distribution method based on the hierarchical B-frame structure; This method is according to dependence between basic layer and each enhancement layer in the hierarchical B-frame structure, for each time horizon has distributed a fixing weighted value that carries out Bit Allocation in Discrete.Weighted value size according to each time horizon in the process of carrying out Rate Control is distributed target bit for it.Though this fixed weight value Bit distribution method has been considered the dependence in the time prediction between basic layer and each enhancement layer; But two big-differences in the video coding process have but been ignored: the video sequence interframe motion smoothing that 1, has; The video sequence interframe motion that has is fierce, and different video sequence coding complexities have nothing in common with each other; 2, in same video sequence, motion smoothing between the partial frame that has, the part interframe movement that has is fierce, even encoder complexity is also different between each picture frame on the layer at the same time.Adopt fixed weight value Bit distribution method; Each time horizon of different video sequences is adopted the weighted value size of fixing Bit Allocation in Discrete; Each picture frame in the same time horizon is adopted the weighted value size of identical Bit Allocation in Discrete; Do not consider this two big-difference in the video coding fully, this will reduce the code efficiency of whole video sequence, thereby has reduced final decoded image quality.Document 1:Y. Liu, Z. Li, Y. C. Soh; " Rate Control of is Scalable Extension H.264/AVC ", IEEE Trans. Circuits Syst. Video Technol., Vol.18; No.1, pp. 116-121, Jan. 2008.
Summary of the invention
The object of the present invention is to provide a kind of self-adaptive bit allocation method of the Rate Control based on the hierarchical B-frame structure; Under the condition of the bit number that does not improve the video coding actual consumption; Encoder complexity through each picture frame in the accurate prediction video sequence; Calculate the weighted value size of the Bit Allocation in Discrete of each time horizon in the hierarchical B-frame structure, improve the code efficiency of video sequence and the quality of each picture frame image of decoding back.
For achieving the above object; Design of the present invention is as shown in Figure 1; At first; By in the incidental configuration file of video sequence to video sequence and the length requirement of GOP (GOP is the abbreviation of English group of pictures, and its Chinese translation is an image sets) and the rate requirement of target bits, calculate the target bit of distributing to each GOP; Then, by the target bit that above-mentioned each GOP that obtains distributes, go out to distribute to the target bit of each time horizon according to the hierarchical B-frame Structure Calculation; Then, by the target bit of above-mentioned each time horizon that obtains, go out to distribute to the target bit of each picture frame according to the various computing of encoder complexity in the identical time horizon; At last, according to distributing each picture frame target bit that obtains that each picture frame is encoded in the last step, pass through the weighted value that iterative computation is upgraded this time horizon by the coded message that each picture frame coding in the identical time horizon obtains.
The present invention can be under the condition of the bit rate that does not improve the video coding actual consumption; Encoder complexity through accurate prediction video sequence and each picture frame thereof; Reasonably calculate the target bit of distributing to each picture frame in the video sequence, improve the quality of code efficiency and each picture frame effectively.
In common video sequence, movement degree has large-scale fluctuation between different GOP, and the different images interframe in same GOP; Motion also has sizable fluctuation, regardless of movement degree, because in the hierarchical B-frame structure; Basic tomographic image adopts absolute coding; So all need distribute more bit number to improve the coding quality of image, but in the violent GOP of motion, because the image correlation of the image of basic layer and enhancement layer is little; Inter prediction is inaccurate, so the image of enhancement layer also need distribute more target bit; On the other hand; In the GOP of motion smoothing, the image of basic layer and the image correlation of enhancement layer are big, and inter prediction is comparatively accurate; Therefore the image of enhancement layer need not to distribute more target bit, only needs most of target bit is distributed to basic tomographic image to improve the quality of reference frame.For this reason, the basic layer of hierarchical B-frame need be regulated according to the encoder complexity self adaptation of each picture frame among the GOP with the weighted value size of the target bit of each enhancement layer.The hierarchical B-frame structure decides the time horizon at each picture frame place by the length of the playing sequence of each picture frame and GOP, and therefore also there is bigger difference in the encoder complexity of each picture frame on identical time horizon.For this reason, each picture frame in the identical time horizon can not also should be regulated according to its encoder complexity simply according to the weighted value size of place time horizon when distributing target bit.The present invention is through actual consumption bit number that extracts each picture frame corresponding reference frame in the current time layer and the encoder complexity that coding quality PSNR coded messages such as (PSNR are the abbreviation of English Peak Signal to Noise Ratio, and its Chinese translation is a Y-PSNR) is analyzed each picture frame in the current time layer.
Given this; The present invention proposes a kind of coded message based on encoded image respectively self adaptation regulate the big or small method of each picture frame predictive coding complexity in time horizon weighted value size and the identical time horizon, this method has been utilized the correlation of each picture frame on time orientation in the video sequence.
Referring to Fig. 2, Fig. 3, the characteristics of hierarchical B-frame structure are whenever to insert a width of cloth key frame at a distance from one section particular time interval, and the picture frame between two key frames is called non-key frame; A GOP is made up of a key frame and one group of non-key frame; Wherein key frame adopts intraframe predictive coding or unidirectional inter prediction encoding, constitute the basic layer in the time horizon, but not key frame adopts the bidirectional interframe predictive coding; Constitute the enhancement layer in the time horizon; Different with video coding in the past is, is not the DISPLAY ORDER coding according to each picture frame in the video sequence based on the video coding of hierarchical B-frame structure, but according to time horizon from bottom to top coding.Therefore the length of GOP is different, and the time horizon at picture frame place will change, thereby causes its reference frame and coded sequence also inequality; The direction of arrow among the figure is to point to coded frame by reference frame, such as the 8th frame in the DISPLAY ORDER, in Fig. 2; It is a key frame, adopts unidirectional inter prediction encoding, is positioned at the 0th layer of time horizon; Promptly basic layer has only reference frame of the 0th frame; And in Fig. 3, it is non-key frame, adopts the bidirectional interframe predictive coding, is positioned at the 1st layer of time horizon, and promptly enhancement layer has comprised the 0th frame and two reference frames of the 16th frame.
The weighted value size of supposing each time horizon in the hierarchical B-frame structure of current GOP is respectively
Figure 2011103908715100002DEST_PATH_IMAGE001
; Wherein
Figure 2011103908715100002DEST_PATH_IMAGE002
expression is basic layer;
Figure 2011103908715100002DEST_PATH_IMAGE003
expression top in the enhancement layer; Except basic layer; The picture frame sum of each time horizon is
Figure 2011103908715100002DEST_PATH_IMAGE004
; Weighted value size according to each time horizon; Calculate the target bit of distributing to each time horizon, its calculation expression is:
Figure 2011103908715100002DEST_PATH_IMAGE005
Wherein
Figure 2011103908715100002DEST_PATH_IMAGE006
expression distribute to
Figure 2011103908715100002DEST_PATH_IMAGE007
layer time horizon target bit; Target bit among the current GOP of
Figure 2011103908715100002DEST_PATH_IMAGE008
expression; The sum of picture frame in
Figure 2011103908715100002DEST_PATH_IMAGE009
and layer of representing
Figure 162862DEST_PATH_IMAGE007
respectively and
Figure 2011103908715100002DEST_PATH_IMAGE011
layer time horizon; The weighted value size of
Figure 2011103908715100002DEST_PATH_IMAGE012
and the layer layer time horizon with
Figure 250084DEST_PATH_IMAGE011
of representing
Figure 392986DEST_PATH_IMAGE007
respectively comprises basic layer and the total time number of plies of enhancement layer in
Figure 2011103908715100002DEST_PATH_IMAGE014
expression hierarchical B-frame structure.
According to the hierarchical B-frame structure; Each picture frame in the enhancement layer finds two its corresponding reference frames; One preceding; One after; The reference prediction encoder complexity that it provides is expressed as ,
Figure 2011103908715100002DEST_PATH_IMAGE016
respectively, and two reference prediction encoder complexity calculation expressions are respectively:
Figure 2011103908715100002DEST_PATH_IMAGE017
Figure 2011103908715100002DEST_PATH_IMAGE018
In the formula;
Figure 2011103908715100002DEST_PATH_IMAGE019
and
Figure 2011103908715100002DEST_PATH_IMAGE020
representes forward direction and the bit number of back to two reference frame actual consumptions respectively, and
Figure 2011103908715100002DEST_PATH_IMAGE021
and
Figure 2011103908715100002DEST_PATH_IMAGE022
representes forward direction and the coding quality PSNR of back to two reference frames respectively.
With the predictive coding complexity of picture frame be defined as
Figure 2011103908715100002DEST_PATH_IMAGE023
, its expression formula is:
Figure 2011103908715100002DEST_PATH_IMAGE024
Give the target bit of each picture frame according to the predictive coding complexity dispensed of each picture frame in the identical time horizon, its allocation algorithm is shown below:
Figure 2011103908715100002DEST_PATH_IMAGE025
The target bit of
Figure 2011103908715100002DEST_PATH_IMAGE027
two field picture in
Figure 2011103908715100002DEST_PATH_IMAGE026
expression layer time horizon of distributing to
Figure 193638DEST_PATH_IMAGE007
in the formula; The target bit
Figure 828199DEST_PATH_IMAGE006
of
Figure 845199DEST_PATH_IMAGE007
distributed in
Figure 2011103908715100002DEST_PATH_IMAGE028
expression layer time horizon deducts remaining bit number behind the used up bit number of encoding; The predictive coding complexity of
Figure 709884DEST_PATH_IMAGE027
two field picture and
Figure 797926DEST_PATH_IMAGE011
two field picture in
Figure 2011103908715100002DEST_PATH_IMAGE029
and
Figure 2011103908715100002DEST_PATH_IMAGE030
the layer time horizon of representing
Figure 223408DEST_PATH_IMAGE007
respectively, the sum of picture frame in
Figure 64959DEST_PATH_IMAGE009
expression the Ceng time horizon.
So, each picture frame in the enhancement layer among the GOP all will obtain a target bit, obtain QP according to this target bit according to the binomial rate-distortion model then, encode according to QP at last.
After each picture frame is all encoded and is accomplished among the GOP; Bit number according to actual consumption passes through the weighted value that iterative computation is upgraded each time horizon, and its calculating formula is:
Figure 2011103908715100002DEST_PATH_IMAGE031
The bit number of two field picture coding actual consumption of
Figure 2011103908715100002DEST_PATH_IMAGE032
expression layer in the time horizon wherein, the sum of picture frame in
Figure 791344DEST_PATH_IMAGE009
expression
Figure 314730DEST_PATH_IMAGE007
the layer time horizon.
If first GOP that the current GOP that will encode is a video sequence, then according to the introduction in the document one, it is following that the weighted value size of each time horizon is got empirical value:
Figure 2011103908715100002DEST_PATH_IMAGE033
According to the foregoing invention design, the present invention adopts following technical proposals:
A kind of self-adaptive bit allocation method of the Rate Control based on the hierarchical B-frame structure; It is characterized in that; Adaptive bit based on the picture frame kinetic characteristic distributes; Through the temporal correlation between adjacent GOP and the neighborhood graph picture frame, coded messages such as the actual consumption bit number of extraction neighborhood graph picture frame and coding quality PSNR, the encoder complexity of prognostic chart picture frame; The predictive coding complexity that obtains by aforementioned calculation; Regulate the weighted value size of each time horizon among the GOP adaptively and be each picture frame distribution target bit in the identical time horizon; Can under the situation that does not improve coding actual consumption bit rate, obviously improve code efficiency and picture quality like this; Its concrete steps are:
(1), the Rate Control of GOP: by in the incidental configuration file of video sequence to the length requirement of video sequence and GOP and the rate requirement of target bits, calculate the target bit of distributing to each GOP;
(2), the Rate Control of time horizon:, go out to distribute to the target bit of each time horizon according to the hierarchical B-frame Structure Calculation by the target bit that above-mentioned each GOP that obtains distributes;
(3), the Rate Control of picture frame:, go out to distribute to the target bit of each picture frame according to the various computing of encoder complexity in the identical time horizon by the target bit of above-mentioned each time horizon that obtains;
(4), weighted value upgrades: according to distributing each picture frame target bit that obtains that each picture frame is encoded in the last step, pass through the weighted value that iterative computation is upgraded this time horizon by the coded message that each picture frame coding in the identical time horizon obtains.
The Rate Control of above-mentioned steps (1) GOP, by in the incidental configuration file of video sequence to the length requirement of video sequence and GOP and the rate requirement of target bits, calculate the target bit of distributing to each GOP, its step is following:
(1-1), according to the length of GOP, confirm the hierarchical B-frame structure, the length difference of GOP will cause the difference of hierarchical B-frame structure;
(1-2), based on the length of GOP in the step (1-1) and the length of video sequence, confirm the number of GOP in the video sequence;
Figure 2011103908715100002DEST_PATH_IMAGE034
The Rate Control of above-mentioned steps (2) time horizon is to confirm the residing time horizon of each picture frame among the GOP according to the hierarchical B-frame structure, according to the weighted value size of time horizon, calculates the target bit of distributing to each time horizon then, the steps include:
(2-1), obtain the weighted value size
Figure 416678DEST_PATH_IMAGE001
of each time horizon among the current GOP according to the coded message of each picture frame among the last GOP; Wherein
Figure 595986DEST_PATH_IMAGE002
expression is basic layer; expression top in the enhancement layer; As if first GOP that current GOP is a video sequence, the weighted value size of each time horizon is set according to formula (7) then;
Figure 2011103908715100002DEST_PATH_IMAGE035
The Rate Control of above-mentioned steps (3) picture frame; At first obtain the predictive coding complexity of each picture frame in the identical time horizon through the coded message of extracting reference frame image; Then again according to the target bit of distributing to each time horizon in the step (2); Calculate the target bit of distributing to each picture frame in the identical time horizon, the steps include:
(3-1), find pairing two reference frames of each picture frame by the hierarchical B-frame structure; And coded messages such as the actual consumption bit number of extraction reference frame and coding quality PSNR, calculate two reference prediction encoder complexities that reference frame provided respectively according to following formula:
Figure 2011103908715100002DEST_PATH_IMAGE036
and
and
Figure 551545DEST_PATH_IMAGE020
representes forward direction and the bit number of back to two reference frame actual consumptions respectively, and
Figure 14887DEST_PATH_IMAGE021
and
Figure 359281DEST_PATH_IMAGE022
representes forward direction and the coding quality PSNR of back to two reference frames respectively;
(3-2), according to the predictive coding complexity of computes picture frame:
The predictive coding complexity of each picture frame in the target bit of distributing to time horizon that (3-3), calculates by step (2) and the identical time horizon that step (3-2) obtains; Dispensed is given the target bit
Figure 794941DEST_PATH_IMAGE026
of each picture frame, and calculation expression is:
Figure 2011103908715100002DEST_PATH_IMAGE039
Wherein the target bit of expression layer time horizon of distributing to
Figure 454910DEST_PATH_IMAGE007
deducts remaining bit number behind the used up bit number of encoding; The predictive coding complexity of
Figure 428682DEST_PATH_IMAGE027
two field picture and
Figure 635673DEST_PATH_IMAGE011
two field picture in
Figure 689899DEST_PATH_IMAGE029
and
Figure 304551DEST_PATH_IMAGE030
the layer time horizon of representing respectively, the sum of picture frame in
Figure 185340DEST_PATH_IMAGE009
expression the Ceng time horizon.
Above-mentioned steps (4) weighted value upgrades: at first according to the target bit of each picture frame that obtains in the step (3), each picture frame is encoded, follow the weighted value according to coded message layer computing time of each picture frame, its step is following:
(4-1), by the target bit of distributing to each picture frame in the step (3); Calculate the QP value that each picture frame is encoded according to binomial rate-distortion model
Figure 2011103908715100002DEST_PATH_IMAGE040
, wherein
Figure 2011103908715100002DEST_PATH_IMAGE041
(abbreviation of Mean Absolute Difference) is mean absolute error;
Figure 2011103908715100002DEST_PATH_IMAGE042
and
Figure 2011103908715100002DEST_PATH_IMAGE043
is two recursive models parameters,
Figure 2011103908715100002DEST_PATH_IMAGE044
the header file information bit of presentation code packing;
(4-2), picture frame is encoded, and obtain the bit number of each picture frame actual consumption according to the QP value that calculates in the step (4-1);
(4-3), by the bit number of each picture frame actual consumption in the step (4-2); Difference according to the residing time horizon of each picture frame; Calculate the weighted value
Figure 2011103908715100002DEST_PATH_IMAGE045
of each time horizon, its calculating formula is:
Figure 2011103908715100002DEST_PATH_IMAGE046
Where
Figure 210245DEST_PATH_IMAGE032
denotes
Figure 537321DEST_PATH_IMAGE007
Time layer of the first layer
Figure 821672DEST_PATH_IMAGE027
The actual consumption of the frame number of bits;
Figure 906303DEST_PATH_IMAGE009
denotes layer time the total number of layers in the image frame.
The self-adaptive bit allocation method of the Rate Control based on the hierarchical B-frame structure of the present invention compared with prior art; Have following conspicuous outstanding substantive distinguishing features and remarkable advantage: this method is distributed based on the adaptive bit number of picture frame kinetic characteristic; Through the temporal correlation between adjacent GOP and the neighborhood graph picture frame; Coded messages such as the actual consumption bit number of extraction neighborhood graph picture frame and coding quality PSNR, the encoder complexity of prognostic chart picture frame; This method is carried out adaptive target bit and is distributed by above-mentioned definite predictive coding complexity, can under the situation that does not improve coding actual consumption bit rate, obviously improve code efficiency and picture quality.
Description of drawings
Fig. 1 is the block diagram of self-adaptive bit allocation method that the present invention is based on the Rate Control of hierarchical B-frame structure;
Fig. 2 is that the GOP length used in the embodiment of the invention is 8 hierarchical B-frame structural framing figure;
Fig. 3 is that GOP length is 16 hierarchical B-frame structural framing figure;
Fig. 4 is the experimental result comparison diagram of " Akiyo " sequence between the Bit distribution method of the present invention and set time layer weighted value;
Fig. 5 is the experimental result comparison diagram of " Foreman " sequence between the Bit distribution method of the present invention and set time layer weighted value;
Fig. 6 is the experimental result comparison diagram of " News " sequence between the Bit distribution method of the present invention and set time layer weighted value;
Fig. 7 is the experimental result comparison diagram of " Mobile " sequence between the Bit distribution method of the present invention and set time layer weighted value;
Among above-mentioned Fig. 4 to Fig. 7, transverse axis is represented the speed (bit rate) of the target bits in the video sequence configuration file, and unit is kilobits per second (kbit/s), and the longitudinal axis is represented decoded image quality (PSNR); Band line of dots among the figure is represented experimental result of the present invention, and band square frame dotted line is represented the experimental result of method in the document one.
Fig. 8 is the experimental result comparison sheet of the Bit distribution method of the present invention and set time layer weighted value.
Embodiment
Below in conjunction with Figure of description embodiments of the invention are done further explain.
One embodiment of the present of invention are described below: referring to Fig. 1; Self-adaptive bit allocation method based on the Rate Control of hierarchical B-frame structure is the coded message according to encoded image; It is big or small that self adaptation is regulated in time horizon weighted value size and the identical time horizon each picture frame predictive coding complexity; Be each picture frame reasonable distribution target bit in the video sequence, improve code efficiency and decoded image quality, its concrete steps are following:
(1) Rate Control of GOP: based in the incidental configuration file of video sequence to the length requirement of video sequence and GOP and the rate requirement of target bits, calculate the target bit of distributing to each GOP;
(2) Rate Control of time horizon: the target bit of distributing for each GOP based on obtaining in the step (1), calculate the target bit of distributing to each time horizon in the hierarchical B-frame structure;
(3) Rate Control of picture frame: the target bit of distributing for each time horizon based on obtaining in the step (2) calculates the target bit of distributing to each picture frame in the same time horizon;
(4) weighted value upgrades: the target bit of distributing for each picture frame according to obtaining in the step (3) is encoded, and again according to the coded message of coded image, upgrades the weighted value of each time horizon through iterative computation.
The Rate Control of the described GOP of above-mentioned steps (1), its step is following:
(1-1), according to the length of GOP, confirm the hierarchical B-frame structure;
(1-2), based on the length of GOP in the step (1-1) and the length of video sequence, confirm the number of GOP in the video sequence;
(1-3), based on the number of GOP in the step (1-2), confirm to distribute to the target bit of each GOP.
The Rate Control of the described time horizon of above-mentioned steps (2), its step is following:
(2-1), obtain the weighted value size
Figure 115884DEST_PATH_IMAGE001
of each time horizon among the current GOP according to the coded message of each picture frame among the last GOP; As if first GOP that current GOP is a video sequence, the weighted value size of each time horizon is set according to formula (7) then;
(2-2), by the weighted value size
Figure 571136DEST_PATH_IMAGE012
of each time horizon in the target bit of distributing to each GOP of step (1) gained and the step (2-1), calculate the target bit of distributing to each time horizon among the GOP according to formula (1).
The Rate Control of the described picture frame of above-mentioned steps (3), its step is following:
(3-1), find pairing two reference frames of each picture frame by the hierarchical B-frame structure; And extract the coded messages such as actual consumption bit number and coding quality PSNR of reference frame, calculate the reference prediction encoder complexity of two reference frames respectively according to formula (2) and formula (3);
(3-2), according to the predictive coding complexity of formula (4) calculating chart picture frame;
(3-3), the predictive coding complexity of each picture frame in the target bit of the time horizon that calculates by step (2) and the identical time horizon that step (3-2) obtains, give the target bit of each picture frame according to formula (5) dispensed.
The described weighted value of above-mentioned steps (4) upgrades, and its step is following:
(4-1), by the target bit that is distributed in the step (3), calculate the QP value of each picture frame coding according to binomial rate-distortion model
Figure 907178DEST_PATH_IMAGE040
;
(4-2), picture frame is encoded, and obtain the bit number of each picture frame actual consumption according to the QP value that calculates in the step (4-1);
(4-3), by the bit number of each picture frame actual consumption in the step (4-2), calculate and upgrade the weighted value of each time horizon according to formula (6).
Below the fixed weight value Bit distribution method based on the Rate Control of hierarchical B-frame structure technology in the present invention and the document one is compared, the test video sequence that relatively provides has 176 * 144 QCIF and two kinds of sizes of CIF of 352 * 288.In order to embody the robustness that the present invention has, when choosing list entries, both adopted " Akiyo " and " News " this type games to relax level and smooth video sequence and also adopted the violent relatively video sequence of " Foreman " and " Mobile " this type games.In addition; In order to embody the accuracy of Rate Control; Each video sequence all is provided with three to five target bit value in the experiment; Under different target bit rates required, the present invention had the experimental result comparison sheet that good broad applicability sees the Bit distribution method of the present invention shown in Figure 8 and set time layer weighted value for details with proof.
Experimental result comparison sheet 1 has at length compared the method in the present invention and the document one, the bit number of
Figure 2011103908715100002DEST_PATH_IMAGE047
expression actual consumption in the table and the percentage error between the target bit.From the data of Fig. 8, can find out the tight surrounding target bit number of actual bit number that Bit distribution method of the present invention consumes; Compare the accuracy rate basically identical of control with the method for document one; Yet tangible increase is arranged but on PSNR; The average 0.1-0.4dB that promotes; Fig. 4, Fig. 5, Fig. 6, Fig. 7 have listed the comparison of these four video sequence PSNR under different target bit rates respectively, can see under identical target bit, and the decoded image quality curve of this method remains on the decoded image quality curve of the method in the document one always.This shows that this method has improved the code efficiency and the picture quality of cycle tests effectively.

Claims (5)

1. the self-adaptive bit allocation method based on the Rate Control of hierarchical B-frame structure is characterized in that, these method concrete steps are:
(1), the Rate Control of GOP: by in the incidental configuration file of video sequence to the length requirement of video sequence and GOP and the rate requirement of target bits, calculate the target bit of distributing to each GOP;
(2), the Rate Control of time horizon:, go out to distribute to the target bit of each time horizon according to the hierarchical B-frame Structure Calculation by the target bit that above-mentioned each GOP that obtains distributes;
(3), the Rate Control of picture frame:, go out to distribute to the target bit of each picture frame according to the various computing of encoder complexity in the identical time horizon by the target bit of above-mentioned each time horizon that obtains;
(4), weighted value upgrades: according to distributing each picture frame target bit that obtains that each picture frame is encoded in the last step, pass through the weighted value that iterative computation is upgraded this time horizon by the coded message that each picture frame coding in the identical time horizon obtains.
2. according to the self-adaptive bit allocation method of the said Rate Control based on the hierarchical B-frame structure of claim 1; It is characterized in that; The Rate Control of the described GOP of above-mentioned steps (1); By in the incidental configuration file of video sequence to the length requirement of video sequence and GOP and the rate requirement of target bits, calculate the target bit of distributing to each GOP, its step is following:
(1-1), according to the length of GOP, confirm the hierarchical B-frame structure, the length difference of GOP will cause the difference of hierarchical B-frame structure;
(1-2), based on the length of GOP in the step (1-1) and the length of video sequence, confirm the number of GOP in the video sequence;
Figure 230160DEST_PATH_IMAGE001
3. according to the self-adaptive bit allocation method of the said Rate Control based on the hierarchical B-frame structure of claim 2; It is characterized in that; The Rate Control of the described time horizon of above-mentioned steps (2) is to confirm the residing time horizon of each picture frame among the GOP according to the hierarchical B-frame structure, and is big or small according to the weighted value of time horizon then; Calculate the target bit of distributing to each time horizon, the steps include:
(2-1), obtain the weighted value size
Figure 521202DEST_PATH_IMAGE002
of each time horizon among the current GOP according to the coded message of each picture frame among the last GOP; Wherein
Figure 300939DEST_PATH_IMAGE003
expression is basic layer; expression top in the enhancement layer; As if first GOP that current GOP is a video sequence, the weighted value size of each time horizon is set according to formula (7) then;
4. according to the self-adaptive bit allocation method of the said Rate Control based on the hierarchical B-frame structure of claim 3; It is characterized in that; The Rate Control of the described picture frame of above-mentioned steps (3) at first obtains the predictive coding complexity of each picture frame in the identical time horizon, then again according to the target bit of distributing to each time horizon in the step (2) through the coded message of extracting reference frame image; Calculate the target bit of distributing to each picture frame in the identical time horizon, the steps include:
(3-1), find pairing two reference frames of each picture frame by the hierarchical B-frame structure; And coded messages such as the actual consumption bit number of extraction reference frame and coding quality PSNR, calculate two reference prediction encoder complexities that reference frame provided respectively according to following formula:
Figure 270666DEST_PATH_IMAGE006
and
Figure 537699DEST_PATH_IMAGE007
Figure 674283DEST_PATH_IMAGE008
and
Figure 343161DEST_PATH_IMAGE009
representes forward direction and the bit number of back to two reference frame actual consumptions respectively, and and
Figure 90854DEST_PATH_IMAGE011
representes forward direction and the coding quality PSNR of back to two reference frames respectively;
(3-2), according to the predictive coding complexity of computes picture frame:
The predictive coding complexity of each picture frame in the target bit of distributing to time horizon that (3-3), calculates by step (2) and the identical time horizon that step (3-2) obtains; Dispensed is given the target bit
Figure 787470DEST_PATH_IMAGE013
of each picture frame, and calculation expression is:
Figure 217314DEST_PATH_IMAGE014
Wherein the target bit
Figure 315217DEST_PATH_IMAGE017
of expression layer time horizon of distributing to deducts remaining bit number behind the used up bit number of encoding; The predictive coding complexity of
Figure 893780DEST_PATH_IMAGE020
two field picture and two field picture in
Figure 650384DEST_PATH_IMAGE018
and the layer time horizon of representing respectively, the sum of picture frame in
Figure 553749DEST_PATH_IMAGE022
Biao Shi the Ceng time horizon.
5. according to the self-adaptive bit allocation method of the said Rate Control based on the hierarchical B-frame structure of claim 4; It is characterized in that; The described weighted value of above-mentioned steps (4) upgrades: at first according to the target bit of each picture frame that obtains in the step (3); Each picture frame is encoded, follow the weighted value according to coded message layer computing time of each picture frame, its step is following:
(4-1), by the target bit of distributing to each picture frame in the step (3); Calculate the QP value that each picture frame is encoded according to binomial rate-distortion model
Figure 788738DEST_PATH_IMAGE023
, wherein
Figure 731286DEST_PATH_IMAGE024
(abbreviation of Mean Absolute Difference) is mean absolute error;
Figure 605439DEST_PATH_IMAGE025
and
Figure 26056DEST_PATH_IMAGE026
is two recursive models parameters, the header file information bit of presentation code packing;
(4-2), picture frame is encoded, and obtain the bit number of each picture frame actual consumption according to the QP value that calculates in the step (4-1);
(4-3), by the bit number of each picture frame actual consumption in the step (4-2); Difference according to the residing time horizon of each picture frame; Calculate the weighted value
Figure 612075DEST_PATH_IMAGE028
of each time horizon, its calculating formula is:
Where denotes Time layer of the first layer
Figure 45145DEST_PATH_IMAGE020
The actual consumption of the frame number of bits;
Figure 395355DEST_PATH_IMAGE022
denotes layer time the total number of layers in the image frame .
CN2011103908715A 2011-12-01 2011-12-01 Self-adaption bit distribution method based on code rate control of hierarchical B frame structure Pending CN102420987A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011103908715A CN102420987A (en) 2011-12-01 2011-12-01 Self-adaption bit distribution method based on code rate control of hierarchical B frame structure

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011103908715A CN102420987A (en) 2011-12-01 2011-12-01 Self-adaption bit distribution method based on code rate control of hierarchical B frame structure

Publications (1)

Publication Number Publication Date
CN102420987A true CN102420987A (en) 2012-04-18

Family

ID=45945196

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011103908715A Pending CN102420987A (en) 2011-12-01 2011-12-01 Self-adaption bit distribution method based on code rate control of hierarchical B frame structure

Country Status (1)

Country Link
CN (1) CN102420987A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104067338A (en) * 2011-10-19 2014-09-24 奥兰吉公司 Improved hierarchical coding
CN106937115A (en) * 2017-03-02 2017-07-07 杭州当虹科技有限公司 The method for carrying out bit number training to bit number fallout predictor in Video coding
CN109819253A (en) * 2017-11-21 2019-05-28 腾讯科技(深圳)有限公司 Method for video coding, device, computer equipment and storage medium
CN109862359A (en) * 2018-12-29 2019-06-07 北京数码视讯软件技术发展有限公司 Bit rate control method, device and electronic equipment based on hierarchical B-frame
CN110139101A (en) * 2019-06-20 2019-08-16 电子科技大学 A kind of frame level bit distribution method based on the control of the domain λ code rate
CN112752103A (en) * 2019-10-31 2021-05-04 Oppo广东移动通信有限公司 Method and device for controlling code rate
CN112954348A (en) * 2021-02-20 2021-06-11 浙江商汤科技开发有限公司 Video encoding method and apparatus, electronic device, and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040146108A1 (en) * 2003-01-23 2004-07-29 Shih-Chang Hsia MPEG-II video encoder chip design
CN101917614A (en) * 2010-06-03 2010-12-15 北京邮电大学 Bit rate control method based on H.264 hierarchical B-frame coding structure
CN102186084A (en) * 2011-06-07 2011-09-14 东莞电子科技大学电子信息工程研究院 Spatial enhancement layer code rate control realization method for scalable video coding

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040146108A1 (en) * 2003-01-23 2004-07-29 Shih-Chang Hsia MPEG-II video encoder chip design
CN101917614A (en) * 2010-06-03 2010-12-15 北京邮电大学 Bit rate control method based on H.264 hierarchical B-frame coding structure
CN102186084A (en) * 2011-06-07 2011-09-14 东莞电子科技大学电子信息工程研究院 Spatial enhancement layer code rate control realization method for scalable video coding

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
严涛等: "《基于二项式R-D模型的多视点视频码率控制算法》", 《高技术通讯》, vol. 20, no. 5, 31 May 2010 (2010-05-31), pages 481 - 486 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104067338A (en) * 2011-10-19 2014-09-24 奥兰吉公司 Improved hierarchical coding
CN104067338B (en) * 2011-10-19 2016-10-05 奥兰吉公司 Improved hierarchical coding
CN106937115A (en) * 2017-03-02 2017-07-07 杭州当虹科技有限公司 The method for carrying out bit number training to bit number fallout predictor in Video coding
CN109819253A (en) * 2017-11-21 2019-05-28 腾讯科技(深圳)有限公司 Method for video coding, device, computer equipment and storage medium
CN109862359A (en) * 2018-12-29 2019-06-07 北京数码视讯软件技术发展有限公司 Bit rate control method, device and electronic equipment based on hierarchical B-frame
CN110139101A (en) * 2019-06-20 2019-08-16 电子科技大学 A kind of frame level bit distribution method based on the control of the domain λ code rate
CN110139101B (en) * 2019-06-20 2022-03-25 电子科技大学 Frame-level bit distribution method based on lambda domain code rate control
CN112752103A (en) * 2019-10-31 2021-05-04 Oppo广东移动通信有限公司 Method and device for controlling code rate
WO2021083286A1 (en) * 2019-10-31 2021-05-06 Oppo广东移动通信有限公司 Bit rate control method and device, and computer storage medium
CN112752103B (en) * 2019-10-31 2023-01-31 Oppo广东移动通信有限公司 Method and device for controlling code rate
CN112954348A (en) * 2021-02-20 2021-06-11 浙江商汤科技开发有限公司 Video encoding method and apparatus, electronic device, and storage medium
CN112954348B (en) * 2021-02-20 2024-04-02 浙江商汤科技开发有限公司 Video encoding method and device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN102420987A (en) Self-adaption bit distribution method based on code rate control of hierarchical B frame structure
CN101010964B (en) Method and apparatus for using frame rate up conversion techniques in scalable video coding
CN101171843B (en) Content classification for multimedia processing
CN102067610B (en) Rate control model adaptation based on slice dependencies for video coding
CN100558170C (en) Video encoding/decoding method with active buffer management and complexity control function
CN102630013B (en) Bit rate control video compression method and device on basis of scene switching
CN101795415B (en) Method and device for controlling code rate in video coding
CN105049850A (en) HEVC (High Efficiency Video Coding) code rate control method based on region-of-interest
CN100464585C (en) Video-frequency compression method
CN102065302B (en) H.264 based flexible video coding method
CN102036062B (en) Video coding method and device and electronic equipment
CN106416251A (en) Scalable video encoding rate adaptation based on perceived quality
CN107071422B (en) Low complex degree HEVC rate adaption transformation coding method based on image correlation model
CN103179405A (en) Multi-stage area-of-interest-based multi-view video coding method
CN103250411B (en) The method controlled for the intrasystem bit rate of scalable video and system thereof
CN103533359A (en) H.264 code rate control method
CN108200431B (en) Bit allocation method for video coding code rate control frame layer
CN101835042A (en) Wyner-Ziv video coding system controlled on the basis of non feedback speed rate and method
CN102300094B (en) Video coding method
CN107277537B (en) A kind of distributed video compressed sensing method of sampling based on temporal correlation
CN104159095B (en) A kind of multiple views texture video and the bit rate control method of depth map encoding
CN103002283A (en) Multi-view distributed video compression side information generation method
CN102572428B (en) Side information estimating method oriented to distributed coding and decoding of multimedia sensor network
CN102970510A (en) Method for transmitting human face video
CN102740081B (en) Method for controlling transmission errors of multiview video based on distributed coding technology

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20120418