CN101873484A - Method and device for selecting coding mode in layered video coding - Google Patents

Method and device for selecting coding mode in layered video coding

Info

Publication number
CN101873484A
Authority
CN
China
Prior art keywords
layer
motion information
coding
mode
motion vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 200910165229
Other languages
Chinese (zh)
Other versions
CN101873484B (en)
Inventor
武晓阳
俞海
贾永华
胡扬忠
邬伟琪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Hikvision Digital Technology Co Ltd
Original Assignee
Hangzhou Hikvision Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Hikvision Software Co Ltd
Priority to CN 200910165229 (CN101873484B)
Publication of CN101873484A
Application granted
Publication of CN101873484B
Legal status: Active
Anticipated expiration

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention relates to a method and device for selecting a coding mode in layered video coding. The method comprises the following steps: acquiring the motion information of the intra-layer prediction mode selected for an enhancement-layer coding block; on the basis of that intra-layer motion information, acquiring the motion information of the inter-layer prediction mode for the same coding block; and selecting the coding mode by judging whether the motion information of the intra-layer mode is consistent with the predicted motion information of the inter-layer mode. If the two are consistent, the block is coded in the inter-layer motion information prediction mode; otherwise it is coded in the intra-layer mode. The invention reduces the amount of calculation.

Description

Coding mode selection method and device in layered video coding
Technical field
The present invention relates to the technical field of image processing, and in particular to a coding mode selection method and device in layered video coding.
Background
In video applications, encoder efficiency has always been a central concern. This is especially true for surveillance applications that encode in real time, where an efficient encoder is a constant goal. The current trend is toward ever higher image resolutions and ever more video channels: an encoder must handle high definition as well as standard definition, and must encode several channels rather than a single one. Encoder implementations therefore need to incorporate a range of fast processing algorithms.
Scalable coding is in practice a multi-layer coding technique, and also a form of multi-channel coding: images of different resolutions must be encoded within the same encoder. When an upper-layer image is encoded, the encoder can use coding information from the same layer as reference, which is called intra-layer prediction; it can also use image information from the reference layer as reference, which is called inter-layer prediction.
At the end of 2007, the Joint Video Team of MPEG and VCEG formulated a scalable video coding standard on the basis of the H.264/AVC standard. It is called H.264/SVC (hereinafter SVC) and has been adopted by ISO as an international standard. SVC is a multi-layer video compression coding standard in which each layer corresponds to one input video sequence. The inherent attributes of a video sequence include its resolution (such as CIF, QVGA or 720P) and its frame rate (such as 30 frames/second). To improve compression performance, SVC uses inter-layer prediction: when the current layer is encoded, prediction data can be obtained from the reference layer and used as reference data. In the standard, the layer currently being encoded is called the enhancement layer and the reference layer is called the base layer.
Depending on how the input video sequences of the enhancement layer and the base layer differ, SVC comprises three scalability schemes: quality scalability, temporal scalability and spatial scalability. With quality scalability, the enhancement layer and the base layer have input sequences of identical resolution and frame rate, but the encoded enhancement layer has higher fidelity than the base layer, that is, higher picture quality. With temporal scalability, the two input sequences have the same resolution but different frame rates, the enhancement layer having the higher frame rate. With spatial scalability, the two input sequences have the same frame rate but different resolutions, the enhancement layer having the larger resolution (the resolutions may also be identical). For these three scenarios SVC uses three corresponding compression coding techniques (quality scalable, temporal scalable and spatial scalable coding) to reduce the redundancy between the enhancement layer and the base layer and so improve compression efficiency. The bitstream produced by SVC coding adapts easily and efficiently to heterogeneous environments, such as networks with widely differing bandwidths, display devices with different resolutions, and consumer terminals with differing processing power.
Spatial scalable coding is designed for the case where the input video sequences of the current layer and the reference layer differ in resolution. During coding, each video frame is divided into smaller coding units, and these smaller coding units are called coding blocks. When a coding block of the enhancement layer is encoded, the reference data needed for coding can be obtained from the data of the enhancement layer itself, which is called intra-layer prediction; it can also be obtained from the data of the base layer, which is called inter-layer prediction.
According to how the reference data is obtained, intra-layer prediction falls into two broad categories: intra-frame prediction and inter-frame prediction. Intra-frame prediction takes its reference data from the same picture frame; inter-frame prediction takes its reference data from other picture frames (one frame or two frames). In SVC, intra-frame prediction with a 4x4 block size comprises 9 modes, such as horizontal prediction and vertical prediction, and there is also prediction with 8x8 and 16x16 block sizes, each comprising several modes. Inter-frame prediction has even more modes: there is P-frame prediction with forward reference and B-frame prediction with bidirectional reference, and each can be further divided by block size into 7 different modes such as 16x16 and 16x8.
For inter-layer prediction, SVC defines two modes: inter-layer intra-frame prediction and inter-layer inter-frame prediction. When the corresponding coding block of the base layer is intra-frame coded, the enhancement-layer coding block can use inter-layer intra-frame prediction; when the corresponding coding block of the base layer is inter-frame coded, the enhancement-layer coding block can use inter-layer inter-frame prediction. According to the information being predicted, inter-layer inter-frame prediction comprises inter-layer motion information prediction (motion vector, reference index and so on) and inter-layer residual information prediction. The coding modes available for a coding block in the SVC standard are thus numerous. After coding, however, each coding block corresponds to only one coding mode, and a major task of the encoder is to select the optimal coding mode for the current macroblock from among them.
In the prior art, coding mode selection is a very important part of encoder implementation. A good mode selection scheme picks a suitable coding mode from the many available quickly and effectively, which greatly reduces coding complexity and raises coding speed while preserving compression performance.
The SVC standard was formulated only about two years ago and its implementation is still at an early stage. Current mode selection schemes directly compare the performance of intra-layer prediction and inter-layer prediction and choose the better performer as the final coding mode.
A prior-art coding mode selection method for spatial scalable coding is shown in Fig. 1. Specifically, the performance metric P_EL of the intra-layer prediction mode and the performance metric P_BL of the inter-layer prediction mode are both computed, and the mode with the better metric is selected as the final coding mode by comparing P_EL and P_BL. If P_BL is greater than P_EL, the intra-layer mode is selected; conversely, if P_BL is less than P_EL, the inter-layer mode is selected. Many performance metrics are possible, such as the rate-distortion value (RD) of the coding block or its sum of absolute differences (SAD). When compared, P_EL and P_BL must be of the same type: both SAD or both RD. These metrics measure coding efficiency, and their magnitude reflects the quality of the coding performance. The prior art therefore determines the final coding mode by comparing the performance metrics of the intra-layer and inter-layer prediction modes.
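The prior-art comparison described above can be sketched as follows. This is only an illustration, assuming SAD as the shared metric and blocks given as nested lists of pixel values; the function names are hypothetical, and the tie case (P_BL equal to P_EL) is not specified in the text, so falling back to the intra-layer mode is an assumption.

```python
def sad(block, prediction):
    """Sum of absolute differences between two equally sized 2-D blocks."""
    return sum(
        abs(a - b)
        for row_a, row_b in zip(block, prediction)
        for a, b in zip(row_a, row_b)
    )


def select_mode_prior_art(block, intra_layer_pred, inter_layer_pred):
    """Compute both metrics, then keep the mode with the smaller one."""
    p_el = sad(block, intra_layer_pred)  # metric of the intra-layer mode
    p_bl = sad(block, inter_layer_pred)  # metric of the inter-layer mode
    # P_BL < P_EL selects the inter-layer mode; P_BL > P_EL selects the
    # intra-layer mode. A tie falls back to intra-layer (assumption).
    mode = "inter_layer" if p_bl < p_el else "intra_layer"
    return mode, p_el, p_bl
```

Both metrics have to be evaluated for every block before the comparison can be made, which is the cost the application sets out to avoid.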
In researching and practicing the prior art, the inventors found the following problem:
The prior-art coding mode selection method unavoidably has to compute the performance metrics of both the intra-layer prediction mode and the inter-layer prediction mode before comparing them. This computation introduces additional computational complexity and thereby reduces the speed of the encoder.
Summary of the invention
The purpose of the embodiments of the invention is to provide a coding mode selection method and device in layered video coding that reduce the amount of calculation and the computational complexity.
To solve the above technical problem, the embodiments of the invention provide a coding mode selection method and device in layered video coding, implemented as follows:
A coding mode selection method in layered video coding comprises:
acquiring the motion information of the intra-layer prediction mode selected for an enhancement-layer coding block;
on the basis of the intra-layer prediction mode motion information, acquiring the motion information of the inter-layer prediction mode for the enhancement-layer coding block;
selecting the coding mode by judging whether the motion information of the intra-layer mode is consistent with the predicted motion information of the inter-layer mode: if the two are consistent, coding in the inter-layer motion information prediction mode; otherwise coding in the intra-layer mode.
A coding mode selection device in layered video coding comprises:
a first acquiring unit, configured to acquire the motion information of the intra-layer prediction mode selected for an enhancement-layer coding block;
a second acquiring unit, configured to acquire, on the basis of the intra-layer prediction mode motion information, the motion information of the inter-layer prediction mode for the enhancement-layer coding block;
a selecting unit, configured to select the coding mode by judging whether the motion information of the intra-layer mode is consistent with the predicted motion information of the inter-layer mode: if the two are consistent, coding in the inter-layer motion information prediction mode; otherwise coding in the intra-layer mode.
As can be seen from the technical scheme provided by the above embodiments, in SVC coding a coding block is represented by motion information and residual information. The motion information indicates the prediction data corresponding to the coding block, and the residual is the difference between the coding block and that prediction data. Inter-layer motion information prediction is one of the inter-layer prediction modes in SVC. It effectively reduces the number of bits needed to represent the motion information, but brings no obvious gain in representing the residual. The application avoids computing performance metrics during mode selection: it only checks whether the motion information of the inter-layer mode and of the intra-layer mode is consistent, and selects the coding mode accordingly, which reduces the amount of calculation while preserving image coding quality.
Description of drawings
Fig. 1 shows a prior-art coding mode selection method for spatial scalable coding;
Fig. 2 is a schematic diagram of spatial scalability in the application;
Fig. 3 is a flow chart of an embodiment of the coding mode selection method in layered video coding of the application;
Fig. 4 is a schematic diagram of a motion vector in the application;
Fig. 5 is a schematic diagram of the inter-frame prediction modes in the application;
Fig. 6 is a schematic diagram of the resolution relationship between the enhancement layer and the base layer in spatially scalable two-layer coding in the application;
Fig. 7 is a flow chart of a specific example of the embodiment of Fig. 3;
Fig. 8 is a block diagram of an embodiment of the coding mode selection device in layered video coding of the application;
Fig. 9 is another block diagram of an embodiment of the coding mode selection device in layered video coding of the application.
Embodiments
The embodiments of the invention provide a coding mode selection method and device in layered video coding.
To help those skilled in the art better understand the scheme of the invention, the embodiments are described in further detail below with reference to the drawings.
The application addresses mode selection in SVC and proposes a scheme for choosing between the intra-layer mode and the inter-layer motion information prediction mode. After the intra-layer coding mode has been selected for a coding block of the current layer, the scheme judges quickly and effectively whether to select the inter-layer motion information prediction mode, thereby reducing the complexity of inter-layer coding mode selection.
SVC is the scalable video coding standard and supports three scalability schemes: temporal scalability, spatial scalability and quality scalability. This patent applies to spatial scalability. Fig. 2 is a schematic diagram of spatial scalability: the enhancement-layer image is cropped and downscaled to obtain the base-layer image, and the dashed region of the enhancement layer is the region of the enhancement layer that corresponds to the base-layer image. SVC requires W_c >= W_b and H_c >= H_b. W_b and H_b are the width and height of the base-layer image; W_e and H_e are the width and height of the enhancement-layer image; W_c and H_c are the width and height of the region of the enhancement layer that corresponds to the base layer, so only coding blocks located within W_c, H_c have inter-layer prediction modes. (x0, y0) is the upper-left corner, in the enhancement layer, of the image region corresponding to the base layer, and is used to locate that region.
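The condition that only blocks inside the W_c by H_c region have inter-layer prediction modes can be sketched as a simple rectangle test. The function and parameter names below are hypothetical, and the sketch assumes that a block qualifies only when it lies entirely inside the region; the text does not say how partially overlapping blocks are treated.

```python
def has_inter_layer_mode(bx, by, bw, bh, x0, y0, wc, hc):
    """Return True when the enhancement-layer block lies inside the region
    that corresponds to the base layer.

    (bx, by) is the block's upper-left pixel, bw x bh its size;
    (x0, y0) is the region's upper-left corner, wc x hc its size.
    Requiring full containment is an assumption of this sketch.
    """
    return (
        x0 <= bx
        and y0 <= by
        and bx + bw <= x0 + wc
        and by + bh <= y0 + hc
    )
```

For a region starting at (16, 16) with size 320x240, a 16x16 block at (16, 16) passes the test, while a block at (0, 0) does not and would only be offered intra-layer modes.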
To improve compression performance, the standard provides many coding modes, which can be divided into two broad categories: intra-layer prediction modes and inter-layer prediction modes. The intra-layer prediction modes are the same as the coding modes of H.264/AVC; the inter-layer prediction modes obtain prediction data from the data of the reference layer and are specific to SVC. The application first determines the intra-layer prediction mode and then judges whether to adopt the inter-layer motion information prediction mode.
The flow of an embodiment of the coding mode selection method in layered video coding of the application is shown in Fig. 3 and comprises:
S310: acquire the motion information of the intra-layer prediction mode selected for the enhancement-layer coding block.
Acquiring the motion information of the intra-layer mode means selecting a suitable intra-layer mode from the many intra-layer prediction modes, coding with that mode, and taking the resulting motion information.
In video coding, motion information comprises a motion vector and a reference index.
As shown in Fig. 4, when the current block of the current frame is mapped onto the reference frame, the mapped position does not necessarily coincide with the position of the matching block in the reference frame. There is an offset, and this offset is represented by the motion vector. The contents of adjacent frames in a video are very similar. To improve coding efficiency, predictive coding can be used: the coding block is differenced against the matching block in the reference frame to obtain a difference value, the residual, so that only the residual needs to be processed in subsequent coding. The two frames are nevertheless captured a certain interval apart, and objects in the image move during this interval. An object in the current frame therefore does not coincide with the same object when mapped directly onto the reference frame; it is displaced by some offset, which the motion vector represents. The arrow in Fig. 4 represents the motion vector. It is a two-dimensional quantity, written for example as (x, y), where x and y are the horizontal and vertical motion respectively.
To obtain the motion vector, the best matching block is selected within the search range of the reference frame, that is, the block that minimizes the performance metric. The reference index range is a coding control parameter that the encoder can set in advance. The performance metric here measures the degree of matching; it can be SAD or SSD (sum of squared differences), among others. Once the search range is set, how the search points are chosen is a matter of the encoder's specific coding strategy, and many such strategies exist. The easiest to understand is an exhaustive search of the search range: the performance metric is computed for every search point and the point with the smallest metric is selected.
The reference index indicates which of the preceding reference frames contains the matching block. It is determined in much the same way as the motion vector: different reference frames are searched, and the one that gives the best performance yields the reference index.
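The exhaustive search over search points and reference frames can be sketched as below. This is a minimal illustration under several assumptions: frames are nested lists of luma values, the block is square, SAD is the metric, candidates falling outside the frame are skipped, and the first candidate with the smallest cost wins. All names are hypothetical; a real encoder would use a faster search strategy.

```python
def block_sad(cur, ref, bx, by, dx, dy, n):
    """SAD between the n x n block at (bx, by) in `cur` and the block
    displaced by (dx, dy) in `ref`."""
    return sum(
        abs(cur[by + j][bx + i] - ref[by + dy + j][bx + dx + i])
        for j in range(n)
        for i in range(n)
    )


def full_search(cur, ref_frames, bx, by, n, search_range):
    """Try every displacement within +/- search_range in every reference
    frame and return (cost, (mv_x, mv_y), ref_idx) with the smallest SAD."""
    height, width = len(cur), len(cur[0])
    best = None
    for ref_idx, ref in enumerate(ref_frames):
        for dy in range(-search_range, search_range + 1):
            for dx in range(-search_range, search_range + 1):
                x, y = bx + dx, by + dy
                # Skip candidates that fall outside the frame (assumption).
                if x < 0 or y < 0 or x + n > width or y + n > height:
                    continue
                cost = block_sad(cur, ref, bx, by, dx, dy, n)
                if best is None or cost < best[0]:
                    best = (cost, (dx, dy), ref_idx)
    return best
```

The returned motion vector and reference index together form the intra-layer motion information that the later consistency check uses.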
S320: on the basis of the intra-layer prediction mode motion information, acquire the motion information of the inter-layer prediction mode for the enhancement-layer coding block.
The motion information of the inter-layer mode for the enhancement-layer coding block can be obtained by extracting the inter-layer predicted motion information from the data of the corresponding coding block in the base layer.
A macroblock has different prediction modes when inter-frame prediction is performed. Fig. 5 shows the inter-frame prediction modes, which are also the inter-frame intra-layer prediction modes. There are 7 of them, distinguished by partition size: 16x16, 16x8, 8x16, 8x8, 8x4, 4x8 and 4x4. In each mode the macroblock is divided into different partitions, and each partition has its own motion information, that is, its own motion vector and reference index. In the 16x8 mode, for example, there are 2 partitions, each with its own motion vector and reference index.
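The relationship between partition size and the number of motion-information sets per macroblock can be illustrated with a short calculation. The sketch assumes a 16x16 macroblock that is partitioned uniformly with a single partition size; the constant and function names are hypothetical.

```python
MACROBLOCK_SIZE = 16

# The seven partition sizes listed above, as (width, height).
INTER_PARTITION_MODES = [
    (16, 16), (16, 8), (8, 16), (8, 8), (8, 4), (4, 8), (4, 4),
]


def partitions_per_macroblock(width, height):
    """Number of partitions, and hence of (motion vector, reference index)
    pairs, when the whole macroblock uses one partition size."""
    return (MACROBLOCK_SIZE // width) * (MACROBLOCK_SIZE // height)


counts = {
    f"{w}x{h}": partitions_per_macroblock(w, h)
    for w, h in INTER_PARTITION_MODES
}
```

The 16x8 entry evaluates to 2, matching the two partitions mentioned in the text.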
Once the SVC encoder has selected an intra-layer prediction mode, that is, one of the modes in Fig. 5, the inter-layer motion prediction information of each coding block is then obtained, which can be done in the following steps:
A1: find the block in the base layer that corresponds to the enhancement-layer coding block. The corresponding block can be computed from the pixel coordinate (1, 1) of each enhancement-layer coding block. A coding block is a rectangle containing a number of pixels, which are identified by coordinates, the upper-left point generally being defined as position (0, 0). The computation follows formulas (1) and (2). The 4x4 block of the base layer that contains the point (B_x, B_y) is the block corresponding to the enhancement-layer coding block.

$$B_x = \mathrm{round}\left(\frac{E_x \cdot D_x + 2^{S-1}}{2^{S}}\right), \quad B_y = \mathrm{round}\left(\frac{E_y \cdot D_y + 2^{S-1}}{2^{S}}\right) \tag{1}$$

$$D_x = \mathrm{round}\left(\frac{2^{S} \cdot W_b}{W_c}\right), \quad D_y = \mathrm{round}\left(\frac{2^{S} \cdot H_b}{H_c}\right) \tag{2}$$

E_x and E_y are the enhancement-layer pixel position; B_x and B_y are the position in the base layer that corresponds to the enhancement-layer pixel (E_x, E_y); S is the computational precision, generally 16; round() is a rounding operation, for example rounding to the nearest integer.
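Formulas (1) and (2) can be evaluated in integer arithmetic as sketched below. Two interpretations here are assumptions rather than statements from the text: round() in formula (2) is taken as round-to-nearest, and the outer round() in formula (1) is taken as truncation by a right shift, on the reasoning that the added 2^(S-1) term already acts as the rounding offset. Function names are hypothetical.

```python
def rounded_ratio(numerator, denominator, s):
    """round(2^s * numerator / denominator), rounding to nearest,
    computed with integers only."""
    return ((numerator << s) + denominator // 2) // denominator


def base_layer_position(ex, ey, wb, hb, wc, hc, s=16):
    """Map enhancement-layer pixel (ex, ey) to base-layer pixel (bx, by)
    following formulas (1) and (2)."""
    dx = rounded_ratio(wb, wc, s)  # formula (2), horizontal
    dy = rounded_ratio(hb, hc, s)  # formula (2), vertical
    half = 1 << (s - 1)            # the 2^(S-1) rounding offset
    bx = (ex * dx + half) >> s     # formula (1), horizontal
    by = (ey * dy + half) >> s     # formula (1), vertical
    return bx, by


def containing_4x4_block(bx, by):
    """Index (column, row) of the 4x4 base-layer block containing (bx, by)."""
    return bx // 4, by // 4
```

With W_b = 176, H_b = 144, W_c = 352 and H_c = 288, the enhancement-layer pixel (17, 33) maps to base-layer pixel (9, 17), which lies in the 4x4 block with index (2, 4).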
A2: obtain the motion information of the corresponding block in the base layer. This motion information comprises a motion vector and reference index information.
The reference index information in the motion information of the base-layer corresponding block determined above is used as the reference index information of the inter-layer prediction mode for the enhancement-layer coding block.
A3: the motion vector of the base-layer corresponding block, after scaling, is used as the motion vector of the inter-layer prediction mode for the enhancement-layer coding block.
The scaling is given by formula (3):

$$Mv\_EL_x = \mathrm{round}\left(Mv\_BL_x \cdot \frac{W_c}{W_b}\right), \quad Mv\_EL_y = \mathrm{round}\left(Mv\_BL_y \cdot \frac{H_c}{H_b}\right) \tag{3}$$

Here Mv_EL is the motion vector after scaling and Mv_BL is the motion vector before scaling. Mv_BL_x and Mv_BL_y are the components of the motion vector of the base-layer corresponding block, x denoting the horizontal component and y the vertical component. Mv_EL_x and Mv_EL_y are the components of the scaled motion vector, which is the motion vector in the motion information of the inter-layer prediction mode.
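Formula (3) can be sketched with integer arithmetic as follows. The text does not say how round() treats negative values or exact halves, so rounding halves away from zero is an assumption of this sketch, as are the function names.

```python
def round_div(numerator, denominator):
    """Round numerator / denominator to the nearest integer, with halves
    rounded away from zero (assumption). The denominator must be positive."""
    sign = -1 if numerator < 0 else 1
    return sign * ((2 * abs(numerator) + denominator) // (2 * denominator))


def scale_motion_vector(mv_bl, wc, wb, hc, hb):
    """Scale a base-layer motion vector (x, y) to the enhancement layer
    following formula (3)."""
    mv_bl_x, mv_bl_y = mv_bl
    mv_el_x = round_div(mv_bl_x * wc, wb)
    mv_el_y = round_div(mv_bl_y * hc, hb)
    return mv_el_x, mv_el_y
```

For a resolution ratio of 2 in both directions the scaling reduces to doubling each component, so (3, -5) becomes (6, -10).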
S330: select the coding mode by judging whether the motion information of the intra-layer mode selected for the enhancement-layer coding block is consistent with the predicted motion information of the inter-layer mode.
Building on the two preceding steps, the final coding mode is selected by checking whether the motion information of the intra-layer mode is consistent with the predicted motion information of the inter-layer mode.
Specifically, if the two are consistent, the block is coded in the inter-layer motion information prediction mode; otherwise it is coded in the intra-layer mode.
In SVC coding, a coding block is represented by motion information and residual information. The motion information indicates the prediction data corresponding to the coding block, and the residual is the difference between the coding block and that prediction data. Inter-layer motion information prediction is one of the inter-layer prediction modes in SVC. It effectively reduces the number of bits needed to represent the motion information, but brings no obvious gain in representing the residual. The application avoids computing performance metrics during mode selection: it only checks whether the motion information of the inter-layer mode and of the intra-layer mode is consistent, and selects the coding mode accordingly, which reduces the amount of calculation. Moreover, because the inter-layer prediction mode is chosen only when its motion information agrees with that of the intra-layer mode, image coding quality is also preserved.
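The decision in S330 amounts to an equality test on motion information, as the following sketch shows. It assumes that each partition's motion information is a (motion vector, reference index) pair and that "consistent" means exact equality for every partition; the class and function names are hypothetical.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class MotionInfo:
    mv: Tuple[int, int]  # (horizontal, vertical) motion vector
    ref_idx: int         # reference index


def select_coding_mode(
    intra_layer: Sequence[MotionInfo],
    inter_layer_pred: Sequence[MotionInfo],
) -> str:
    """Pick the mode by comparing motion information only; no SAD or RD
    cost is computed at this stage."""
    consistent = len(intra_layer) == len(inter_layer_pred) and all(
        a == b for a, b in zip(intra_layer, inter_layer_pred)
    )
    return "inter_layer_motion_prediction" if consistent else "intra_layer"
```

For a 16x8 block with two partitions, the inter-layer mode is chosen only when both partitions match; a single differing reference index is enough to fall back to the intra-layer mode.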
The above method embodiment is illustrated below with an example.
As shown in Fig. 6, in spatially scalable two-layer coding, the ratio of the enhancement-layer resolution to the base-layer resolution is 2. This specific example follows the steps shown in Fig. 7:
S710: obtain the motion information of the intra-layer prediction mode of A1; the motion vector is Mv_A1 and the reference index is RIdx_A1.
As described above, the motion vector is obtained by selecting the best matching block within the search range of the reference frame so that the performance metric is minimized. The reference index range is a coding control parameter that the encoder can set in advance. The performance metric measures the degree of matching and can be SAD, SSD or similar.
A1 is a coding block of the enhancement layer whose intra-layer mode has been determined to be the 16x16 mode. Suppose the motion vector obtained in the manner described is Mv_A1 and the reference index is RIdx_A1.
S720: on the basis of the intra-layer prediction mode motion information of A1, obtain the motion information of the inter-layer prediction mode; the motion vector is MvPred_A1 = 2*Mv_A0 and the reference index is RIdxPred_A1 = RIdx_A0.
Note that base layer and enhancement layer are relative concepts. Take a three-layer case with layers A0, A1 and A2 from bottom to top. The reference layer of layer A2 is layer A1, and the reference layer of layer A1 is layer A0. Layer A1 is therefore an enhancement layer relative to layer A0 and a base layer relative to layer A2. Although A0 is in the lower layer, it is also a coding block, also has to be coded, and also has motion information.
Using formulas (1) and (2) above, the reference block in the base layer corresponding to A1 is computed to be A0, a block of size 8x8. After A0 is coded, its motion vector is Mv_A0 and its reference index is RIdx_A0.
In inter-layer motion information prediction, the motion vector of the base-layer corresponding block is scaled and used as the motion vector of the inter-layer prediction mode for the enhancement-layer coding block. The motion vector and reference index of the inter-layer prediction mode for the enhancement-layer coding block are therefore MvPred_A1 = 2*Mv_A0 and RIdxPred_A1 = RIdx_A0, where 2 is the scaling factor between the enhancement layer and the base layer.
S730: judge whether the motion vector and reference index of the intra-layer mode are consistent with the motion vector and reference index of the inter-layer mode. If they are consistent, select the inter-layer motion information prediction mode; if not, select the intra-layer coding mode.
Here, if MvPred_A1 = Mv_A1 and RIdxPred_A1 = RIdx_A1, the coding block is judged to select the inter-layer motion information prediction mode; if MvPred_A1 ≠ Mv_A1 or RIdxPred_A1 ≠ RIdx_A1, the intra-layer coding mode is selected.
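The example of steps S710 to S730 can be traced with concrete numbers. The values below are invented purely for illustration, and the function names are hypothetical; only the relations MvPred_A1 = 2*Mv_A0 and RIdxPred_A1 = RIdx_A0 come from the text.

```python
def predict_from_base(mv_a0, ridx_a0, ratio=2):
    """S720: scale the base-layer motion vector by the resolution ratio
    and reuse the base-layer reference index."""
    mv_pred_a1 = (ratio * mv_a0[0], ratio * mv_a0[1])
    ridx_pred_a1 = ridx_a0
    return mv_pred_a1, ridx_pred_a1


def choose_mode(mv_a1, ridx_a1, mv_a0, ridx_a0, ratio=2):
    """S730: select the mode from the consistency of motion information."""
    mv_pred_a1, ridx_pred_a1 = predict_from_base(mv_a0, ridx_a0, ratio)
    if mv_pred_a1 == mv_a1 and ridx_pred_a1 == ridx_a1:
        return "inter_layer_motion_prediction"
    return "intra_layer"
```

With Mv_A0 = (3, -2) and RIdx_A0 = 0 the prediction is MvPred_A1 = (6, -4), RIdxPred_A1 = 0. An intra-layer result of Mv_A1 = (6, -4), RIdx_A1 = 0 selects the inter-layer motion information prediction mode, while Mv_A1 = (6, -3) or RIdx_A1 = 1 selects the intra-layer mode.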
The prior-art coding mode selection method does not take into account the data relationship between the inter-layer prediction mode and the intra-layer prediction mode. It computes the performance metrics of both and then compares them, which wastes computation. The above embodiment avoids computing performance metrics during mode selection: it only checks whether the motion information of the inter-layer mode and of the intra-layer mode is consistent, and selects the coding mode accordingly, which reduces the amount of calculation while preserving image coding quality.
An embodiment of the coding mode selection device in layered video coding of the application is introduced below. As shown in Fig. 8, the device embodiment comprises:
a first acquiring unit 81, configured to acquire the motion information of the intra-layer prediction mode selected for the enhancement-layer coding block;
a second acquiring unit 82, configured to acquire, on the basis of the intra-layer prediction mode motion information, the motion information of the inter-layer prediction mode for the enhancement-layer coding block;
a selecting unit 83, configured to select the coding mode by judging whether the motion information of the intra-layer mode is consistent with the predicted motion information of the inter-layer mode: if the two are consistent, coding in the inter-layer motion information prediction mode; otherwise coding in the intra-layer mode.
Preferably, the device embodiment can be as shown in Fig. 9, in which the second acquiring unit 82 comprises:
a searching unit 821, configured to find the block in the base layer that corresponds to the enhancement-layer coding block under its determined intra-layer prediction mode;
a third acquiring unit 822, configured to obtain the motion information of the corresponding block in the base layer, this motion information comprising a motion vector and reference index information;
a scaling unit 823, configured to scale the motion vector of the base-layer corresponding block and use the result as the motion vector of the inter-layer prediction mode for the enhancement-layer coding block.
Preferably, the motion information comprises a motion vector and a reference index, and the size of the enhancement-layer coding block is one of 16x16, 16x8, 8x16, 8x8, 8x4, 4x8 and 4x4.
Preferably, the searching unit 821 determining the corresponding block of the enhancement-layer coding block in the base layer may specifically comprise:
Starting from the (1,1) position of each enhancement-layer coding block, calculating (B_x, B_y) by the following formulas (1) and (2):
$$B_x = \mathrm{round}\left(\frac{E_x \cdot D_x + 2^{S-1}}{2^{S}}\right), \quad B_y = \mathrm{round}\left(\frac{E_y \cdot D_y + 2^{S-1}}{2^{S}}\right) \tag{1}$$
$$D_x = \mathrm{round}\left(\frac{2^{S} \cdot W_b}{W_c}\right), \quad D_y = \mathrm{round}\left(\frac{2^{S} \cdot H_b}{H_c}\right) \tag{2}$$
The 4x4 block in the base layer that contains the point (B_x, B_y) is determined to be the corresponding block of the enhancement-layer coding block in the base layer.
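Formulas (1) and (2) can be tried out with the short sketch below. Several things in it are assumptions rather than statements from the source: Wb x Hb is taken to be the base-layer picture size and Wc x Hc the enhancement-layer picture size, (E_x, E_y) is taken to be the enhancement-layer sample position being mapped, the precision S defaults to 16, `round` is read as round-half-up, and 4x4 blocks are assumed to lie on a grid aligned to multiples of 4. All function names are hypothetical.

```python
import math
from fractions import Fraction


def round_half_up(x: Fraction) -> int:
    """Round to the nearest integer, ties going up (assumed meaning of 'round')."""
    return math.floor(x + Fraction(1, 2))


def base_layer_position(ex, ey, wb, hb, wc, hc, s=16):
    """Map an enhancement-layer position (ex, ey) to (B_x, B_y) per formulas (1), (2)."""
    dx = round_half_up(Fraction((1 << s) * wb, wc))  # formula (2)
    dy = round_half_up(Fraction((1 << s) * hb, hc))
    bx = round_half_up(Fraction(ex * dx + (1 << (s - 1)), 1 << s))  # formula (1)
    by = round_half_up(Fraction(ey * dy + (1 << (s - 1)), 1 << s))
    return bx, by


def corresponding_4x4_block(bx, by):
    """Top-left corner of the 4x4 base-layer block containing (bx, by)."""
    return (bx // 4) * 4, (by // 4) * 4
```

With a 176x144 base layer and a 352x288 enhancement layer, the position (17, 33) maps to (9, 17), which lies in the 4x4 block whose top-left corner is (8, 16). Exact rational arithmetic is used so that the rounding behaviour does not depend on floating-point error.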
Preferably, the third acquiring unit 822 obtaining the motion information of the corresponding block in the base layer may specifically comprise:
Using the reference index information in the motion information of the base-layer corresponding block determined above as the reference index information of the inter-layer prediction mode selected for the enhancement-layer coding block.
The motion vector of the base-layer corresponding block may be scaled by the following formula (3) and then used as the motion vector of the inter-layer prediction mode selected for the enhancement-layer coding block:
$$\mathrm{Mv\_EL}_x = \mathrm{round}\left(\frac{\mathrm{Mv\_BL}_x \cdot W_c}{W_b}\right), \quad \mathrm{Mv\_EL}_y = \mathrm{round}\left(\frac{\mathrm{Mv\_BL}_y \cdot H_c}{H_b}\right) \tag{3}$$
Here Mv_EL is the motion vector after scaling and Mv_BL is the motion vector before scaling. Mv_BLx and Mv_BLy are the components of the motion vector of the base-layer corresponding block, where x denotes the horizontal component and y the vertical component. Mv_ELx and Mv_ELy are the components of the scaled motion vector, that is, the motion vector in the inter-layer prediction mode motion information.
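As an illustration of formula (3), the following sketch scales a base-layer motion vector to the enhancement-layer resolution. It assumes that Wb x Hb and Wc x Hc are the base-layer and enhancement-layer picture sizes and that `round` means round-half-up (so that -4.5 rounds to -4); the source does not state how ties or negative values are rounded. The function names are hypothetical.

```python
import math
from fractions import Fraction


def round_half_up(x: Fraction) -> int:
    """Round to the nearest integer, ties going up (assumed meaning of 'round')."""
    return math.floor(x + Fraction(1, 2))


def scale_motion_vector(mv_bl, wb, hb, wc, hc):
    """Scale a base-layer motion vector (Mv_BLx, Mv_BLy) per formula (3)."""
    mv_blx, mv_bly = mv_bl
    mv_elx = round_half_up(Fraction(mv_blx * wc, wb))  # horizontal component
    mv_ely = round_half_up(Fraction(mv_bly * hc, hb))  # vertical component
    return mv_elx, mv_ely
```

For dyadic scalability (176x144 to 352x288) the vector (3, -5) simply doubles to (6, -10). For a 1.5x ratio (320x240 to 480x360) the vector (3, -3) becomes (5, -4) under the round-half-up assumption.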
Preferably, when the first acquiring unit 81 obtains the motion information of the in-layer prediction mode selected for the enhancement-layer coding block, it may select, within the search range of the reference frame, the best matching block, that is, the block that minimizes the performance parameter.
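The search for the best matching block can be sketched as an exhaustive block-matching loop. The source does not say which performance parameter is minimized or how the search is organized; the sum of absolute differences (SAD) and the full search used here are assumptions chosen for simplicity, and the function names are hypothetical.

```python
def sad(cur_block, ref_frame, rx, ry):
    """Sum of absolute differences between cur_block and the reference block at (rx, ry)."""
    return sum(
        abs(cur_block[j][i] - ref_frame[ry + j][rx + i])
        for j in range(len(cur_block))
        for i in range(len(cur_block[0]))
    )


def full_search(cur_block, ref_frame, x0, y0, search_range):
    """Return (motion_vector, cost) of the candidate with the lowest SAD.

    (x0, y0) is the block's top-left position in the current frame; candidates
    that would fall outside the reference frame are skipped.
    """
    h, w = len(cur_block), len(cur_block[0])
    frame_h, frame_w = len(ref_frame), len(ref_frame[0])
    best_mv, best_cost = None, None
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            rx, ry = x0 + dx, y0 + dy
            if rx < 0 or ry < 0 or rx + w > frame_w or ry + h > frame_h:
                continue
            cost = sad(cur_block, ref_frame, rx, ry)
            if best_cost is None or cost < best_cost:
                best_mv, best_cost = (dx, dy), cost
    return best_mv, best_cost
```

A practical encoder would typically use a rate-distortion cost and a faster search pattern; the loop above only shows the "minimize a cost over the search range" idea.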
Although the invention has been described by way of embodiments, those of ordinary skill in the art will appreciate that the invention admits many modifications and variations without departing from its spirit, and it is intended that the appended claims cover such modifications and variations.

Claims (10)

1. A method for selecting a coding mode in layered video coding, characterized by comprising:
Obtaining the motion information of the in-layer prediction mode selected for an enhancement-layer coding block;
Obtaining, on the basis of the in-layer prediction mode motion information, the motion information of the inter-layer prediction mode selected for the enhancement-layer coding block;
Selecting the coding mode by judging whether the motion information of the in-layer mode is consistent with the predicted motion information of the inter-layer mode: if the two are consistent, the inter-layer motion information prediction mode is used for coding; otherwise the in-layer mode is used for coding.
2. The method of claim 1, characterized in that obtaining, on the basis of the in-layer prediction mode motion information, the motion information of the inter-layer prediction mode selected for the enhancement-layer coding block comprises:
Searching for the corresponding block of the enhancement-layer coding block in the base layer;
Obtaining the motion information of the corresponding block in the base layer, this motion information comprising a motion vector and reference index information;
Scaling the motion vector of the base-layer corresponding block and using the scaled vector as the motion vector of the inter-layer prediction mode selected for the enhancement-layer coding block.
3. The method of claim 2, characterized in that searching for and determining the corresponding block of the enhancement-layer coding block in the base layer comprises:
Starting from the (1,1) position of each enhancement-layer coding block, calculating (B_x, B_y) by the following formulas (1) and (2):
$$B_x = \mathrm{round}\left(\frac{E_x \cdot D_x + 2^{S-1}}{2^{S}}\right), \quad B_y = \mathrm{round}\left(\frac{E_y \cdot D_y + 2^{S-1}}{2^{S}}\right) \tag{1}$$
$$D_x = \mathrm{round}\left(\frac{2^{S} \cdot W_b}{W_c}\right), \quad D_y = \mathrm{round}\left(\frac{2^{S} \cdot H_b}{H_c}\right) \tag{2}$$
Determining the 4x4 block in the base layer that contains the point (B_x, B_y) to be the corresponding block of the enhancement-layer coding block in the base layer.
4. The method of claim 2, characterized in that obtaining the reference index information in the motion information of the corresponding block in the base layer comprises:
Using the reference index information in the motion information of the base-layer corresponding block determined above as the reference index information of the inter-layer prediction mode selected for the enhancement-layer coding block.
5. The method of claim 2, characterized in that scaling the motion vector of the base-layer corresponding block and using the scaled vector as the motion vector of the inter-layer prediction mode selected for the enhancement-layer coding block comprises:
Scaling the motion vector of the base-layer corresponding block by the following formula (3) and using the result as the motion vector of the inter-layer prediction mode selected for the enhancement-layer coding block:
$$\mathrm{Mv\_EL}_x = \mathrm{round}\left(\frac{\mathrm{Mv\_BL}_x \cdot W_c}{W_b}\right), \quad \mathrm{Mv\_EL}_y = \mathrm{round}\left(\frac{\mathrm{Mv\_BL}_y \cdot H_c}{H_b}\right) \tag{3}$$
Here Mv_EL is the motion vector after scaling and Mv_BL is the motion vector before scaling; Mv_BLx and Mv_BLy are the components of the motion vector of the base-layer corresponding block, where x denotes the horizontal component and y the vertical component; Mv_ELx and Mv_ELy are the components of the scaled motion vector.
6. The method of claim 1, characterized in that, when the motion information of the in-layer prediction mode selected for the enhancement-layer coding block is obtained, the best matching block, which minimizes the performance parameter, is selected within the search range of the reference frame.
7. The method of any one of claims 1 to 6, characterized in that the size of the enhancement-layer coding block is one of 16x16, 16x8, 8x16, 8x8, 8x4, 4x8 and 4x4.
8. A device for selecting a coding mode in layered video coding, characterized by comprising:
A first acquiring unit, configured to obtain the motion information of the in-layer prediction mode selected for an enhancement-layer coding block;
A second acquiring unit, configured to obtain, on the basis of the in-layer prediction mode motion information, the motion information of the inter-layer prediction mode selected for the enhancement-layer coding block;
A selecting unit, configured to select the coding mode by judging whether the motion information of the in-layer mode is consistent with the predicted motion information of the inter-layer mode: if the two are consistent, the inter-layer motion information prediction mode is used for coding; otherwise the in-layer mode is used for coding.
9. The device of claim 8, characterized in that the second acquiring unit comprises:
A searching unit, configured to search for the corresponding block of the enhancement-layer coding block in the base layer;
A third acquiring unit, configured to obtain the motion information of the corresponding block in the base layer, this motion information comprising a motion vector and reference index information;
A scaling unit, configured to scale the motion vector of the base-layer corresponding block and use the scaled vector as the motion vector of the inter-layer prediction mode selected for the enhancement-layer coding block.
10. The device of claim 8, characterized in that the motion information comprises a motion vector and a reference index, and the size of the enhancement-layer coding block is one of 16x16, 16x8, 8x16, 8x8, 8x4, 4x8 and 4x4.
CN 200910165229 2009-08-13 2009-08-13 Method and device for selecting coding mode in layered video coding Active CN101873484B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200910165229 CN101873484B (en) 2009-08-13 2009-08-13 Method and device for selecting coding mode in layered video coding

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200910165229 CN101873484B (en) 2009-08-13 2009-08-13 Method and device for selecting coding mode in layered video coding

Publications (2)

Publication Number Publication Date
CN101873484A true CN101873484A (en) 2010-10-27
CN101873484B CN101873484B (en) 2012-05-30

Family

ID=42998110

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200910165229 Active CN101873484B (en) 2009-08-13 2009-08-13 Method and device for selecting coding mode in layered video coding

Country Status (1)

Country Link
CN (1) CN101873484B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102843554A (en) * 2011-06-21 2012-12-26 乐金电子(中国)研究开发中心有限公司 Interframe image prediction encoding and decoding methods and video encoding and decoding device
CN103237213A (en) * 2013-04-08 2013-08-07 华为技术有限公司 Method for coding videos, method for decoding videos and related devices
WO2014048188A1 (en) * 2012-09-29 2014-04-03 华为技术有限公司 Scalable coding method and device
WO2014139431A1 (en) * 2013-03-12 2014-09-18 Mediatek Inc. Inter-layer motion vector scaling for scalable video coding
CN104539970A (en) * 2014-12-21 2015-04-22 北京工业大学 3D-HEVC interframe coding merge mode fast decision making method
CN105075260A (en) * 2013-02-25 2015-11-18 Lg电子株式会社 Method for encoding video of multi-layer structure supporting scalability and method for decoding same and apparatus therefor
CN108111846A (en) * 2012-11-15 2018-06-01 联发科技股份有限公司 For the interframe layer prediction method and device of scalable video
CN108337522A (en) * 2011-06-15 2018-07-27 韩国电子通信研究院 Scalable coding/decoding method/equipment, flexible coding method/equipment and medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101023672A (en) * 2004-07-12 2007-08-22 诺基亚公司 System and method for motion prediction in scalable video coding
EP1694074A1 (en) * 2005-02-18 2006-08-23 Thomson Licensing Process for scalable coding of images
KR100896279B1 (en) * 2005-04-15 2009-05-07 엘지전자 주식회사 Method for scalably encoding and decoding video signal
CN101198064A (en) * 2007-12-10 2008-06-11 武汉大学 Movement vector prediction method in resolution demixing technology

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11838524B2 (en) 2011-06-15 2023-12-05 Electronics And Telecommunications Research Institute Method for coding and decoding scalable video and apparatus using same
US11412240B2 (en) 2011-06-15 2022-08-09 Electronics And Telecommunications Research Institute Method for coding and decoding scalable video and apparatus using same
CN108337522B (en) * 2011-06-15 2022-04-19 韩国电子通信研究院 Scalable decoding method/apparatus, scalable encoding method/apparatus, and medium
CN108391137A (en) * 2011-06-15 2018-08-10 韩国电子通信研究院 Scalable coding/decoding method/equipment, flexible coding method/equipment and medium
CN108391136A (en) * 2011-06-15 2018-08-10 韩国电子通信研究院 Scalable coding/decoding method/equipment, flexible coding method/equipment and medium
CN108391135A (en) * 2011-06-15 2018-08-10 韩国电子通信研究院 Scalable coding/decoding method/equipment, flexible coding method/equipment and medium
CN108337522A (en) * 2011-06-15 2018-07-27 韩国电子通信研究院 Scalable coding/decoding method/equipment, flexible coding method/equipment and medium
CN102843554A (en) * 2011-06-21 2012-12-26 乐金电子(中国)研究开发中心有限公司 Interframe image prediction encoding and decoding methods and video encoding and decoding device
CN103716627B (en) * 2012-09-29 2017-02-22 华为技术有限公司 layered coding and decoding method and device
WO2014048188A1 (en) * 2012-09-29 2014-04-03 华为技术有限公司 Scalable coding method and device
CN108111846A (en) * 2012-11-15 2018-06-01 联发科技股份有限公司 For the interframe layer prediction method and device of scalable video
CN105075260A (en) * 2013-02-25 2015-11-18 Lg电子株式会社 Method for encoding video of multi-layer structure supporting scalability and method for decoding same and apparatus therefor
US9756350B2 (en) 2013-03-12 2017-09-05 Hfi Innovation Inc. Inter-layer motion vector scaling for scalable video coding
CN105052144A (en) * 2013-03-12 2015-11-11 联发科技股份有限公司 Inter-layer motion vector scaling for scalable video coding
CN105052144B (en) * 2013-03-12 2019-01-01 寰发股份有限公司 Inter-layer prediction method for scalable video
WO2014139431A1 (en) * 2013-03-12 2014-09-18 Mediatek Inc. Inter-layer motion vector scaling for scalable video coding
CN103237213B (en) * 2013-04-08 2016-03-30 华为技术有限公司 Method for video coding and video encoding/decoding method and relevant apparatus
CN103237213A (en) * 2013-04-08 2013-08-07 华为技术有限公司 Method for coding videos, method for decoding videos and related devices
CN104539970A (en) * 2014-12-21 2015-04-22 北京工业大学 3D-HEVC interframe coding merge mode fast decision making method

Also Published As

Publication number Publication date
CN101873484B (en) 2012-05-30

Similar Documents

Publication Publication Date Title
CN101873484B (en) Method and device for selecting coding mode in layered video coding
CN104539962B (en) It is a kind of merge visually-perceptible feature can scalable video coding method
TWI344792B (en) A method and apparatus for processing a signal
JP5052134B2 (en) Image encoding method, image decoding method, image encoding device, image decoding device, image encoding program, and image decoding program
CN101860754B (en) Method and device for coding and decoding motion vector
CN104412597B (en) The method and device that unified difference vector for 3D Video codings is derived
CN101159875B (en) Double forecast video coding/decoding method and apparatus
CN107409225A (en) Movable information derivation pattern (DM) determines in video coding
CN101472181B (en) Method, device and system for configurable performance motion estimation for video encoding
CN102025995B (en) Spatial enhancement layer rapid mode selection method of scalable video coding
CN103248895B (en) A kind of quick mode method of estimation for HEVC intraframe coding
CN101022555B (en) Interframe predictive coding mode quick selecting method
CN103891290A (en) Motion vector processing
CN107027025B (en) A kind of light field image compression method based on macro block of pixels adaptive prediction
CN102204254A (en) Methods and apparatus for prediction refinement using implicit motion prediction
CN102752588B (en) Video encoding and decoding method using space zoom prediction
JP2010504010A (en) Data compression method in video sequence
CN101198061A (en) Solid video stream encoding method based on sight point image mapping
CN101389028A (en) Video intra-frame encoding method based on space domain decomposition
CN101883275B (en) Video coding method
CN101959067B (en) Decision method and system in rapid coding mode based on epipolar constraint
CN102088605A (en) Rapid interlayer prediction selection method for scalable video coding
CN102196269B (en) Grayscale image sequence coding method for traffic access monitoring system
CN106791876A (en) A kind of depth map fast intra-frame predicting method based on 3D HEVC
CN101527854A (en) Interframe mode selection method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: HANGZHOU HIKVISION DIGITAL TECHNOLOGY CO., LTD.

Free format text: FORMER OWNER: HANGZHOU HAIKANG WEISHI SOFTWARE CO., LTD.

Effective date: 20121101

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: HANGZHOU, ZHEJIANG PROVINCE TO: 310051 HANGZHOU, ZHEJIANG PROVINCE

TR01 Transfer of patent right

Effective date of registration: 20121101

Address after: Hangzhou City, Zhejiang province 310051 Binjiang District East Road Haikang Science Park No. 700, No. 1

Patentee after: Hangzhou Hikvision Digital Technology Co., Ltd.

Address before: No. 36 Ma Cheng Road, Hangzhou, Zhejiang Province

Patentee before: Hangzhou Haikang Weishi Software Co., Ltd.