WO2016050219A1 - Method of adaptive motion vetor resolution for video coding - Google Patents


Info

Publication number
WO2016050219A1
WO2016050219A1 PCT/CN2015/091275 CN2015091275W WO2016050219A1 WO 2016050219 A1 WO2016050219 A1 WO 2016050219A1 CN 2015091275 W CN2015091275 W CN 2015091275W WO 2016050219 A1 WO2016050219 A1 WO 2016050219A1
Authority
WO
WIPO (PCT)
Prior art keywords
current
resolution
color
shifted
block
Prior art date
Application number
PCT/CN2015/091275
Other languages
French (fr)
Inventor
Xiaozhong Xu
Kai Zhang
Shan Liu
Jicheng An
Xianguo Zhang
Original Assignee
Mediatek Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from PCT/CN2014/088017 external-priority patent/WO2016049894A1/en
Priority claimed from PCT/CN2015/071553 external-priority patent/WO2016119104A1/en
Priority claimed from PCT/CN2015/072175 external-priority patent/WO2016123749A1/en
Priority to CN202111509061.7A priority Critical patent/CN114554199B/en
Priority to CN202010541786.3A priority patent/CN111818334B/en
Priority to CA2961681A priority patent/CA2961681C/en
Application filed by Mediatek Inc. filed Critical Mediatek Inc.
Priority to CN201580052679.1A priority patent/CN107079164B/en
Priority to KR1020207001441A priority patent/KR102115715B1/en
Priority to US15/514,129 priority patent/US10455231B2/en
Priority to KR1020177010070A priority patent/KR102068828B1/en
Priority to EP15847504.6A priority patent/EP3189660B1/en
Publication of WO2016050219A1 publication Critical patent/WO2016050219A1/en
Priority to US16/564,042 priority patent/US10880547B2/en

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/117 Filters, e.g. for pre-processing or post-processing
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124 Quantisation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136 Incoming video signal characteristics or properties
    • H04N19/137 Motion inside a coding unit, e.g. average field, frame or block difference
    • H04N19/139 Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51 Motion estimation or motion compensation
    • H04N19/513 Processing of motion vectors
    • H04N19/517 Processing of motion vectors by encoding
    • H04N19/52 Processing of motion vectors by encoding by predictive encoding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51 Motion estimation or motion compensation
    • H04N19/523 Motion estimation or motion compensation with sub-pixel accuracy
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/86 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness

Definitions

  • the present invention relates to adaptive motion vector resolution in video coding.
  • the present invention relates to applying motion vector prediction depending on the current motion vector resolution, the reference motion vector resolution or both.
  • High Efficiency Video Coding is a new coding standard that has been developed in recent years.
  • the fixed-size macroblock of H.264/AVC is replaced by a flexible block, named coding unit (CU).
  • Pixels in the CU share the same coding parameters to improve coding efficiency.
  • a CU may begin with a largest CU (LCU), which is also referred to as a coding tree unit (CTU) in HEVC.
  • CTU coding tree unit
  • prediction unit PU
  • JCTVC-S0085: Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, 19th Meeting: Strasbourg, FR, 17–24 Oct. 2014, Document: JCTVC-S0085
  • an adaptive MV resolution enable flag (i.e., adaptive_mv_resolution_enabled_flag)
  • SPS sequence parameter set
  • an integer MV flag (i.e., use_integer_mv_flag)
  • use_integer_mv_flag equal to 1 indicates integer pixel resolution; use_integer_mv_flag equal to 0 indicates quarter pixel resolution
  • MV is parsed and decoded in the same way regardless of whether use_integer_mv_flag is 0 or 1 in the current slice.
  • MV is scaled based on use_integer_mv_flag as shown below according to JCTVC-S1005 (Joshi, et al., “High Efficiency Video Coding (HEVC) Screen Content Coding: Draft 2”, Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, 19th Meeting: Strasbourg, FR, 17–24 Oct. 2014, Document: JCTVC-S1005).
  • X is equal to 0 or 1
  • mvLX represents the motion vector of the luma component associated with list LX (i.e., list L0 or L1)
  • mvcLX represents the motion vector of the chroma component associated with list LX (i.e., list L0 or L1)
  • the operation “mvLX <<= 2” means that mvLX is left-shifted by 2 and the result replaces the original mvLX.
  • the operation “mvcLX <<= 2” means that mvcLX is left-shifted by 2 and the result replaces the original mvcLX.
  • TMVP temporal motion vector prediction
  • MVP motion vector predictor
  • Fig. 1 shows an example.
  • use_integer_mv_flag in the current picture (110) is 0
  • use_integer_mv_flag in the collocated picture (120) is 1
  • the collocated MV (124) is equal to (4, 4) for the reference block (122).
  • the collocated MV (4, 4) will be treated as an MVP for the current MV (114) of the current block (112) of the current picture (110) directly if TMVP is used.
  • the collocated MV (4, 4) in the collocated picture represents a motion vector value of (16, 16) when the MV is expressed in the quarter-pixel resolution.
  • use_integer_mv_flag in the current picture is 1, use_integer_mv_flag in the collocated picture is 0, and the collocated MV is still equal to (4, 4).
  • the collocated MV (4, 4) will be treated as an MVP for the current picture directly if TMVP is used.
  • the collocated MV (4, 4) in the collocated picture represents a motion vector value of (1, 1) when the MV is expressed in the integer pixel resolution.
  • the resolution of the MVP does not match the resolution of the current MV to be predicted. This will deteriorate the efficiency of TMVP.
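  • A minimal sketch of the mismatch in Fig. 1, assuming MVs are plain integer pairs. The helper name to_quarter_pel is hypothetical and not part of the disclosure. It expresses a decoded MV in quarter-pel units so the displacement it represents can be compared with the value that unmodified TMVP would reuse:

```python
def to_quarter_pel(mv, use_integer_mv_flag):
    """Return the MV in quarter-pel units (left shift by 2 for integer-pel slices)."""
    shift = 2 if use_integer_mv_flag else 0
    return (mv[0] << shift, mv[1] << shift)


# First case of Fig. 1: the collocated picture uses integer-pel MVs,
# the current picture uses quarter-pel MVs.
collocated_mv = (4, 4)
represented = to_quarter_pel(collocated_mv, use_integer_mv_flag=1)
naive_mvp = collocated_mv  # reused directly, without any scaling

print(represented, naive_mvp)  # (16, 16) (4, 4): the predictor is 4x too small
```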
  • the adaptive MV resolution may also cause an issue in the deblocking process when MVs in the current slice have integer pixel resolution.
  • a boundary strength is determined to select the corresponding deblocking filter for a block boundary.
  • the boundary strength is determined according to various factors. Among the different factors, one factor is related to the motion vectors on both sides of a block boundary.
  • the absolute differences |Q_MVx - P_MVx| and |Q_MVy - P_MVy| are compared to a threshold value of 4 to determine the filtering boundary strength, where Q_MV and P_MV represent motion vectors in two adjacent blocks, Q_MVx and Q_MVy represent the x and y components of Q_MV, and P_MVx and P_MVy represent the x and y components of P_MV.
  • the threshold value of 4 is designed with the assumption that MVs always use quarter-pixel resolution. However, when MVs in the current slice use another MV resolution such as integer pixel resolution, the threshold value of 4 may not be appropriate.
  • IntraBC Intra picture block copy
  • HEVC-SCC HEVC screen content coding
  • the prediction block is obtained from the reconstructed region of the current frame.
  • the block vectors (BVs) and residual are coded.
  • the IntraBC is unified with Inter coding mode. That is, the current picture is treated as a reference picture and inserted into one or both reference picture lists.
  • Block vector prediction and coding are the same as inter motion vector prediction and coding. This unification simplifies the codec design.
  • the block vectors always use integer resolution while motion vectors can use either integer resolution or quarter-pel resolution, switched at slice level. This may lead to a resolution mismatch during the deblocking stage and MV prediction stage.
  • In the HEVC screen content coding (HEVC-SCC) extension, another coding tool, named adaptive color-space transform or in-loop color-space transform, has been adopted.
  • An example of decoding flow for the in-loop color-space transform is shown in Fig. 2.
  • An additional module, i.e., inverse color-space transform (230) is included.
  • Various modules are shown in Fig.
  • the decoder also includes a first switch (S1) to select the inverse color-space transform (in the lower position) or to bypass the inverse color-space transform (in the upper position).
  • the decoder also includes a second switch (S2) to select Inter prediction (in the upper position) or Intra prediction (in the lower position).
  • all other modules are standard decoder modules used in conventional HEVC.
  • the inverse color-space transform is invoked to convert the residual domain back to the original domain for the output from the conventional inverse DCT/DST transform and CCP.
  • a flag is signaled to indicate the usage of color-space transform in a CU.
  • For IntraBC (Intra picture block copy) and Inter modes, the flag is signaled only when there is at least one non-zero coefficient in the current CU.
  • For Intra modes, the flag is signaled only when the chroma mode of the first PU (i.e., top-left PU within the CU) is coded with DM mode.
  • DM mode corresponds to direct mode where Intra mode for the chroma component is the same as the Intra mode used for the luma component.
  • the forward and the inverse color-space transforms for lossy coding use the YCoCg transform matrices, which are defined as follows:
  • the original color space (C0, C1, C2) may correspond to (R, G, B), (Y, Cb, Cr) or (Y, U, V).
  • the forward color-space transform in lossy coding as shown in equation (2) is not normalized, which results in reduced signal when the transform is applied.
  • the norm of the forward transform is roughly equal to for C 0 and C 2
  • delta QPs quantization parameters
  • the quantization parameter is set to (QP-5, QP-3, QP-5) for the three components, respectively, where QP is the 'normal' QP value for the CU without the color-space transform.
  • the QP adjustment to accommodate the signal range reduction caused by the color-space transform is performed for the quantization/de-quantization process.
  • while the deblocking process also utilizes the QP values, only the normal QP values are used for the deblocking process.
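  • A rough sketch of why a non-normalized forward transform calls for a QP reduction. The transform matrices are missing from this text, so the matrix below is an assumption: it is the forward YCoCg matrix (scaled by 1/4) commonly given for lossy HEVC-SCC coding. The sketch also relies on the standard HEVC property that the quantization step size doubles every 6 QP steps. All names are hypothetical:

```python
import math

# ASSUMED forward YCoCg matrix, to be divided by 4 (not taken from this text).
FORWARD = [
    [1, 2, 1],    # first output component
    [2, 0, -2],   # second output component
    [-1, 2, -1],  # third output component
]


def row_norm(row, scale=4):
    """Euclidean norm of one transform row after applying the 1/scale factor."""
    return math.sqrt(sum(c * c for c in row)) / scale


norms = [row_norm(r) for r in FORWARD]
# QP change that would compensate a gain of `n`: 6 QP steps per factor of 2.
delta_qp = [6 * math.log2(n) for n in norms]

for n, d in zip(norms, delta_qp):
    print(f"norm = {n:.4f}, compensating delta QP = {d:.2f}")
```

  • Under these assumptions the second component comes out at exactly -3, which matches the text. The first and third come out near -4.2, so the -5 used in the text does not follow from this estimate by rounding alone.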
  • the quantization parameter qP is derived as follows:
  • cu_residual_act_flag[xTbY][yTbY] is 1 if adaptive color-space transform is applied in the block with top-left position (xTbY, yTbY). Otherwise, cu_residual_act_flag[xTbY][yTbY] is 0.
  • Qp′Y, Qp′Cb and Qp′Cr correspond to the original quantization parameters for color components Y, Cb and Cr respectively.
  • levelScale[k] = {40, 45, 51, 57, 64, 72}, with k = 0..5.
  • the qP adjustment in equations (4), (5) and (6) may cause a qP to be less than 0 when the adaptive color-space transform is applied.
  • When qP < 0, “qP%6” will result in a negative argument for the list levelScale[] and lead to an undefined levelScale[] value. Therefore, d[x][y] is undefined in equation (7), and it is desirable to overcome this issue.
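  • A small sketch of the failure described above. Equations (4) to (7) are not present in this text, so the adjustment is modelled simply as subtracting a per-component delta when the color-space transform flag is set. The helper names are hypothetical. Python's own `%` never returns a negative value for a positive modulus, so the sketch uses an explicit check to model the "undefined" case:

```python
LEVEL_SCALE = [40, 45, 51, 57, 64, 72]


def spec_mod(x, y):
    """Modulus treated as undefined for negative x, as the text describes."""
    if x < 0 or y <= 0:
        raise ValueError(f"{x} % {y} is undefined")
    return x % y


def adjusted_qp(qp, act_flag, delta):
    """Subtract the color-space transform delta only when the flag is set."""
    return qp - delta if act_flag else qp


def level_scale_for(qp):
    """Look up levelScale[qP % 6]; fails when qP is negative."""
    return LEVEL_SCALE[spec_mod(qp, 6)]


print(level_scale_for(adjusted_qp(10, True, 5)))  # qP = 5, prints 72
try:
    level_scale_for(adjusted_qp(3, True, 5))      # qP = -2
except ValueError as err:
    print("undefined:", err)
```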
  • a method of MVP (motion vector prediction) for video coding with adaptive motion vector resolution is disclosed.
  • the MVP coding is applied to the current MV or the current MV is stored depending on the current MV resolution, the reference MV resolution, or both the current MV resolution and the reference MV resolution.
  • the reference MV associated with the reference block in the reference picture corresponds to a temporal MV associated with a temporal reference block in the reference picture.
  • TMVP coding is applied to the current MV using a modified temporal MV as a motion vector predictor for the current MV, where the modified temporal MV is generated by right-shifting the temporal MV.
  • An offset may be added to the temporal MV before the temporal MV is right-shifted to generate the modified temporal MV.
  • the current MV resolution corresponds to integer pixel resolution
  • the current MV is left-shifted before it is stored in a memory.
  • the shifted current MV can be used for the TMVP coding of a subsequent MV.
  • the shifted current MV is shifted back before it is used as a motion vector predictor for another block in a current picture containing the current slice.
  • a syntax element related to the current MV resolution can be stored in a memory for the TMVP coding of a subsequent MV.
  • the TMVP coding can be applied to the current MV using a modified temporal MV as a motion vector predictor for the current MV, where the modified temporal MV is generated by right-shifting the temporal MV.
  • the TMVP coding can be applied to the current MV using a modified temporal MV as a motion vector predictor for the current MV, where the modified temporal MV is generated by left-shifting the temporal MV.
  • the TMVP coding may also be disabled for the current MV or the encoder may disregard the reference picture for the TMVP coding of the current block.
  • the shifted current MV or the shifted temporal MV is clipped to a valid range.
  • the current MV may have different ranges for different current MV resolutions or the temporal MV may have different ranges for different reference MV resolutions.
  • a first syntax element indicating the current MV resolution and a second syntax element indicating the reference MV resolution can be determined by an encoder such that the current MV resolution has the same value as the reference MV resolution.
  • the current MV resolution may also be indicated by an MV resolution flag in a slice header, where all blocks within a corresponding slice share the MV resolution flag and the MV resolution flag has the same value for all slices in a sequence.
  • the current MV resolution can be indicated by an MV resolution flag at a sequence level and all blocks within a corresponding sequence share the MV resolution flag. Therefore, the MV resolution for a current MV and a temporal reference MV will always be the same.
  • the block boundary is deblocked depending on the MV resolution.
  • the MV resolution corresponds to integer resolution
  • the current MV and the neighboring MV are left-shifted by 2 to become a shifted current MV and a shifted neighboring MV
  • the shifted current MV and the shifted neighboring MV are included in determination of boundary strength used for said deblocking.
  • the MV resolution corresponds to half-pixel resolution
  • the current MV and the neighboring MV are left-shifted by 1.
  • the MV resolution corresponds to one-eighth-pixel resolution
  • the current MV and the neighboring MV are right-shifted by one.
  • a first absolute difference in a vertical component between the current MV and the neighboring MV and a second absolute difference in a horizontal component between the current MV and the neighboring MV are compared to a threshold value of 1 instead of 4 to determine boundary strength used for said deblocking.
  • the first absolute difference in a vertical component and the second absolute difference in a horizontal component are compared to a threshold value of 2 instead of 4.
  • the first absolute difference in a vertical component and the second absolute difference in a horizontal component are compared to a threshold value of 8 instead of 4.
  • valid adjusted qPs are generated from the qPs by modifying the qPs to account for the color-space transform and setting any adjusted qP that is smaller than zero to a value equal to or greater than zero.
  • the multiple video components correspond to YCrCb color components
  • Variable nX is 5 for the Y component and the Cb component, nX is 3 for the Cr component, and Max() is a maximum function.
  • valid adjusted qPs can also be generated using a clipping function.
  • (qPX - nX) is clipped to a range from MinQPX to MaxQPX, where MinQPX and MaxQPX correspond to a valid minimum quantization parameter and a valid maximum quantization parameter for one color component respectively.
  • MinQPX can be zero and MaxQPX can be fifty-one for the Y component, the Cb component and the Cr component.
  • qPX’ is generated from qPX according to a function of qPX if the color-space transform is applied to the current coding block, where the function of qPX is always greater than or equal to zero.
  • Fig. 1 illustrates an example of TMVP (temporal motion vector prediction) for a current MV (motion vector) under the condition of different MV resolution between the current MV and the TMVP.
  • TMVP temporal motion vector prediction
  • Fig. 2 illustrates a block diagram of a decoding system incorporating in-loop color-space transform.
  • Fig. 3 illustrates an example of scaling the TMVP when use_integer_mv_flag of the current slice is equal to 1.
  • Fig. 4 illustrates an example of scaling MV before storing the MV when use_integer_mv_flag of the current slice is equal to 1.
  • Fig. 5 illustrates an example in which the MV uses the integer pixel resolution and the MV is left-shifted by 2 before the MV is used for MV comparison in the deblocking process.
  • Fig. 6 illustrates an exemplary flowchart of MVP (motion vector prediction) for video data incorporating an embodiment of the present invention.
  • TMVP temporal motion vector predictor
  • use_integer_mv_flag = 1
  • the operations (x+offx)>>2 and (y+offy)>>2 are performed before (x, y) is used as MVP when TMVP is applied.
  • the “n>>2” operation corresponds to right-shifting n by 2, which is equivalent to “divide by 4”.
  • the “shift by 2” operation can be implemented more efficiently than the “divide by 4” operation.
  • use_integer_mv_flag in the current picture (310) is 1
  • use_integer_mv_flag in the collocated picture (320) is 0, and the collocated MV (324) for the reference block (322) is right-shifted by 2 and used as the TMVP for the current MV (314) of the current block (312) .
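  • A minimal sketch of the right-shift with offset described above, assuming MVs are integer pairs and that the shift is arithmetic (Python's `>>` on negative integers rounds toward negative infinity). The function name is hypothetical:

```python
def tmvp_to_integer_pel(tmvp, offx=0, offy=0):
    """Right-shift a quarter-pel TMVP by 2, optionally adding an offset first."""
    x, y = tmvp
    return ((x + offx) >> 2, (y + offy) >> 2)


print(tmvp_to_integer_pel((16, 16)))       # (4, 4)
print(tmvp_to_integer_pel((6, -6)))        # (1, -2): 1.5 and -1.5 are floored
print(tmvp_to_integer_pel((6, -6), 2, 2))  # (2, -1): an offset of 2 rounds halves up
```

  • The choice of offx and offy therefore controls the rounding of the scaled predictor: 0 floors, while 2 rounds to the nearest integer position.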
  • Decoded MVs of the current slice are stored for use as TMVP by subsequent pictures. For example, if an MV in block B of the current slice is decoded as (x, y), then (x<<2, y<<2) will be stored in the MV buffer for block B, and (x<<2, y<<2) will be treated as TMVP for a following picture if B is the collocated block.
  • the operation “n<<2” corresponds to left-shifting n by 2.
  • the scaled MV stored in the memory can be used for subsequent pictures as TMVP.
  • the scaled MV stored in the memory is right-shifted before it is used as a motion vector predictor (i.e., the spatial motion vector predictor) for another block in the current picture containing the current slice.
  • the scaled MV stored in the memory can be used for the determination of boundary strength which is required for the deblocking process.
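  • The store-then-shift-back behaviour can be sketched as follows. The class MvBuffer and its method names are hypothetical, and block identifiers are plain dictionary keys chosen for illustration:

```python
class MvBuffer:
    """Stores MVs of one slice in quarter-pel units, whatever the slice resolution."""

    def __init__(self, use_integer_mv_flag):
        self.shift = 2 if use_integer_mv_flag else 0
        self._store = {}

    def store(self, block_id, mv):
        # Left-shift before storing so later pictures always see quarter-pel values.
        self._store[block_id] = (mv[0] << self.shift, mv[1] << self.shift)

    def temporal_predictor(self, block_id):
        # Used by a following picture when this block is the collocated block.
        return self._store[block_id]

    def spatial_predictor(self, block_id):
        # Shifted back for another block of the same picture.
        x, y = self._store[block_id]
        return (x >> self.shift, y >> self.shift)


buf = MvBuffer(use_integer_mv_flag=1)
buf.store("B", (3, -2))
print(buf.temporal_predictor("B"))  # (12, -8)
print(buf.spatial_predictor("B"))   # (3, -2)
```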
  • both block vector and motion vector are decoded at integer resolution.
  • both block vector and motion vector are stored at fractional-pel resolution (e.g., quarter-pel resolution) .
  • the decoded motion vector or block vector is left-shifted by 2 to be at quarter-pel resolution, goes through a clipping process (so that the resulting value is within a certain value range, such as between -2^15 and 2^15 - 1) and is stored in quarter-pel resolution.
  • the block vector predictor and motion vector predictor used are at integer resolution. This is done by right-shifting the vector predictor by N (a positive integer number such as 2).
  • both block vector and motion vector are decoded at fractional-pel resolution.
  • both block vector and motion vector are stored at fractional-pel resolution (e.g., quarter-pel resolution) without any shifting operation.
  • the block vector predictor and motion vector predictor used are at fractional-pel resolution (e.g., quarter-pel resolution) .
  • the decoded motion vector or block vector goes through a clipping process (so that the resulting value is within a certain value range, such as between -2^15 and 2^15 - 1) and is stored in quarter-pel resolution. Because the clipping is done in the derivation of the MV (or BV), there is no need to do an extra clipping operation prior to motion compensation (interpolation). Further, when a stored vector is used to predict an integer MV, the predictor should be right-shifted (e.g., by 2) to be in integer-pel resolution before the prediction happens. In one embodiment, when predFlagLX is equal to 1 and the picture with index refIdx from reference picture list LX of the slice is not the current picture, the luma motion vector mvLX is derived as follows:
  • uLX[0] = ((((mvpLX[0] >> (2*use_integer_mv_flag)) + mvdLX[0]) <<
  • uLX[1] = ((((mvpLX[1] >> (2*use_integer_mv_flag)) + mvdLX[1]) <<
  • mvpLX [0] and mvpLX [1] correspond to the motion vector components associated with the motion vector predictor
  • mvdLX [0] and mvdLX [1] correspond to the motion vector differences between the current motion vector and the motion vector predictor in list LX, where X is equal to 0 or 1.
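  • A hedged sketch of this derivation for one vector component. The equations above are truncated in this text, so two things are assumptions here: that the sum is left-shifted back by 2*use_integer_mv_flag (consistent with storage at quarter-pel resolution), and that the result is then clipped to [-2^15, 2^15 - 1] as the preceding paragraph describes. The function name is hypothetical:

```python
MV_MIN = -(1 << 15)
MV_MAX = (1 << 15) - 1


def derive_mv_component(mvp, mvd, use_integer_mv_flag):
    """Predictor (stored in quarter-pel) plus decoded difference, back in quarter-pel."""
    shift = 2 * use_integer_mv_flag
    # Bring the stored predictor to the slice resolution, add the difference,
    # then return to quarter-pel units (ASSUMED tail of the truncated equation).
    mv = ((mvp >> shift) + mvd) << shift
    # ASSUMED clipping step, following the value range quoted in the text.
    return max(MV_MIN, min(mv, MV_MAX))


print(derive_mv_component(17, 3, 1))  # (17 >> 2) = 4, 4 + 3 = 7, 7 << 2 = 28
print(derive_mv_component(17, 3, 0))  # 20
```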
  • the stored block vector or motion vector is used as a vector predictor in the block vector or motion vector prediction. If the slice that contains the to-be-predicted vector uses integer motion vector (the slice level flag “use_integer_mv_flag” is true or equal to 1) , the vector predictor is right shifted by N (N is an integer number, such as 2) to make it at integer resolution, before prediction occurs. If the slice that contains the to-be-predicted vector uses fractional-pel motion vector (the slice level flag “use_integer_mv_flag” is false) , the vector predictor is used directly without any shifting operation.
  • the stored block vector and motion vectors are used as inputs to interpolation filter and deblocking filter.
  • use_integer_mv_flag is stored so that it can be read by following pictures and used to determine whether to scale the decoded MV.
  • TMVP is conducted differently depending on use_integer_mv_flag of the current slice and use_integer_mv_flag of the collocated picture.
  • TMVP can be right shifted before it is used as MVP for the current block. If TMVP corresponds to (x, y), then ((x+offx)>>2, (y+offy)>>2) is used as MVP when TMVP is applied.
  • offx and offy are offsets in shift and they can be any value such as -3, -2, -1, 0, 1, 2, 3, etc.
  • TMVP can be left shifted before it is used as MVP for the current block, when MVs of the current slice have the quarter pixel resolution and MVs of the collocated picture have the integer pixel resolution. If TMVP corresponds to (x, y), then (x<<2, y<<2) should be used as MVP when TMVP is applied.
  • MVs of the current slice and MVs of the collocated picture are forced to have the same resolution.
  • use_integer_mv_flag of the current slice and use_integer_mv_flag of the collocated picture must be the same.
  • TMVP is disabled if MVs of the current slice and MVs of the collocated picture have different resolutions. In other words, if use_integer_mv_flag of the current slice and use_integer_mv_flag of the collocated picture are different, TMVP is disabled for the current MV.
  • a reference picture will not be used as the collocated picture for the current slice if use_integer_mv_flag of the current slice and use_integer_mv_flag of the reference picture are different.
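  • The alternatives above can be combined into one illustrative dispatch. This is a sketch, not the disclosed decoder logic: the function name, the disable_on_mismatch switch and the use of None for "TMVP disabled" are all hypothetical:

```python
def select_tmvp(col_mv, cur_flag, col_flag, offset=(0, 0), disable_on_mismatch=False):
    """Return the MVP derived from the collocated MV, or None if TMVP is disabled.

    cur_flag / col_flag are use_integer_mv_flag of the current slice and of the
    collocated picture (1 = integer-pel, 0 = quarter-pel).
    """
    if cur_flag == col_flag:
        return col_mv                      # same resolution: use as is
    if disable_on_mismatch:
        return None                        # embodiment that disables TMVP
    x, y = col_mv
    if cur_flag == 1:                      # current integer, collocated quarter
        return ((x + offset[0]) >> 2, (y + offset[1]) >> 2)
    return (x << 2, y << 2)                # current quarter, collocated integer


print(select_tmvp((4, 4), cur_flag=0, col_flag=1))  # (16, 16)
print(select_tmvp((4, 4), cur_flag=1, col_flag=0))  # (1, 1)
```

  • The two printed values reproduce the two cases of Fig. 1 with the resolution mismatch removed.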
  • all slices in a sequence are forced to have the same MV pixel resolution.
  • use_integer_mv_flag is the same for all the slices in a sequence.
  • use_integer_mv_flag is transmitted at a sequence level (e.g., in SPS) instead of the slice level. If use_integer_mv_flag at the sequence level has a value of 1, the MV resolution of all slices in the sequence is integer pixel. Otherwise, the MV resolution of all slices in the sequence is quarter pixel.
  • MV is clipped after scaling based on MV pixel resolution.
  • motion vector values MVL0 and MVL1, for list L0 and L1 respectively, should be clipped to [-2^15, 2^15 - 1] after being shifted left by 2.
  • MVL0 = Max(-2^15, Min(MVL0<<2, 2^15 - 1))
  • MVL1 = Max(-2^15, Min(MVL1<<2, 2^15 - 1)).
  • this clip should be done for the MV scaling according to JCTVC-S1005 as shown in equation (1) .
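  • The clipping step above, written out for a single MV component as a short sketch (the function name is hypothetical):

```python
def scale_and_clip(mv_component):
    """Left-shift by 2, then clip to the signed 16-bit range [-2^15, 2^15 - 1]."""
    return max(-(1 << 15), min(mv_component << 2, (1 << 15) - 1))


print(scale_and_clip(5))       # 20
print(scale_and_clip(10000))   # 40000 is out of range, clipped to 32767
print(scale_and_clip(-10000))  # -40000 is out of range, clipped to -32768
```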
  • the decoded MV can be constrained in different ranges depending on whether use_integer_mv_flag of the current slice is 0 or 1.
  • the range can be specified as:
  • the resolution of block vector and motion vector is unified when the current picture is used as one of the reference pictures. That is, the resolution of block vectors in a slice is the same as the resolution of regular motion vectors in the same slice.
  • use_integer_mv_flag = 1
  • MVs use the half pixel resolution
  • the absolute difference between the horizontal or vertical component of the motion vectors is compared with a threshold value of 2 instead of 4.
  • MVs use the eighth pixel resolution
  • the absolute difference between the horizontal or vertical component of the motion vectors is compared with a threshold value of 8 instead of 4.
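  • A sketch comparing the two deblocking variants described in this document: changing the threshold, or shifting the MVs to quarter-pel units and keeping the threshold of 4. The text only says the difference is "compared" with the threshold; using ">=" here is an assumption that follows the usual HEVC convention. All names are hypothetical:

```python
THRESHOLD = {"integer": 1, "half": 2, "quarter": 4, "eighth": 8}
SHIFT = {"integer": 2, "half": 1, "quarter": 0, "eighth": -1}  # negative = right shift


def differs_by_threshold(p_mv, q_mv, resolution):
    """Compare raw MV differences with a resolution-dependent threshold."""
    t = THRESHOLD[resolution]
    return abs(q_mv[0] - p_mv[0]) >= t or abs(q_mv[1] - p_mv[1]) >= t


def to_quarter(v, resolution):
    """Shift one MV component into quarter-pel units."""
    s = SHIFT[resolution]
    return v << s if s >= 0 else v >> -s


def differs_by_shift(p_mv, q_mv, resolution):
    """Shift both MVs to quarter-pel units, then use the fixed threshold of 4."""
    p = [to_quarter(v, resolution) for v in p_mv]
    q = [to_quarter(v, resolution) for v in q_mv]
    return abs(q[0] - p[0]) >= 4 or abs(q[1] - p[1]) >= 4


print(differs_by_threshold((0, 0), (1, 0), "integer"))  # True
print(differs_by_shift((0, 0), (1, 0), "integer"))      # True
```

  • For integer and half-pixel MVs the two variants give the same decision. For one-eighth-pixel MVs they can differ, because the right shift discards the lowest bit: (1, 0) against (8, 0) differs under the shift variant but not under the threshold of 8.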
  • the temporal motion vector prediction has been used as an example for applying MVP coding to the current MV or storing the current MV depending on the current MV resolution, the reference MV resolution, or both the current MV resolution and the reference MV resolution.
  • the present invention may also be applied to other types of MVP such as spatial MVP or inter-view MVP.
  • the color-space transform may also cause an issue for a decoder system when the color-space transform is applied.
  • the received QP quantization parameter
  • the adjusted QP may become negative.
  • QP after adjustment is always no smaller than 0 according to one of the following operations:
  • Max() corresponds to a maximum function in the above equations. In another embodiment, the adjusted QP is clipped to a valid range after the QP is adjusted to accommodate signal range reduction due to the color-space transform.
  • the above equations can be rewritten as follows:
  • clip (a, b, x) is a clipping function that clips the variable x to a range from a to b.
  • MinQPY, MinQPCb and MinQPCr correspond to the minimum clipping values for Y, Cb and Cr respectively.
  • MaxQPY, MaxQPCb and MaxQPCr correspond to the maximum clipping values for Y, Cb and Cr respectively.
  • MinQPY can be set to be 0 and MaxQPY can be set to be 51
  • MinQPCb can be set to be 0 and MaxQPCb can be set to be 51
  • MinQPCr can be set to be 0 and MaxQPCr can be set to be 51.
  • MinQPCb and/or MaxQPCb can be set according to MinQPY and/or MaxQPY respectively.
  • MinQPCr and/or MaxQPCr can be set according to MinQPY and/or MaxQPY respectively.
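  • The equations for these two embodiments are not present in this text. The sketch below is one reading of the description, assuming the per-component reductions of 5, 5 and 3 stated earlier for Y, Cb and Cr, and leaving the QP untouched when the color-space transform is not applied. Function and table names are hypothetical:

```python
ACT_DELTA = {"Y": 5, "Cb": 5, "Cr": 3}  # reductions named in the text


def clip(a, b, x):
    """Clip x to the range [a, b], matching the clip(a, b, x) argument order above."""
    return max(a, min(x, b))


def adjusted_qp_max(qp, comp, act_flag):
    """Max-based variant: never return a value below 0."""
    return max(0, qp - ACT_DELTA[comp]) if act_flag else qp


def adjusted_qp_clip(qp, comp, act_flag, min_qp=0, max_qp=51):
    """Clip-based variant with the example bounds 0 and 51."""
    return clip(min_qp, max_qp, qp - ACT_DELTA[comp]) if act_flag else qp


print(adjusted_qp_max(3, "Y", True))    # 0 instead of -2
print(adjusted_qp_max(10, "Cr", True))  # 7
print(adjusted_qp_clip(60, "Y", True))  # 55 is clipped to 51
```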
  • QP is calculated within a valid range according to a function if adaptive color-space transform is used.
  • equations (4) , (5) and (6) can be rewritten as:
  • fY(Qp′Y) may correspond to (Qp′Y - 5 + OffsetY1 + OffsetY2) % OffsetY1.
  • fCb(Qp′Cb) may correspond to (Qp′Cb - 5 + OffsetCb1 + OffsetCb2) % OffsetCb1.
  • fCr(Qp′Cr) may correspond to (Qp′Cr - 3 + OffsetCr1 + OffsetCr2) % OffsetCr1.
  • OffsetY1, OffsetCb1 and OffsetCr1 can be all set to 51.
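  • A small sketch of the modular mapping with the first offset set to 51. The text does not give a value for the second offset, so 0 is assumed here. The function name is hypothetical:

```python
def f_mod(qp, delta, offset1=51, offset2=0):
    """(qp - delta + offset1 + offset2) % offset1; offset2 = 0 is an assumption."""
    return (qp - delta + offset1 + offset2) % offset1


print(f_mod(10, 5))  # 5: same as a plain subtraction
print(f_mod(3, 5))   # 49: a value that would be negative wraps around instead
```

  • With these values the result always lies in 0..50 for any input QP in 0..51, so the levelScale index stays defined. Note that a small input QP wraps to a large output QP rather than saturating at 0.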
  • a constraint can be applied at the encoder side so that the adjusted QP will not be lower than 0. If equations (4), (5) and (6) yield one or more qP < 0, the bit-stream is considered illegal. In another embodiment, the adaptive color-space transform should be turned off if equations (4), (5) and (6) yield one or more qP < 0.
  • Fig. 6 illustrates an exemplary flowchart of MVP (motion vector prediction) for video data incorporating an embodiment of the present invention.
  • the system receives input data associated with a current MV (motion vector) for a current block in a current slice in step 610.
  • the input data may be retrieved from storage such as a computer memory or buffer (RAM or DRAM).
  • the input data may also be received from a processor such as a processing unit or a digital signal processor.
  • the input data may correspond to the current MV to be predicted.
  • the input data may correspond to coded MV data to be decoded.
  • the current MV resolution for the current MV, reference MV resolution for a reference MV associated with a reference block in a reference picture, or both the current MV resolution and the reference MV resolution are determined in step 620.
  • MVP coding is then applied to the current MV or the current MV is stored depending on the current MV resolution, the reference MV resolution, or both the current MV resolution and the reference MV resolution in step 630.
  • Embodiments of the present invention as described above may be implemented in various hardware, software code, or a combination of both.
  • an embodiment of the present invention can be a circuit integrated into a video compression chip or program code integrated into video compression software to perform the processing described herein.
  • An embodiment of the present invention may also be program code to be executed on a Digital Signal Processor (DSP) to perform the processing described herein.
  • the invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA) .
  • These processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention.
  • the software code or firmware code may be developed in different programming languages and different formats or styles.
  • the software code may also be compiled for different target platforms.
  • different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A method of MVP (motion vector prediction) for video coding with adaptive motion vector resolution is disclosed. According to the method, the MVP coding is applied to the current MV or the current MV is stored depending on the current MV resolution, the reference MV resolution, or both the current MV resolution and the reference MV resolution. In one embodiment, when the current MV resolution corresponds to integer pixel resolution, MVP coding is then applied to the current MV using a modified temporal MV as a motion vector predictor for the current MV, where the modified temporal MV is generated by right-shifting the temporal MV. In another embodiment, when the current MV resolution corresponds to integer pixel resolution, the current MV is left-shifted before it is stored in a memory.

Description

METHOD OF ADAPTIVE MOTION VETOR RESOLUTION FOR VIDEO CODING
CROSS REFERENCE TO RELATED APPLICATIONS
The present invention claims priority to PCT Patent Application, Serial No. PCT/CN2014/088017, filed on September 30, 2014, PCT Patent Application, Serial No. PCT/CN2015/071553, filed on January 26, 2015, PCT Patent Application, Serial No. PCT/CN2015/072175, filed on February 3, 2015, U.S. Provisional Patent Application, Serial No. 62/154,373, filed on April 29, 2015, and U.S. Provisional Patent Application, Serial No. 62/182,685, filed on June 22, 2015. The PCT Patent Applications and the U.S. Provisional Patent Applications are hereby incorporated by reference in their entireties.
TECHNICAL FIELD
The present invention relates to adaptive motion vector resolution in video coding. In particular, the present invention relates to applying motion vector prediction depending on the current motion vector resolution, the reference motion vector resolution or both.
BACKGROUND
High Efficiency Video Coding (HEVC) is a new coding standard that has been developed in recent years. In the High Efficiency Video Coding (HEVC) system, the fixed-size macroblock of H.264/AVC is replaced by a flexible block, named coding unit (CU). Pixels in the CU share the same coding parameters to improve coding efficiency. A CU may begin with a largest CU (LCU), which is also referred to as a coding tree unit (CTU) in HEVC. In addition to the concept of coding unit, the concept of prediction unit (PU) is also introduced in HEVC. Once the splitting of the CU hierarchical tree is done, each leaf CU is further split into one or more prediction units (PUs) according to prediction type and PU partition. Several coding tools for screen content coding have been developed.
Unlike regular live video contents, video materials corresponding to screen contents often have integer displacement values. Therefore, conventional encoders, which normally interpret motion vectors in bitstreams as fractional pixel offsets such as 1/4, may unnecessarily increase the bitrate. On the other hand, fractional motion vector values are still very useful for contents corresponding to natural scenes such as camera-captured contents. Accordingly, adaptive motion vector resolution targeted to address the issue of different video contents has been described in JCTVC-S0085 (Li, et al., “Adaptive motion vector resolution for screen content”, Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11 19th Meeting: Strasbourg, FR, 17–24 Oct. 2014, Document: JCTVC-S0085).
According to JCTVC-S0085, an adaptive MV resolution enable flag (i.e., adaptive_mv_resolution_enabled_flag) is signaled in SPS (sequence parameter set) to indicate whether adaptive motion vector resolution is applied as shown in Table 1. In the slice header, an integer MV flag (i.e., use_integer_mv_flag) is signaled (see note (1-2) in Table 1) to indicate whether a motion vector (MV) in the current slice uses integer pixel resolution (i.e., use_integer_mv_flag = 1) or quarter pixel resolution (i.e., use_integer_mv_flag = 0). As shown in Table 1, the syntax element use_integer_mv_flag is incorporated in the bitstream only when adaptive MV resolution is enabled as indicated by the SPS-level syntax element, adaptive_mv_resolution_enabled_flag (see note (1-1) in Table 1).
Table 1.
Figure PCTCN2015091275-appb-000001
At the decoder side, MV is parsed and decoded in the same way regardless of whether use_integer_mv_flag is 0 or 1 in the current slice. Before the interpolating process, MV is scaled based on use_integer_mv_flag as shown below according to JCTVC-S1005 (Joshi, et al., “High Efficiency Video Coding (HEVC) Screen Content Coding: Draft 2”, Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11 19th Meeting: Strasbourg, FR, 17–24 Oct. 2014, Document: JCTVC-S1005).
if use_integer_mv_flag == 1, then mvLX <<= 2, mvcLX <<= 2.   (1)
In equation (1), X is equal to 0 or 1, mvLX represents the motion vector of the luma component associated with list LX (i.e., list L0 or L1), and mvcLX represents the motion vector of the chroma component associated with list LX (i.e., list L0 or L1). The operation “mvLX <<= 2” means that mvLX is left-shifted by 2 and the result replaces the original mvLX. Similarly, the operation “mvcLX <<= 2” means that mvcLX is left-shifted by 2 and the result replaces the original mvcLX.
There are several problems in the current slice level MV adaptive resolution approach. First, when temporal motion vector prediction (TMVP) is applied and the use_integer_mv_flag in the collocated picture and in the current picture are different, the MV resolution of the motion vector predictor (MVP) and the MV in the current picture being predicted will be mismatched. In this disclosure, the abbreviation MVP may correspond to either motion vector prediction or motion vector predictor depending on the usage context.
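As a minimal sketch of equation (1), the following Python snippet applies the slice-level scaling to one MV. The function name `scale_mv_for_interpolation` and the tuple representation of an MV are illustrative assumptions and are not taken from JCTVC-S1005.

```python
def scale_mv_for_interpolation(mv, use_integer_mv_flag):
    """Equation (1): left-shift both MV components by 2 when the slice
    signals integer-pel MVs, so the result is in quarter-pel units."""
    if use_integer_mv_flag == 1:
        return (mv[0] << 2, mv[1] << 2)
    return mv  # already in quarter-pel units


# An integer-pel MV of (3, -1) pixels becomes (12, -4) quarter-pel units.
scaled = scale_mv_for_interpolation((3, -1), 1)
unchanged = scale_mv_for_interpolation((3, -1), 0)
```

The same function would be applied to the chroma vector mvcLX, since equation (1) shifts both vectors by the same amount.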
Fig. 1 shows an example. In this scenario, use_integer_mv_flag in the current picture (110) is 0, use_integer_mv_flag in the collocated picture (120) is 1, and the collocated MV (124) is equal to (4, 4) for the reference block (122). According to the existing practice, the collocated MV (4, 4) will be treated as an MVP for the current MV (114) of the current block (112) of the current picture (110) directly if TMVP is used. However, the collocated MV (4, 4) in the collocated picture represents a motion vector value of (16, 16) when the MV is expressed in the quarter-pixel resolution.
In another scenario, use_integer_mv_flag in the current picture is 1, use_integer_mv_flag in the collocated picture is 0, and the collocated MV is still equal to (4, 4). According to the existing practice, the collocated MV (4, 4) will be treated as an MVP for the current picture directly if TMVP is used. However, the collocated MV (4, 4) in the collocated picture represents a motion vector value of (1, 1) when the MV is expressed in the integer pixel resolution.
In both examples shown above, the resolution of the MVP does not match with the resolution of the current MV to be predicted. This will deteriorate the efficiency of TMVP.
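The mismatch in the two scenarios can be checked numerically. The sketch below converts a stored MV value into the physical displacement (in luma pixels) that it represents under each flag value; the helper name `displacement_in_pixels` is hypothetical and used only for illustration.

```python
def displacement_in_pixels(mv, use_integer_mv_flag):
    """Physical displacement represented by a stored MV value:
    one unit is a full pixel when the flag is 1, a quarter pixel otherwise."""
    unit = 1.0 if use_integer_mv_flag else 0.25
    return (mv[0] * unit, mv[1] * unit)


collocated_mv = (4, 4)

# Scenario of Fig. 1: collocated picture flag = 1, current picture flag = 0.
intended = displacement_in_pixels(collocated_mv, 1)    # what the collocated MV means
as_reused = displacement_in_pixels(collocated_mv, 0)   # how the current slice reads it

# Second scenario: collocated picture flag = 0, current picture flag = 1.
intended_2 = displacement_in_pixels(collocated_mv, 0)
as_reused_2 = displacement_in_pixels(collocated_mv, 1)
```

In both cases the reused predictor is off by a factor of four relative to the displacement the collocated MV actually encodes.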
Another problem arises due to the bit width limitation. According to JCTVC-S1005, the resulting values of mvL0 and mvL1 as specified above will always be in the range of -2^15 to 2^15-1, inclusive. With this limitation, it is guaranteed that mvL0 and mvL1 can be expressed in two bytes. However, under the condition that use_integer_mv_flag in the current picture is 1, use_integer_mv_flag in the collocated picture is 0 and the collocated MV is equal to (2^15-1, 2^15-1), mvLX will exceed the two-byte limitation after the mvLX <<= 2 and mvcLX <<= 2 operation if the TMVP merging candidate is selected.
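The overflow can be illustrated with a short sketch. The constant names `MV_MIN` and `MV_MAX` and the helper `fits_in_two_bytes` are assumptions introduced for this example only.

```python
MV_MIN = -(1 << 15)      # -2^15
MV_MAX = (1 << 15) - 1   # 2^15 - 1


def fits_in_two_bytes(value):
    """True when the value lies in the signed 16-bit range."""
    return MV_MIN <= value <= MV_MAX


# Collocated MV component stored at quarter-pel resolution (flag = 0).
collocated_component = MV_MAX

# The current slice uses integer MVs (flag = 1), so the merged MV is
# left-shifted by 2 before interpolation, as in equation (1).
shifted_component = collocated_component << 2
```

Here `shifted_component` equals 131068, which no longer fits in two bytes.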
The adaptive MV resolution may also cause an issue in the deblocking process when MVs in the current slice have integer pixel resolution. In the deblocking process, boundary strength is determined to select the corresponding deblocking filter for a block boundary. The boundary strength is determined according to various factors. Among the different factors, one factor is related to the motion vectors on both sides of a block boundary. In particular, |Q_MVx–P_MVx| and |Q_MVy–P_MVy| are compared to a threshold value of 4 to determine the filtering boundary strength, where Q_MV and P_MV represent motion vectors in two adjacent blocks, Q_MVx and Q_MVy represent the x and y components of Q_MV, and P_MVx and P_MVy represent the x and y components of P_MV.
The threshold value of 4 is designed with the assumption that MVs always use quarter-pixel resolution. However, when MVs in the current slice use another MV resolution such as integer pixel resolution, the threshold value of 4 may not be appropriate.
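A small sketch shows why the fixed threshold misbehaves for integer-resolution MVs. The function `mv_difference_exceeds` is hypothetical, and the use of a "greater than or equal to" comparison is an assumption, since the text above only states that the differences are compared to the threshold.

```python
def mv_difference_exceeds(p_mv, q_mv, threshold=4):
    """MV-based boundary strength criterion: flag the edge when either
    component difference reaches the threshold (comparison assumed '>=')."""
    return (abs(q_mv[0] - p_mv[0]) >= threshold or
            abs(q_mv[1] - p_mv[1]) >= threshold)


# Quarter-pel MVs that differ by one full pixel (4 units): the edge is flagged.
quarter_pel_case = mv_difference_exceeds((0, 0), (4, 0))

# Integer-pel MVs that differ by the same one full pixel (1 unit):
# the raw difference is 1, so the fixed threshold of 4 misses it.
integer_pel_case = mv_difference_exceeds((0, 0), (1, 0))
```

The same physical motion difference therefore leads to different filtering decisions depending only on how the MVs happen to be represented.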
IntraBC (Intra picture block copy) is another coding tool for HEVC screen content coding (HEVC-SCC) extension. For the coding units (CUs) using IntraBC mode, the prediction block is obtained from the reconstructed region of current frame. Then, the block vectors (BVs) and residual are coded. In the 20th JCT-VC meeting in Geneva, February 2015, it was agreed that the IntraBC is unified with Inter coding mode. That is, the current picture is treated as a reference picture and inserted into one or both reference picture lists. Block vector prediction and coding are the same as inter motion vector prediction and coding. This unification simplifies the codec design. However there are some remaining issues. For example, in current SCM the block vectors are always using integer resolution while motion vectors can be both integer resolution and quarter-pel resolution, switched at slice level. This may lead to a resolution mismatch during the deblocking stage and MV prediction stage.
In the HEVC screen content coding (HEVC-SCC) extension, another coding tool, named adaptive color-space transform or in-loop color-space transform, has been adopted. An example of the decoding flow for the in-loop color-space transform is shown in Fig. 2. An additional module, i.e., inverse color-space transform (230), is included. Various modules are shown in Fig. 2 including entropy decoder (210), de-quantization (215), inverse transform (220), cross-component prediction (CCP, 225), motion compensation (235), Intra prediction (240), adder (245), deblocking filter (250), SAO (sample adaptive offset) filter (255) and DPB (decoded picture buffer, 260). The decoder also includes a first switch (S1) to select the inverse color-space transform (in the lower position) or bypass the inverse color-space transform (in the upper position). The decoder also includes a second switch (S2) to select Inter prediction (in the upper position) or Intra prediction (in the lower position). Other than the inverse color-space transform (230), all other modules are standard decoder modules used in conventional HEVC. When a block is coded with the color-space transform enabled, the inverse color-space transform is invoked to convert the residual domain back to the original domain for the output from the conventional inverse DCT/DST transform and CCP. A flag is signaled to indicate the usage of color-space transform in a CU. For IntraBC (Intra picture block copy) and Inter modes, the flag is signaled only when there is at least one non-zero coefficient in the current CU. For Intra modes, the flag is signaled only when the chroma mode of the first PU (i.e., top-left PU within the CU) is coded with DM mode. DM mode corresponds to direct mode where the Intra mode for the chroma component is the same as the Intra mode used for the luma component.
Two different color-space transforms are applied depending on whether the CU is coded in a lossless or lossy manner. The forward and the inverse color-space transforms for lossy coding use the YCoCg transform matrices, which are defined as follows:
Forward:
Figure PCTCN2015091275-appb-000002
and     (2)
Inverse:
Figure PCTCN2015091275-appb-000003
     (3)
wherein the original color space (C0, C1, C2) may correspond to (R, G, B) , (Y, Cb, Cr) or (Y, U, V) .
The forward color-space transform in lossy coding as shown in equation (2) is not normalized, which results in reduced signal when the transform is applied. Considering that the norm of the forward transform is roughly equal to
Figure PCTCN2015091275-appb-000004
for C0 and C2, and
Figure PCTCN2015091275-appb-000005
for C1, delta QPs (quantization parameters) of (-5, -3, -5) are used to compensate the reduced signal range for the three color components, respectively. In other words, when the color-space transform is applied, the quantization parameter is set to (QP-5, QP-3, QP-5) for the three components, respectively, where QP is the ′normal′ QP value for the CU without the color-space transform. The QP adjustment to accommodate the signal range reduction due to the color-space transform is performed for the quantization/de-quantization process. On the other hand, while the deblocking process also utilizes the QP values, only the normal QP values are used for the deblocking process.
In the specification of HEVC-SCC, the QP is adjusted as described in sub-clause 8.6.2. The quantization parameter qP is derived as follows:
– If cIdx is equal to 0,
qP = Qp′Y + (cu_residual_act_flag [xTbY] [yTbY] ? -5 : 0)     (4)
– Otherwise, if cIdx is equal to 1,
qP = Qp′Cb + (cu_residual_act_flag [xTbY] [yTbY] ? -5 : 0)     (5)
– Otherwise (cIdx is equal to 2),
qP = Qp′Cr + (cu_residual_act_flag [xTbY] [yTbY] ? -3 : 0)     (6)
where cu_residual_act_flag [xTbY] [yTbY] is 1 if adaptive color-space transform is applied in the block with top-left position (xTbY, yTbY). Otherwise, cu_residual_act_flag [xTbY] [yTbY] is 0. Qp′Y, Qp′Cb and Qp′Cr correspond to the original quantization parameters for color components Y, Cb and Cr respectively. The expression (cu_residual_act_flag [xTbY] [yTbY] ? -5 : 0) will return a value (-5) if cu_residual_act_flag [xTbY] [yTbY] is true or equal to 1, and will return 0 if cu_residual_act_flag [xTbY] [yTbY] is false or equal to 0. The variable cIdx mentioned above corresponds to the color index.
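Equations (4), (5) and (6) can be sketched as a single lookup. The names `ACT_QP_OFFSET` and `derive_qp` are illustrative assumptions; only the offsets themselves come from the equations above.

```python
# Offset applied per color index when cu_residual_act_flag is set:
# cIdx 0 (Y) -> -5, cIdx 1 (Cb) -> -5, cIdx 2 (Cr) -> -3.
ACT_QP_OFFSET = {0: -5, 1: -5, 2: -3}


def derive_qp(qp_prime, c_idx, cu_residual_act_flag):
    """Equations (4)-(6): add the color-space-transform offset only when
    the flag is set for the block; otherwise keep the original QP."""
    offset = ACT_QP_OFFSET[c_idx] if cu_residual_act_flag else 0
    return qp_prime + offset


qp_luma = derive_qp(30, 0, 1)       # 30 - 5
qp_cr = derive_qp(30, 2, 1)         # 30 - 3
qp_no_act = derive_qp(30, 1, 0)     # unchanged
qp_negative = derive_qp(2, 0, 1)    # 2 - 5, below zero
```

The last call shows that a small original QP yields a negative qP, which is the case the following paragraphs are concerned with.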
However, the scaling process controlled by QP is defined in sub-clause 8.6.3 of the HEVC standard according to:
d[x] [y] = Clip3 (coeffMin, coeffMax,
( (TransCoeffLevel [xTbY] [yTbY] [cIdx] [x] [y] *m [x] [y] *
levelScale [qP%6] << (qP/6) ) + (1 << (bdShift-1) ) )
>> bdShift) .              (7)
For k = 0..5, the list levelScale [] is specified as levelScale [k] = {40, 45, 51, 57, 64, 72}.
The qP adjustment in equations (4), (5) and (6) may cause a qP to be less than 0 when the adaptive color-space transform is applied. When qP < 0, “qP%6” will result in a negative argument for the list levelScale [] and lead to an undefined levelScale [] value. Therefore, d [x] [y] is undefined in equation (7), and it is desirable to overcome this issue.
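The QP-dependent factor of equation (7) can be sketched as follows. The function name `qp_scale_factor` is hypothetical. Python's `%` operator returns a non-negative result for negative operands, so it would silently hide the problem; the explicit guard below is an assumption added to mimic the undefined case described above.

```python
LEVEL_SCALE = [40, 45, 51, 57, 64, 72]


def qp_scale_factor(qp):
    """QP-dependent factor of equation (7): levelScale[qP % 6] << (qP / 6).
    A negative qP is rejected because the list index is then undefined."""
    if qp < 0:
        raise ValueError("qP < 0: levelScale[qP % 6] is undefined")
    return LEVEL_SCALE[qp % 6] << (qp // 6)


factor_25 = qp_scale_factor(25)   # levelScale[1] << 4 = 45 * 16

try:
    qp_scale_factor(-3)           # e.g. Qp'Y = 2 with the -5 offset applied
    negative_rejected = False
except ValueError:
    negative_rejected = True
```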
SUMMARY
A method of MVP (motion vector prediction) for video coding with adaptive motion vector resolution is disclosed. According to the present invention, the MVP coding is applied to the current MV or the current MV is stored depending on the current MV resolution, the reference MV resolution, or both the current MV resolution and the reference MV resolution. In one embodiment, the reference MV associated with the reference block in the reference picture corresponds to a temporal MV associated with a temporal reference block in the reference picture. In another embodiment, when the current MV resolution corresponds to integer pixel resolution, TMVP coding is applied to the current MV using a modified temporal MV as a motion vector predictor for the  current MV, where the modified temporal MV is generated by right-shifting the temporal MV. An offset may be added to the temporal MV before the temporal MV is right-shifted to generate the modified temporal MV. In another embodiment, when the current MV resolution corresponds to integer pixel resolution, the current MV is left-shifted before it is stored in a memory. The shifted current MV can be used for the TMVP coding of a subsequent MV. The shifted current MV is shifted back before it is used as a motion vector predictor for another block in a current picture containing the current slice. Alternatively, a syntax element related to the current MV resolution can be stored in a memory for the TMVP coding of a subsequent MV.
When the current MV resolution corresponds to integer pixel resolution and the reference MV resolution corresponds to non-integer pixel resolution, the TMVP coding can be applied to the current MV using a modified temporal MV as a motion vector predictor for the current MV, where the modified temporal MV is generated by right-shifting the temporal MV. In another case, when the current MV resolution corresponds to non-integer pixel resolution and the reference MV resolution corresponds to integer pixel resolution, the TMVP coding can be applied to the current MV using a modified temporal MV as a motion vector predictor for the current MV, where the modified temporal MV is generated by left-shifting the temporal MV. When the current MV resolution is different from the reference MV resolution, the TMVP coding may also be disabled for the current MV or the encoder may disregard the reference picture for the TMVP coding of the current block.
In another embodiment, when a shift operation is applied to the current MV or the temporal MV due to the current MV resolution or the reference MV resolution respectively, the current MV shifted or the temporal MV shifted is clipped to a valid range. Alternatively, the current MV may have different ranges for different current MV resolutions or the temporal MV may have different ranges for different reference MV resolutions.
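A minimal sketch of the clipping variant is given below, assuming the valid range is the signed 16-bit range mentioned in the background. The names `clip3` and `shift_and_clip` are illustrative only.

```python
MV_MIN = -(1 << 15)
MV_MAX = (1 << 15) - 1


def clip3(low, high, value):
    """Clamp value to the inclusive range [low, high]."""
    return max(low, min(high, value))


def shift_and_clip(mv, shift):
    """Left-shift each MV component, then clip it to the valid range."""
    return tuple(clip3(MV_MIN, MV_MAX, c << shift) for c in mv)


clipped = shift_and_clip((32767, -5), 2)     # first component saturates
in_range = shift_and_clip((100, -100), 2)    # no clipping needed
```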
A first syntax element indicating the current MV resolution and a second syntax element indicating the reference MV resolution can be determined by an encoder so that the current MV resolution has the same value as the reference MV resolution. The current MV resolution may also be indicated by an MV resolution flag in a slice header, where all blocks within a corresponding slice share the MV resolution flag and the MV resolution flag has the same value for all slices in a sequence. Alternatively, the current MV resolution can be indicated by an MV resolution flag at a sequence level and all blocks within a corresponding sequence share the MV resolution flag. Therefore, the MV resolution for a current MV and a temporal reference MV will always be the same.
Another aspect of the present invention addresses the issue associated with boundary  strength derivation when adaptive MV resolution is used. In one embodiment, the block boundary is deblocked depending on the MV resolution. For example, when the MV resolution corresponds to integer resolution, the current MV and the neighboring MV are left-shifted by 2 to become a shifted current MV and a shifted neighboring MV, and the shifted current MV and the shifted neighboring MV are included in determination of boundary strength used for said deblocking. In another example, when the MV resolution corresponds to half-pixel resolution, the current MV and the neighboring MV are left-shifted by 1. In a further example, when the MV resolution corresponds to one-eighth-pixel resolution, the current MV and the neighboring MV are right-shifted by one.
In another embodiment, when the MV resolution corresponds to integer resolution, a first absolute difference in a vertical component between the current MV and the neighboring MV and a second absolute difference in a horizontal component between the current MV and the neighboring MV are compared to a threshold value of 1 instead of 4 to determine boundary strength used for said deblocking. In another example, when the MV resolution corresponds to half-pixel resolution, the first absolute difference in the vertical component and the second absolute difference in the horizontal component are compared to a threshold value of 2 instead of 4. In yet another example, when the MV resolution corresponds to one-eighth-pixel resolution, the first absolute difference in the vertical component and the second absolute difference in the horizontal component are compared to a threshold value of 8 instead of 4.
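The resolution-dependent thresholds can be sketched as a lookup table. The resolution labels, the name `mv_edge_flag`, and the "greater than or equal to" comparison are assumptions made for this illustration; the threshold values 1, 2, 4 and 8 are the ones stated above.

```python
# Threshold equal to one full pixel expressed in units of the MV resolution.
MV_THRESHOLD = {"integer": 1, "half": 2, "quarter": 4, "eighth": 8}


def mv_edge_flag(cur_mv, nbr_mv, resolution):
    """Flag a block boundary when either absolute component difference
    reaches the threshold for the given MV resolution ('>=' assumed)."""
    threshold = MV_THRESHOLD[resolution]
    return (abs(cur_mv[0] - nbr_mv[0]) >= threshold or
            abs(cur_mv[1] - nbr_mv[1]) >= threshold)


# A one-pixel difference is detected at every resolution.
flags = [
    mv_edge_flag((0, 0), (1, 0), "integer"),
    mv_edge_flag((0, 0), (2, 0), "half"),
    mv_edge_flag((0, 0), (4, 0), "quarter"),
    mv_edge_flag((0, 0), (8, 0), "eighth"),
]
```

Scaling the threshold in this way keeps the decision tied to the physical displacement rather than to the numeric representation of the MVs.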
Another aspect of the present invention addresses the issue associated with quantization parameter adjustment to accommodate signal range reduction when color-space transform is applied. In one embodiment, valid adjusted qPs (quantization parameters) are generated from the qPs by modifying the qPs to adjusted qPs to account for the color-space transform and setting the adjusted qPs to be equal to or greater than zero if the adjusted qPs are smaller than zero. For example, when the multiple video components correspond to YCrCb color components, the valid adjusted quantization parameter qPX’ for one color component is generated according to qPX’ = Max (zero, qPX-nX) if the color-space-transform-flag indicates that the color-space transform is applied to the current coding block and qPX’ = Max (zero, qPX) if the color-space-transform-flag indicates that the color-space transform is not applied to the current coding block. Variable nX is 5 for the Y component and the Cb component and nX is 3 for the Cr component, and Max () is a maximum function. Valid adjusted qPs can also be generated using a clipping function. For example, when the color-space transform is applied, (qPX-nX) is clipped to a range from MinQPX to MaxQPX, where MinQPX and MaxQPX correspond to a valid minimum quantization parameter and to a valid maximum quantization parameter for one color component respectively. MinQPX can be zero and MaxQPX can be fifty-one for the Y component, the Cb component and the Cr component. In another embodiment, qPX’ is generated from qPX according to a function of qPX if the color-space transform is applied to the current coding block, where the function of qPX is always greater than or equal to zero.
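The Max-based and clipping-based adjustments can be sketched side by side. The function names are hypothetical, and returning the QP unchanged in the clipping variant when the transform is not applied is an assumption, since the text above only describes the clipping for the case where the transform is applied.

```python
N_X = {"Y": 5, "Cb": 5, "Cr": 3}   # nX per color component


def adjusted_qp_max(qp, component, act_applied):
    """qPX' = Max(0, qPX - nX) when the color-space transform is applied,
    otherwise Max(0, qPX)."""
    return max(0, qp - N_X[component]) if act_applied else max(0, qp)


def adjusted_qp_clip(qp, component, act_applied, min_qp=0, max_qp=51):
    """Clip (qPX - nX) to [MinQPX, MaxQPX] when the transform is applied.
    The QP is returned unchanged otherwise (an assumption of this sketch)."""
    if not act_applied:
        return qp
    return max(min_qp, min(max_qp, qp - N_X[component]))


low_qp_max = adjusted_qp_max(2, "Y", True)      # 2 - 5 is floored at 0
normal_qp_max = adjusted_qp_max(30, "Cr", True)
low_qp_clip = adjusted_qp_clip(2, "Cb", True)
high_qp_clip = adjusted_qp_clip(51, "Y", True)
```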
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 illustrates an example of TMVP (temporal motion vector prediction) for a current MV (motion vector) under the condition of different MV resolution between the current MV and the TMVP.
Fig. 2 illustrates a block diagram of a decoding system incorporating in-loop color-space transform.
Fig. 3 illustrates an example of scaling the TMVP when use_integer_mv_flag of the current slice is equal to 1.
Fig. 4 illustrates an example of scaling MV before storing the MV when use_integer_mv_flag of the current slice is equal to 1.
Fig. 5 illustrates an example that the MV uses the integer pixel resolution and the MV is left shifted by 2 before the MV is used for MV comparison in the deblocking process.
Fig. 6 illustrates an exemplary flowchart of MVP (motion vector prediction) for video data incorporating an embodiment of the present invention.
DETAILED DESCRIPTION
The following description is of the best-contemplated mode of carrying out the invention. This description is made for the purpose of illustrating the general principles of the invention and should not be taken in a limiting sense. The scope of the invention is best determined by reference to the appended claims.
As mentioned previously, when adaptive MV (motion vector) resolution is used and the current MV and a corresponding TMVP (temporal motion vector predictor) have different MV resolutions, a mismatch between the two MVs results. Accordingly, the TMVP operation may not perform correctly. In order to regularize MVs when adaptive MV resolution is applied, several methods are proposed.
In one embodiment, TMVP (temporal motion vector predictor) is right shifted before it is used as MVP for the current block, when MVs of the current slice have the integer pixel resolution (i.e., use_integer_mv_flag = 1). For example, if TMVP corresponds to (x, y), the operations (x+offx) >>2 and (y+offy) >>2 are performed before (x, y) is used as MVP when TMVP is applied. The “n>>2” operation corresponds to right-shifting n by “2”, which is equivalent to “divide by 4”. As is well-known in the field, the “shift by 2” operation can be implemented more efficiently than the “divide by 4” operation. The parameters offx and offy correspond to offsets in the shift and they can be any integer, such as -3, -2, -1, 0, 1, 2, 3, etc. Fig. 3 illustrates an example of scaling the TMVP when use_integer_mv_flag of the current slice is equal to 1. In this scenario, use_integer_mv_flag in the current picture (310) is 1, use_integer_mv_flag in the collocated picture (320) is 0, and the collocated MV (324) for the reference block (322) is right-shifted by 2 and used as the TMVP for the current MV (314) of the current block (312).
In another embodiment, a decoded MV of the current slice is shifted left before it is stored when MVs of the current slice have the integer pixel resolution (i.e., use_integer_mv_flag = 1). Decoded MVs of the current slice are stored for use as TMVP by subsequent pictures. For example, if an MV in block B of the current slice is decoded as (x, y), then (x<<2, y<<2) will be stored in the MV buffer for block B, and (x<<2, y<<2) will be treated as TMVP for a following picture if B is the collocated block. The operation “n <<2” corresponds to left-shifting n by 2. Fig. 4 illustrates an example of scaling MV before storing the MV when use_integer_mv_flag of the current slice is equal to 1. In this scenario, use_integer_mv_flag in the current picture (410) is 1, the current MV (414) of the current block (412) is left-shifted by 2 to generate a scaled MV (424) and the scaled MV (424) is stored in a memory. In one embodiment, the scaled MV stored in the memory can be used for subsequent pictures as TMVP. In another embodiment, the scaled MV stored in the memory is right-shifted before it is used as a motion vector predictor (i.e., the spatial motion vector predictor) for another block in the current picture containing the current slice. In yet another embodiment, the scaled MV stored in the memory can be used for the determination of boundary strength which is required for the deblocking process.
In the present application, when the integer motion vector resolution is enabled (i.e., use_integer_mv_flag is equal to 1), both block vector and motion vector are decoded at integer resolution, and both block vector and motion vector are stored at fractional-pel resolution (e.g., quarter-pel resolution). This is done by left shifting the decoded block vector or motion vector by N, in which N is a positive integer number (e.g., N=2). For example, the decoded motion vector or block vector will be left shifted by 2 to be at quarter-pel resolution, go through a clipping process (so the resulting value is within a certain value range, such as between -2^15 and 2^15-1) and be stored in quarter-pel resolution. However, the block vector predictor and motion vector predictor used are at integer resolution. This is done by right shifting the vector predictor by N (a positive integer number such as 2). On the other hand, when the integer motion vector resolution is disabled (i.e., use_integer_mv_flag is equal to 0), both block vector and motion vector are decoded at fractional-pel resolution, and both block vector and motion vector are stored at fractional-pel resolution (e.g., quarter-pel resolution) without any shifting operation. The block vector predictor and motion vector predictor used are also at fractional-pel resolution (e.g., quarter-pel resolution). For example, the decoded motion vector or block vector will go through a clipping process (so the resulting value is within a certain value range, such as between -2^15 and 2^15-1) and be stored in quarter-pel resolution. Because the clipping is done in the derivation of MV (or BV), there is no need to do the extra clipping operation prior to motion compensation (interpolation). Further, when a stored vector is used to predict an integer MV, the predictor should be right shifted (e.g., by 2) to be in integer-pel resolution before the prediction happens.
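The right shift with offset can be sketched as below. The function name `tmvp_to_integer_pel` and the default offsets of zero are assumptions for illustration. Note that Python's `>>` on negative integers rounds toward negative infinity, which matches an arithmetic right shift.

```python
def tmvp_to_integer_pel(tmvp, offset=(0, 0)):
    """Convert a quarter-pel TMVP (x, y) into an integer-pel predictor
    by computing ((x + offx) >> 2, (y + offy) >> 2)."""
    return ((tmvp[0] + offset[0]) >> 2, (tmvp[1] + offset[1]) >> 2)


exact = tmvp_to_integer_pel((16, 16))                    # exactly 4 pixels
floored = tmvp_to_integer_pel((7, -7))                   # no offset
with_offset = tmvp_to_integer_pel((7, -7), offset=(2, 2))
```

Choosing offx = offy = 2 gives round-half-up behaviour, while an offset of 0 always rounds down; the text leaves the choice of offset open.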
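The store-then-shift-back behaviour of this embodiment can be sketched with a small buffer class. The class `MvBuffer`, its method names, and the use of string block identifiers are hypothetical and serve only to illustrate the data flow of Fig. 4.

```python
class MvBuffer:
    """Hypothetical per-block MV store that always holds quarter-pel values."""

    def __init__(self):
        self._mvs = {}

    def store(self, block_id, mv, use_integer_mv_flag):
        # Left-shift by 2 before storing when the slice uses integer MVs.
        shift = 2 * use_integer_mv_flag
        self._mvs[block_id] = (mv[0] << shift, mv[1] << shift)

    def temporal_predictor(self, block_id):
        # Subsequent pictures read the stored (scaled) MV directly as TMVP.
        return self._mvs[block_id]

    def spatial_predictor(self, block_id, use_integer_mv_flag):
        # Blocks in the same picture shift the stored MV back before use.
        shift = 2 * use_integer_mv_flag
        x, y = self._mvs[block_id]
        return (x >> shift, y >> shift)


buf = MvBuffer()
buf.store("B", (3, -1), use_integer_mv_flag=1)
stored = buf.temporal_predictor("B")
spatial = buf.spatial_predictor("B", use_integer_mv_flag=1)
```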
In one embodiment, when predFlagLX is equal to 1 and the picture with index refIdx from reference picture list LX of the slice is not the current picture, the luma motion vector mvLX is derived as follows:
uLX [0] = ( ( ( (mvpLX [0] >> (2*use_integer_mv_flag) ) + mvdLX [0] ) << (2*use_integer_mv_flag) ) + 2^16) % 2^16
mvLX [0] = (uLX [0] >= 2^15) ? (uLX [0] - 2^16) : uLX [0]
uLX [1] = ( ( ( (mvpLX [1] >> (2*use_integer_mv_flag) ) + mvdLX [1] ) << (2*use_integer_mv_flag) ) + 2^16) % 2^16
mvLX [1] = (uLX [1] >= 2^15) ? (uLX [1] - 2^16) : uLX [1] .
In the above equations, mvpLX [0] and mvpLX [1] correspond to the motion vector components associated with the motion vector predictor, and mvdLX [0] and mvdLX [1] correspond to the motion vector differences between the current motion vector and the motion vector predictor in list LX, where X is equal to 0 or 1.
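The derivation above can be transcribed into a short function for one component. The name `derive_mv_component` is an assumption; the arithmetic follows the equations term by term, with 2^16 and 2^15 written as bit shifts.

```python
def derive_mv_component(mvp, mvd, use_integer_mv_flag):
    """One component of mvLX: shift the predictor down to the decoding
    resolution, add the MV difference, shift back up to quarter-pel,
    then wrap the result into the signed 16-bit range."""
    shift = 2 * use_integer_mv_flag
    u = ((((mvp >> shift) + mvd) << shift) + (1 << 16)) % (1 << 16)
    return u - (1 << 16) if u >= (1 << 15) else u


mv_int = derive_mv_component(12, 1, 1)         # (12 >> 2) + 1 = 4 pixels
mv_frac = derive_mv_component(5, -2, 0)        # plain quarter-pel addition
mv_neg = derive_mv_component(-8, -1, 1)        # -3 pixels
mv_wrapped = derive_mv_component(32764, 1, 1)  # wraps around at 2^15
```

The last call shows the modular wrap-around: the sum reaches 2^15 and is mapped to -2^15, so the result always fits in two bytes.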
When the stored block vector or motion vector is used as a vector predictor in block vector or motion vector prediction, its handling depends on the slice-level flag. If the slice that contains the to-be-predicted vector uses integer motion vectors (the slice level flag “use_integer_mv_flag” is true or equal to 1), the vector predictor is right shifted by N (N is an integer number, such as 2) to make it at integer resolution before prediction occurs. If the slice that contains the to-be-predicted vector uses fractional-pel motion vectors (the slice level flag “use_integer_mv_flag” is false), the vector predictor is used directly without any shifting operation.
The stored block vector and motion vectors are used as inputs to interpolation filter and deblocking filter.
In an alternative solution, use_integer_mv_flag is stored so that it can be read by following pictures and used to determine whether to scale the decoded MV.
In yet another embodiment, TMVP is conducted differently depending on use_integer_mv_flag of the current slice and use_integer_mv_flag of the collocated picture.
For example, when MVs of the current slice have the integer pixel resolution and MVs of the collocated picture have the quarter pixel resolution, TMVP can be right shifted before it is used as MVP for the current block. If TMVP corresponds to (x, y) , then ( (x+offx) >>2, (y+offy) >>2) is used as MVP when TMVP is applied. Again, offx and offy are offsets in shift and they can be any value such as -3, -2, -1, 0, 1, 2, 3, etc.
In another example, TMVP can be left shifted before it is used as MVP for the current block, when MVs of the current slice have the quarter pixel resolution and MVs of the collocated picture have the integer pixel resolution. If TMVP corresponds to (x, y) , then (x<<2, y<<2) should be used as MVP when TMVP is applied.
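The two TMVP examples above can be combined into one small Python sketch. The function name tmvp_to_mvp, its boolean arguments, and the default zero offsets are assumptions made for illustration; only the shift amounts and the offset-before-shift order come from the text.

```python
def tmvp_to_mvp(tmvp, cur_integer, col_integer, off=(0, 0)):
    """Convert a temporal MV (x, y) to the current slice's MV resolution.

    cur_integer / col_integer: True when the current slice / collocated
    picture uses integer-pel MVs (use_integer_mv_flag equal to 1).
    off: (offx, offy) added before the right shift; hypothetical default.
    """
    x, y = tmvp
    if cur_integer and not col_integer:
        # Quarter-pel TMVP predicting an integer-pel MV: right shift by 2.
        return ((x + off[0]) >> 2, (y + off[1]) >> 2)
    if not cur_integer and col_integer:
        # Integer-pel TMVP predicting a quarter-pel MV: left shift by 2.
        return (x << 2, y << 2)
    # Same resolution on both sides: use the TMVP directly.
    return (x, y)
```

An offset of 2 rounds to the nearest integer position, whereas an offset of 0 rounds toward negative infinity, because the right shift of a negative value is an arithmetic shift.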
In one embodiment, MVs of the current slice and MVs of the collocated picture are forced to have the same resolution. In other words, use_integer_mv_flag of the current slice and use_integer_mv_flag of the collocated picture must be the same.
In another embodiment, TMVP is disabled if MVs of the current slice and MVs of the collocated picture have different resolutions. In other words, if use_integer_mv_flag of the current slice and use_integer_mv_flag of the collocated picture are different, TMVP is disabled for the current MV.
In still another embodiment, a reference picture will not be used as the collocated picture for the current slice if use_integer_mv_flag of the current slice and use_integer_mv_flag of the reference picture are different.
In still another embodiment, all slices in a sequence are forced to have the same MV pixel resolution. In other words, use_integer_mv_flag of all the slices in a sequence are the same.
In still another embodiment, use_integer_mv_flag is transmitted at a sequence level (e.g., in the SPS) instead of the slice level. If use_integer_mv_flag at the sequence level has a value of 1, the MV resolution of all slices in the sequence is integer pixel. Otherwise, the MV resolution of all slices in the sequence is quarter pixel.
In one embodiment, the MV is clipped after scaling based on the MV pixel resolution. For example, motion vector values MVL0 and MVL1, for list L0 and L1 respectively, should be clipped to [-2^15, 2^15-1] after being shifted left by 2. In other words, MVL0 = Max(-2^15, Min(MVL0 << 2, 2^15-1)) and MVL1 = Max(-2^15, Min(MVL1 << 2, 2^15-1)). In particular, this clip should be done for the MV scaling according to JCTVC-S1005 as shown in equation (1).
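As a minimal Python sketch of the clip-after-scaling step, assuming a hypothetical helper name scale_and_clip_mv and a default shift of 2:

```python
MV_MIN = -2**15      # lower bound of the stored MV range
MV_MAX = 2**15 - 1   # upper bound of the stored MV range


def scale_and_clip_mv(mv, shift=2):
    """Left shift an integer-pel MV component and clip it to 16 bits.

    Implements Max(-2^15, Min(mv << shift, 2^15 - 1)); the helper name
    and the default shift are illustrative assumptions.
    """
    return max(MV_MIN, min(mv << shift, MV_MAX))
```

Unlike a modulo wrap, this saturates: an integer-pel component of 10000 becomes 32767 instead of wrapping to a negative value.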
In another embodiment, the decoded MV can be constrained to different ranges depending on whether use_integer_mv_flag of the current slice is 0 or 1. The range should be tighter for use_integer_mv_flag = 1 than for use_integer_mv_flag = 0. For example, the range can be specified as:
if use_integer_mv_flag == 0, -2^15 <= mvL0, mvL1 <= 2^15-1, and
if use_integer_mv_flag == 1, -2^13 <= mvL0, mvL1 <= 2^13-1.
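The reason for the tighter range is that an integer-pel value within 14 signed bits still fits in 16 signed bits after a left shift by 2. A short Python check, with the hypothetical name mv_in_range, might look like this:

```python
def mv_in_range(mv, use_integer_mv_flag):
    """Check a decoded MV component against the flag-dependent range.

    Integer-pel MVs are limited to [-2^13, 2^13 - 1] so that the value
    left shifted by 2 stays within [-2^15, 2^15 - 1]. The function name
    is an illustrative assumption.
    """
    bits = 13 if use_integer_mv_flag else 15
    return -(2**bits) <= mv <= 2**bits - 1
```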
In the foregoing embodiments, the resolution of block vector and motion vector is unified when the current picture is used as one of the reference pictures. That is, the resolution of block vectors in a slice is the same as the resolution of regular motion vectors in the same slice.
As mentioned before, there is also an issue in deblocking related to MV resolution. In order to harmonize adaptive MV resolution and deblocking, several methods are disclosed as follows.
In one embodiment, deblocking is performed differently when different MV resolutions are used. For example, deblocking can be performed differently depending on use_integer_mv_flag. If the MV uses the integer pixel resolution (i.e., use_integer_mv_flag = 1), the MV is left shifted by 2 (i.e., MVx = MVx << 2 and MVy = MVy << 2) before the MV is used for MV comparison in the deblocking process as shown in Fig. 5. In Fig. 5, the value of use_integer_mv_flag is checked in step 510 to determine whether integer MV resolution is used. If “use_integer_mv_flag = 1” is true (i.e., the “yes” path from step 510), the MV is scaled in step 520 before the MV is used for MV comparison in the deblocking process in step 530. If “use_integer_mv_flag = 1” is false (i.e., the “no” path from step 510), the MV is directly used for MV comparison in the deblocking process in step 530 without scaling.
In another embodiment, the absolute difference between the horizontal or vertical component of the motion vectors is compared with a threshold value of 1 instead of 4 when MVs use the integer pixel resolution (i.e., use_integer_mv_flag = 1) .
If the MV uses the half pixel resolution, the MV is left shifted by 1 (i.e., MVx=MVx<<1 and MVy=MVy<<1) before it is used for MV comparison in the deblocking process according to one embodiment.
If the MV uses the eighth pixel resolution, the MV is right shifted by 1 (i.e., MVx=MVx>>1 and MVy=MVy>>1, or MVx= (MVx+1) >>1 and MVy= (MVy+1) >>1) before it is used for MV comparison in the deblocking process.
If MVs use the half pixel resolution, the absolute difference between the horizontal or vertical component of the motion vectors is compared with a threshold value of 2 instead of 4.
If MVs use the eighth pixel resolution, the absolute difference between the horizontal or vertical component of the motion vectors is compared with a threshold value of 8 instead of 4.
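The two families of embodiments above (shifting the MVs to quarter-pel units, or keeping the MVs and changing the threshold) can be sketched side by side in Python. The table and function names are hypothetical, and the use of ">=" in the comparison is an assumption that follows the usual HEVC boundary-strength test; the text above only says that the difference is "compared with" the threshold.

```python
# Left shift that brings an MV to quarter-pel units (negative = right shift).
SHIFT_TO_QUARTER = {"integer": 2, "half": 1, "quarter": 0, "eighth": -1}
# Threshold expressed in the MV's own units (one integer sample).
THRESHOLD = {"integer": 1, "half": 2, "quarter": 4, "eighth": 8}


def to_quarter_pel(v, resolution):
    """Shift one MV component to quarter-pel units."""
    s = SHIFT_TO_QUARTER[resolution]
    return v << s if s >= 0 else v >> -s


def differs_by_shift(mv_a, mv_b, resolution):
    """Shift both MVs to quarter-pel, then compare against 4."""
    return any(
        abs(to_quarter_pel(a, resolution) - to_quarter_pel(b, resolution)) >= 4
        for a, b in zip(mv_a, mv_b)
    )


def differs_by_threshold(mv_a, mv_b, resolution):
    """Keep the MVs as they are and compare against a scaled threshold."""
    return any(abs(a - b) >= THRESHOLD[resolution] for a, b in zip(mv_a, mv_b))
```

For integer and half pixel resolution the two variants give the same decision, since a left shift scales the difference exactly. For eighth pixel resolution the right shift discards one bit, so the two variants may disagree for some vector pairs.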
In still another embodiment, deblocking is not conducted when integer pixel resolution is used (i.e., use_integer_mv_flag = 1) .
In the above embodiments, the temporal motion vector prediction has been used as an example for applying MVP coding to the current MV or storing the current MV depending on the current MV resolution, the reference MV resolution, or both the current MV resolution and the reference MV resolution. The present invention may also be applied to other types of MVP such as spatial MVP or inter-view MVP.
As mentioned before, the color-space transform may also cause an issue for a decoder system when the color-space transform is applied. According to a conventional decoder implementation, the received QP (quantization parameter) is adjusted by subtracting 5 or 3 from the original QP. Accordingly, the adjusted QP may become negative. In order to guarantee that the QP is always in a valid range when the adaptive color-space transform is applied, several methods are disclosed for QP adjustment.
In one embodiment, the QP after adjustment is always no smaller than 0 according to one of the following operations:
qP = Max (0, Qp′Y + (cu_residual_act_flag [xTbY] [yTbY] ? -5: 0) )              (8)
qP = Max (0, Qp′Cb + (cu_residual_act_flag [xTbY] [yTbY] ? -5: 0) )             (9)
qP = Max (0, Qp′Cr + (cu_residual_act_flag [xTbY] [yTbY] ? -3: 0) ) .           (10)
Max() corresponds to a maximum function in the above equations. In another embodiment, the adjusted QP is clipped to a valid range after the QP is adjusted to accommodate the signal range reduction due to the color-space transform. For example, the above equations can be rewritten as follows:
qP = clip3 (MinQPY, MaxQPY,
        Qp′Y + (cu_residual_act_flag [xTbY] [yTbY] ? -5: 0) )             (11)
qP = clip3 (MinQPCb , MaxQPCb,
        Qp′Cb + (cu_residual_act_flag [xTbY] [yTbY] ? -5 : 0) )           (12)
qP = clip3 (MinQPCr , MaxQPCr,
        Qp′Cr + (cu_residual_act_flag [xTbY] [yTbY] ? -3: 0) ) .          (13)
In the above equations, clip3(a, b, x) is a clipping function that clips the variable x to a range from a to b. MinQPY, MinQPCb and MinQPCr correspond to the minimum clipping values for Y, Cb and Cr respectively. MaxQPY, MaxQPCb and MaxQPCr correspond to the maximum clipping values for Y, Cb and Cr respectively. For example, MinQPY can be set to 0 and MaxQPY can be set to 51; MinQPCb can be set to 0 and MaxQPCb can be set to 51; and MinQPCr can be set to 0 and MaxQPCr can be set to 51.
In another example, MinQPCb and/or MaxQPCb can be set according to MinQPY and/or MaxQPY respectively. Similarly, MinQPCr and/or MaxQPCr can be set according to MinQPY and/or MaxQPY respectively.
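Equations (8) to (13) can be illustrated with a short Python sketch. The names ACT_QP_OFFSET, adjust_qp_max and adjust_qp_clip are hypothetical, and the default limits 0 and 51 are taken from the example values above rather than from any normative requirement.

```python
# QP offsets applied when the color-space transform is used, per component.
ACT_QP_OFFSET = {"Y": -5, "Cb": -5, "Cr": -3}


def clip3(lo, hi, x):
    """Clip x to the range [lo, hi]."""
    return lo if x < lo else hi if x > hi else x


def adjust_qp_max(qp, comp, act_flag):
    """Equations (8)-(10): floor the adjusted QP at 0."""
    return max(0, qp + (ACT_QP_OFFSET[comp] if act_flag else 0))


def adjust_qp_clip(qp, comp, act_flag, min_qp=0, max_qp=51):
    """Equations (11)-(13): clip the adjusted QP to [min_qp, max_qp]."""
    return clip3(min_qp, max_qp, qp + (ACT_QP_OFFSET[comp] if act_flag else 0))
```

For instance, a luma QP of 3 with the transform enabled would become -2 without protection; both variants return 0 instead. The clipped variant additionally enforces the upper bound.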
In still another embodiment, QP is calculated within a valid range according to a function if adaptive color-space transform is used. For example, equations (4) , (5) and (6) can be rewritten as:
qP = (cu_residual_act_flag [xTbY] [yTbY] ? fY (Qp′Y) : Qp′Y) ,              (14)
qP = (cu_residual_act_flag [xTbY] [yTbY] ? fCb (Qp′Cb) : Qp′Cb) ,           (15)
qP = (cu_residual_act_flag [xTbY] [yTbY] ? fCr (Qp′Cr) : Qp′Cr) .           (16)
The return values of fY(), fCb() and fCr() are larger than or equal to 0. For example, function fY(Qp′Y) may correspond to (Qp′Y - 5 + OffsetY1 + OffsetY2) % OffsetY1. Function fCb(Qp′Cb) may correspond to (Qp′Cb - 5 + OffsetCb1 + OffsetCb2) % OffsetCb1. Similarly, function fCr(Qp′Cr) may correspond to (Qp′Cr - 3 + OffsetCr1 + OffsetCr2) % OffsetCr1. For example, OffsetY1, OffsetCb1 and OffsetCr1 can all be set to 51.
In still another embodiment, a constraint can be applied at the encoder side so that the adjusted QP will not be lower than 0. If equations (1), (2) and (3) yield one or more qP < 0, the bit-stream is considered illegal. In another embodiment, the adaptive color-space transform should be turned off if equations (1), (2) and (3) yield one or more qP < 0.
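A modular function of this form can be sketched in Python as below. The name f_qp is hypothetical, the first offset defaults to the example value 51, and the second offset defaults to 0 purely as an assumption because no value is given for it in the text.

```python
def f_qp(qp, act_offset, offset1=51, offset2=0):
    """Return (qp + act_offset + offset1 + offset2) % offset1.

    act_offset is -5 for Y and Cb and -3 for Cr. offset2 = 0 is an
    assumed value for illustration only. In Python the % operator
    returns a non-negative result for a positive modulus, so the
    return value is always in [0, offset1 - 1].
    """
    return (qp + act_offset + offset1 + offset2) % offset1
```

Under these assumed offsets, a QP of 30 for luma maps to 25, while a QP of 3 maps to 49: the function keeps the result non-negative by wrapping around rather than by saturating at 0.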
Fig. 6 illustrates an exemplary flowchart of MVP (motion vector prediction) for video data incorporating an embodiment of the present invention. The system receives input data associated with a current MV (motion vector) for a current block in a current slice in step 610. The input data may be retrieved from storage such as a computer memory or buffer (RAM or DRAM). The input data may also be received from a processor such as a processing unit or a digital signal processor. At the encoder side, the input data may correspond to the current MV to be predicted. At the decoder side, the input data may correspond to coded MV data to be decoded. The current MV resolution for the current MV, the reference MV resolution for a reference MV associated with a reference block in a reference picture, or both the current MV resolution and the reference MV resolution are determined in step 620. In step 630, MVP coding is then applied to the current MV, or the current MV is stored, depending on the current MV resolution, the reference MV resolution, or both the current MV resolution and the reference MV resolution.
The above description is presented to enable a person of ordinary skill in the art to practice the present invention as provided in the context of a particular application and its requirement. Various modifications to the described embodiments will be apparent to those with skill in the art, and the general principles defined herein may be applied to other embodiments. Therefore, the present invention is not intended to be limited to the particular embodiments shown and described, but is to be accorded the widest scope consistent with the principles and novel features herein disclosed. In the above detailed description, various specific details are illustrated in order to provide a thorough understanding of the present invention. Nevertheless, it will be understood by those skilled in the art that the present invention may be practiced.
Embodiments of the present invention as described above may be implemented in various hardware, software code, or a combination of both. For example, an embodiment of the present invention can be a circuit integrated into a video compression chip or program code integrated into video compression software to perform the processing described herein. An embodiment of the present invention may also be program code to be executed on a Digital Signal Processor (DSP) to perform the processing described herein. The invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or a field programmable gate array (FPGA). These processors can be configured to perform particular tasks according to the invention by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention. The software code or firmware code may be developed in different programming languages and different formats or styles. The software code may also be compiled for different target platforms. However, different code formats, styles and languages of software code and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.
The invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described examples are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is, therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.

Claims (30)

  1. A method of MVP (motion vector prediction) for video data, the method comprising:
    receiving input data associated with a current MV (motion vector) for a current block in a current slice;
    determining current MV resolution for the current MV, reference MV resolution for a reference MV associated with a reference block in a reference picture, or both the current MV resolution and the reference MV resolution; and
    applying MVP coding to the current MV or storing the current MV depending on the current MV resolution, the reference MV resolution, or both the current MV resolution and the reference MV resolution.
  2. The method of Claim 1, wherein the reference MV associated with the reference block in the reference picture corresponds to a temporal MV associated with a temporal reference block in the reference picture.
  3. The method of Claim 2, wherein when the current MV resolution corresponds to integer pixel resolution, said applying MVP coding to the current MV uses a modified temporal MV as a motion vector predictor for the current MV, wherein the modified temporal MV is generated by right-shifting the temporal MV.
  4. The method of Claim 3, wherein an offset is added to the temporal MV before the temporal MV is right-shifted to generate the modified temporal MV.
  5. The method of Claim 2, wherein when the current MV resolution corresponds to integer pixel resolution, the current MV is left-shifted before it is stored in a memory.
  6. The method of Claim 5, wherein the current MV stored in the memory is used for a subsequent MV.
  7. The method of Claim 5, wherein the current MV stored in the memory is right-shifted before it is used as a motion vector predictor for another block in a current picture containing the current slice.
  8. The method of Claim 2, wherein when the current MV resolution corresponds to integer pixel resolution, the current MV is left-shifted before it is used in determination of boundary strength used  for a deblocking process.
  9. The method of Claim 2, wherein a syntax element related to the current MV resolution is stored in a memory for the MVP coding of a subsequent MV.
  10. The method of Claim 2, wherein when the current MV resolution corresponds to non-integer pixel resolution and the reference MV resolution corresponds to integer pixel resolution, said applying MVP coding to the current MV uses a modified temporal MV as a motion vector predictor for the current MV, wherein the modified temporal MV is generated by left-shifting the temporal MV.
  11. The method of Claim 2, wherein when the current MV resolution is different from the reference MV resolution, said applying MVP coding to the current MV disables the MVP coding for the current block.
  12. The method of Claim 2, wherein when the current MV resolution is different from the reference MV resolution, said applying MVP coding to the current MV disregards the reference picture for the MVP coding of the current block.
  13. The method of Claim 2, wherein when a shift operation is applied to the current MV or the temporal MV due to the current MV resolution or the reference MV resolution respectively, the current MV shifted or the temporal MV shifted is clipped to a valid range.
  14. The method of Claim 2, wherein the current MV has different ranges for different current MV resolutions or the temporal MV has different ranges for different reference MV resolutions.
  15. The method of Claim 2, wherein a first syntax element indicating the current MV resolution and a second syntax element indicating the reference MV resolution are determined by an encoder to cause the current MV resolution to have a same value as the reference MV resolution.
  16. The method of Claim 2, wherein the current MV resolution is indicated by a MV resolution flag in a slice header and all blocks within a corresponding slice share the MV resolution flag.
  17. The method of Claim 2, wherein the current MV resolution is indicated by a MV resolution flag in a sequence level and all blocks within a corresponding sequence share the MV resolution flag.
  18. A method of deblocking for reconstructed video data, the method comprising:
    receiving input data associated with a current reconstructed block in a current slice;
    determining MV (motion vector) resolution associated with the current slice;
    determining a current MV associated with the current reconstructed block;
    determining a neighboring MV associated with a neighboring reconstructed block in the current slice and adjacent to a block boundary of the current reconstructed block; and
    deblocking the block boundary depending on the MV resolution.
  19. The method of Claim 18, wherein when the MV resolution corresponds to integer resolution, the current MV and the neighboring MV are left-shifted by 2 to become a shifted current MV and a shifted neighboring MV, and the shifted current MV and the shifted neighboring MV are included in determination of boundary strength used for said deblocking.
  20. The method of Claim 18, wherein when the MV resolution corresponds to half-pixel resolution, the current MV and the neighboring MV are left-shifted by one to become a shifted current MV and a shifted neighboring MV, and the shifted current MV and the shifted neighboring MV are included in determination of boundary strength used for said deblocking.
  21. The method of Claim 18, wherein when the MV resolution corresponds to one-eighth-pixel resolution, the current MV and the neighboring MV are right-shifted by one to become a shifted current MV and a shifted neighboring MV, and the shifted current MV and the shifted neighboring MV are included in determination of boundary strength used for said deblocking.
  22. The method of Claim 18, wherein when the MV resolution corresponds to integer resolution, a first absolute difference in a vertical component between the current MV and the neighboring MV and a second absolute difference in a horizontal component between the current MV and the neighboring MV are compared to a threshold value of one instead of four to determine boundary strength used for said deblocking.
  23. The method of Claim 18, wherein when the MV resolution corresponds to half-pixel resolution, a first absolute difference in a vertical component between the current MV and the neighboring MV and a second absolute difference in a horizontal component between the current MV and the neighboring MV are compared to a threshold value of two instead of four to determine boundary strength used for said deblocking.
  24. The method of Claim 18, wherein when the MV resolution corresponds to one-eighth-pixel resolution, a first absolute difference in a vertical component between the current MV and the neighboring MV and a second absolute difference in a horizontal component between the current MV and the neighboring MV are compared to a threshold value of eight instead of four to determine boundary strength used for said deblocking.
  25. A method of video decoding for color video data, wherein the color video data includes multiple video components and encoding process includes color-space transform, the method comprising:
    receiving coded data associated with a current coding block;
    determining qPs (quantization parameters) for color components and a color-space-transform-flag from the coded data for the current coding block;
    if the color-space-transform-flag indicates that the color-space transform is applied to the current coding block:
    generating valid adjusted qPs from the qPs, wherein said generating the valid adjusted qPs comprises modifying the qPs to adjusted qPs to account for the color-space transform and setting the adjusted qPs to equal to or greater than zero if the adjusted qPs are smaller than zero;
    de-quantizing quantized transform coefficients associated with the current coding block using the valid adjusted qPs to generate decoded transform coefficients;
    applying inverse transform to the decoded transform coefficients to generate a first intermediate reconstructed coding block;
    applying inverse color-space transform to the first intermediate reconstructed coding block or processed first intermediate reconstructed coding block to generate a second intermediate reconstructed coding block; and
    further processing the second intermediate reconstructed coding block to generate a final reconstructed coding block.
  26. The method of Claim 25, wherein the multiple video components correspond to YCrCb color components, and said generating the valid adjusted qPs from the qPs corresponds to generating a valid adjusted quantization parameter qPX’ for one color component according to qPX’ = Max(0, qPX-nX) if the color-space-transform-flag indicates that the color-space transform is applied to the current coding block and qPX’ = Max(0, qPX) if the color-space-transform-flag indicates that the color-space transform is not applied to the current coding block, wherein qPX corresponds to the quantization parameter for one color component, nX is 5 for Y component and Cb component and nX is 3 for Cr component, and Max() is a maximum function.
  27. The method of Claim 25, wherein said setting the adjusted qPs to equal to or greater than zero corresponds to clipping the adjusted qPs to a range from minimum qPs to maximum qPs.
  28. The method of Claim 25, wherein the multiple video components correspond to YCrCb color components, and said generating the valid adjusted qPs from the qPs corresponds to generating a valid adjusted quantization parameter qPX’ from quantization parameter qPX for one color component by clipping (qPX-nX) to a range from MinQPX to MaxQPX if the color-space-transform-flag indicates that the color-space transform is applied to the current coding block, wherein MinQPX and MaxQPX correspond to a valid minimum quantization parameter and a valid maximum quantization parameter for one color component respectively, nX is 5 for Y component and Cb component and nX is 3 for Cr component.
  29. The method of Claim 28, wherein MinQPX is 0 and MaxQPX is 51 for the Y component, the Cb component and Cr component.
  30. The method of Claim 25, wherein said generating the valid adjusted qPs from the qPs corresponds to generating a valid adjusted quantization parameter qPX’ from quantization parameter qPX for one color component according to a function of qPX if the color-space transform is applied to the current coding block, wherein the function of qPX is always greater than or equal to zero.
PCT/CN2015/091275 2014-09-30 2015-09-30 Method of adaptive motion vetor resolution for video coding WO2016050219A1 (en)

Priority Applications (9)

Application Number Priority Date Filing Date Title
EP15847504.6A EP3189660B1 (en) 2014-09-30 2015-09-30 Method of adaptive motion vector resolution for video coding
KR1020177010070A KR102068828B1 (en) 2014-09-30 2015-09-30 Method of adaptive motion vector resolution for video coding
US15/514,129 US10455231B2 (en) 2014-09-30 2015-09-30 Method of adaptive motion vector resolution for video coding
CN202010541786.3A CN111818334B (en) 2014-09-30 2015-09-30 Method for adaptive motion vector resolution for video coding
CA2961681A CA2961681C (en) 2014-09-30 2015-09-30 Method of adaptive motion vetor resolution for video coding
CN202111509061.7A CN114554199B (en) 2014-09-30 2015-09-30 Method for adaptive motion vector resolution for video coding
CN201580052679.1A CN107079164B (en) 2014-09-30 2015-09-30 Method for adaptive motion vector resolution for video coding
KR1020207001441A KR102115715B1 (en) 2014-09-30 2015-09-30 Method of adaptive motion vector resolution for video coding
US16/564,042 US10880547B2 (en) 2014-09-30 2019-09-09 Method of adaptive motion vector resolution for video coding

Applications Claiming Priority (10)

Application Number Priority Date Filing Date Title
PCT/CN2014/088017 WO2016049894A1 (en) 2014-09-30 2014-09-30 Scaling in color transform
CNPCT/CN2014/088017 2014-09-30
CNPCT/CN2015/071553 2015-01-26
PCT/CN2015/071553 WO2016119104A1 (en) 2015-01-26 2015-01-26 Motion vector regularization
CNPCT/CN2015/072175 2015-02-03
PCT/CN2015/072175 WO2016123749A1 (en) 2015-02-03 2015-02-03 Deblocking filtering with adaptive motion vector resolution
US201562154373P 2015-04-29 2015-04-29
US62/154,373 2015-04-29
US201562182685P 2015-06-22 2015-06-22
US62/182,685 2015-06-22

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US15/514,129 A-371-Of-International US10455231B2 (en) 2014-09-30 2015-09-30 Method of adaptive motion vector resolution for video coding
US16/564,042 Continuation US10880547B2 (en) 2014-09-30 2019-09-09 Method of adaptive motion vector resolution for video coding

Publications (1)

Publication Number Publication Date
WO2016050219A1 true WO2016050219A1 (en) 2016-04-07

Family

ID=55629454

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/091275 WO2016050219A1 (en) 2014-09-30 2015-09-30 Method of adaptive motion vetor resolution for video coding

Country Status (6)

Country Link
US (2) US10455231B2 (en)
EP (1) EP3189660B1 (en)
KR (2) KR102115715B1 (en)
CN (3) CN114554199B (en)
CA (1) CA2961681C (en)
WO (1) WO2016050219A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018110203A1 (en) * 2016-12-16 2018-06-21 シャープ株式会社 Moving image decoding apparatus and moving image encoding apparatus
EP3763122A4 (en) * 2018-03-07 2021-01-13 Tencent America LLC Method and apparatus for video coding
US10986366B2 (en) 2016-06-30 2021-04-20 Interdigital Vc Holdings, Inc. Video coding with adaptive motion information refinement
US11431964B2 (en) 2018-11-22 2022-08-30 Beijing Bytedance Network Technology Co., Ltd. Coordination method for sub-block based inter prediction
US11695946B2 (en) 2019-09-22 2023-07-04 Beijing Bytedance Network Technology Co., Ltd Reference picture resampling in video processing
US11871025B2 (en) * 2019-08-13 2024-01-09 Beijing Bytedance Network Technology Co., Ltd Motion precision in sub-block based inter prediction

Families Citing this family (55)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3117612A1 (en) * 2014-03-14 2017-01-18 VID SCALE, Inc. Systems and methods for rgb video coding enhancement
CN114554199B (en) * 2014-09-30 2023-11-10 寰发股份有限公司 Method for adaptive motion vector resolution for video coding
US20160337662A1 (en) * 2015-05-11 2016-11-17 Qualcomm Incorporated Storage and signaling resolutions of motion vectors
US10200713B2 (en) 2015-05-11 2019-02-05 Qualcomm Incorporated Search region determination for inter coding within a particular picture of video data
US10812822B2 (en) * 2015-10-02 2020-10-20 Qualcomm Incorporated Intra block copy merge mode and padding of unavailable IBC reference region
KR20180043151A (en) * 2016-10-19 2018-04-27 에스케이텔레콤 주식회사 Apparatus and Method for Video Encoding or Decoding
CN109963155B (en) 2017-12-23 2023-06-06 华为技术有限公司 Prediction method and device for motion information of image block and coder-decoder
US10523948B2 (en) 2018-02-05 2019-12-31 Tencent America LLC Method and apparatus for video coding
US10687071B2 (en) 2018-02-05 2020-06-16 Tencent America LLC Method and apparatus for video coding
US11202079B2 (en) 2018-02-05 2021-12-14 Tencent America LLC Method and apparatus for video decoding of an affine model in an intra block copy mode
KR20220029762A (en) 2018-02-28 2022-03-08 삼성전자주식회사 A method and an apparatus for video decoding, a method and an apparatus for video encoding
US10462483B1 (en) 2018-04-26 2019-10-29 Tencent America LLC Method and apparatus for video coding
US10448025B1 (en) 2018-05-11 2019-10-15 Tencent America LLC Method and apparatus for video coding
US11109025B2 (en) 2018-06-04 2021-08-31 Tencent America LLC Method and apparatus for sub-block based temporal motion vector prediction
GB2589221B (en) 2018-06-19 2023-03-22 Beijing Bytedance Network Tech Co Ltd Mode dependent MVD precision set
US10448026B1 (en) * 2018-07-09 2019-10-15 Tencent America LLC Method and apparatus for block vector signaling and derivation in intra picture block compensation
US10904559B2 (en) 2018-07-13 2021-01-26 Tencent America LLC Block vector prediction in intra block copy mode
US11019331B2 (en) 2018-07-16 2021-05-25 Tencent America LLC Method and apparatus for video coding with prediction information
US10798376B2 (en) 2018-07-17 2020-10-06 Tencent America LLC Method and apparatus for video coding
CA3107531A1 (en) * 2018-07-31 2020-02-06 Mediatek, Inc. Method and apparatus of merge with motion vector difference for video coding
US11057617B2 (en) * 2018-08-03 2021-07-06 Tencent America LLC Method and apparatus for video coding
US10958932B2 (en) * 2018-09-12 2021-03-23 Qualcomm Incorporated Inter-prediction coding of video data using generated motion vector predictor list including non-adjacent blocks
WO2020058886A1 (en) 2018-09-19 2020-03-26 Beijing Bytedance Network Technology Co., Ltd. Fast algorithms for adaptive motion vector resolution in affine mode
WO2020060351A1 (en) * 2018-09-21 2020-03-26 엘지전자 주식회사 Method and apparatus for deriving motion vector
US11706442B2 (en) * 2018-09-21 2023-07-18 Lg Electronics Inc. Process and apparatus for controlling compressed motion vectors
US10848782B2 (en) 2018-09-21 2020-11-24 Tencent America LLC Method and apparatus for video coding
WO2020071672A1 (en) * 2018-10-02 2020-04-09 엘지전자 주식회사 Method for compressing motion vector and apparatus therefor
US11317099B2 (en) 2018-10-05 2022-04-26 Tencent America LLC Method and apparatus for signaling an offset in video coding for intra block copy and/or inter prediction
US10764601B2 (en) 2018-10-06 2020-09-01 Tencent America LLC Method and apparatus for video coding
US11284066B2 (en) 2018-10-10 2022-03-22 Tencent America LLC Method and apparatus for intra block copy in intra-inter blending mode and triangle prediction unit mode
US11140404B2 (en) 2018-10-11 2021-10-05 Tencent America LLC Method and apparatus for video coding
US11509919B2 (en) 2018-10-17 2022-11-22 Tencent America Reference sample memory size restrictions for intra block copy
WO2020084470A1 (en) * 2018-10-22 2020-04-30 Beijing Bytedance Network Technology Co., Ltd. Storage of motion parameters with clipping for affine mode
WO2020086317A1 (en) 2018-10-23 2020-04-30 Tencent America Llc. Method and apparatus for video coding
WO2020084552A1 (en) 2018-10-24 2020-04-30 Beijing Bytedance Network Technology Co., Ltd. Motion candidate derivation based on spatial neighboring block in sub-block motion vector prediction
WO2020143832A1 (en) * 2019-01-12 2020-07-16 Beijing Bytedance Network Technology Co., Ltd. Bi-prediction constraints
CN113412623A (en) 2019-01-31 2021-09-17 Beijing Bytedance Network Technology Co., Ltd. Recording context of affine mode adaptive motion vector resolution
WO2020192726A1 (en) 2019-03-27 2020-10-01 Beijing Bytedance Network Technology Co., Ltd. History-based motion vector prediction
US11394990B2 (en) 2019-05-09 2022-07-19 Tencent America LLC Method and apparatus for signaling predictor candidate list size
WO2020228691A1 (en) * 2019-05-12 2020-11-19 Beijing Bytedance Network Technology Co., Ltd. Signaling for reference picture resampling
EP3954119A4 (en) 2019-05-21 2022-06-22 Beijing Bytedance Network Technology Co., Ltd. Syntax signaling in sub-block merge mode
US11212545B2 (en) 2019-06-07 2021-12-28 Tencent America LLC Method and apparatus for improved implicit transform selection
WO2020252745A1 (en) * 2019-06-20 2020-12-24 Alibaba Group Holding Limited Loop filter design for adaptive resolution video coding
KR20210107858A (en) 2019-07-11 2021-09-01 Tencent America LLC Method and apparatus for video coding
US11616962B2 (en) 2019-07-15 2023-03-28 Tencent America LLC Method and apparatus for video coding
US11375243B2 (en) 2019-07-17 2022-06-28 Tencent America LLC Method and apparatus for video coding
WO2021027774A1 (en) 2019-08-10 2021-02-18 Beijing Bytedance Network Technology Co., Ltd. Subpicture dependent signaling in video bitstreams
WO2021027862A1 (en) 2019-08-13 2021-02-18 Beijing Bytedance Network Technology Co., Ltd. Motion precision in sub-block based inter prediction
CN110572673B (en) * 2019-09-27 2024-04-09 Tencent Technology (Shenzhen) Co., Ltd. Video encoding and decoding method and device, storage medium and electronic device
US11310511B2 (en) 2019-10-09 2022-04-19 Tencent America LLC Method and apparatus for video coding
CN114631317B (en) 2019-10-18 2024-03-15 Beijing Bytedance Network Technology Co., Ltd. Syntax constraints in parameter set signaling of sub-pictures
CN114902666A (en) * 2019-10-28 2022-08-12 LG Electronics Inc. Image encoding/decoding method and apparatus using adaptive color transform and method of transmitting bitstream
WO2021107641A1 (en) * 2019-11-26 2021-06-03 Wilus Institute of Standards and Technology Inc. Method and device for processing video signal by using adaptive color space transform
US11516514B2 (en) * 2020-03-27 2022-11-29 Tencent America LLC High level control for deblocking operations
WO2021202464A1 (en) 2020-03-30 2021-10-07 Bytedance Inc. Constraints on collocated pictures in video coding

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050259730A1 (en) * 2004-05-18 2005-11-24 Sharp Laboratories Of America, Inc. Video coding with residual color conversion using reversible YCoCg
US20110274161A1 (en) * 2010-05-06 2011-11-10 Samsung Electronics Co., Ltd. Image processing method and apparatus
CN102783149A (en) * 2010-02-19 2012-11-14 Qualcomm Incorporated Adaptive motion resolution for video coding
WO2013155267A2 (en) 2012-04-11 2013-10-17 Qualcomm Incorporated Motion vector rounding
WO2016029144A1 (en) 2014-08-22 2016-02-25 Qualcomm Incorporated Unify intra block copy and inter prediction

Family Cites Families (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020196853A1 (en) * 1997-06-04 2002-12-26 Jie Liang Reduced resolution video decompression
US7295609B2 (en) * 2001-11-30 2007-11-13 Sony Corporation Method and apparatus for coding image information, method and apparatus for decoding image information, method and apparatus for coding and decoding image information, and system of coding and transmitting image information
US8374238B2 (en) * 2004-07-13 2013-02-12 Microsoft Corporation Spatial scalability in 3D sub-band decoding of SDMCTF-encoded video
KR100763178B1 (en) * 2005-03-04 2007-10-04 Samsung Electronics Co., Ltd. Method for color space scalable video coding and decoding, and apparatus for the same
JP4470898B2 (en) 2006-03-16 2010-06-02 Sony Corporation Image processing apparatus and method, and program
EP3484154A1 (en) * 2006-10-25 2019-05-15 GE Video Compression, LLC Quality scalable coding
KR101369746B1 (en) * 2007-01-22 2014-03-07 Samsung Electronics Co., Ltd. Method and apparatus for Video encoding and decoding using adaptive interpolation filter
US8599921B2 (en) * 2009-03-27 2013-12-03 Vixs Systems, Inc Adaptive partition subset selection module and method for use therewith
CN101820547A (en) * 2009-02-27 2010-09-01 源见科技(苏州)有限公司 Inter-frame mode selecting method
JP5184447B2 (en) * 2009-06-22 2013-04-17 KDDI R&D Laboratories, Inc. Video encoding apparatus and decoding apparatus
US10327008B2 (en) * 2010-10-13 2019-06-18 Qualcomm Incorporated Adaptive motion vector resolution signaling for video coding
KR20120088488A (en) * 2011-01-31 2012-08-08 Electronics and Telecommunications Research Institute method for storing temporal motion vector and apparatus using the same
US9247249B2 (en) * 2011-04-20 2016-01-26 Qualcomm Incorporated Motion vector prediction in video coding
US9167269B2 (en) 2011-10-25 2015-10-20 Qualcomm Incorporated Determining boundary strength values for deblocking filtering for video coding
US9503702B2 (en) * 2012-04-13 2016-11-22 Qualcomm Incorporated View synthesis mode for three-dimensional video coding
CN102647594B (en) 2012-04-18 2014-08-20 Peking University Integer pixel precision motion estimation method and system for same
US9414054B2 (en) * 2012-07-02 2016-08-09 Microsoft Technology Licensing, Llc Control and use of chroma quantization parameter values
TWI610574B (en) * 2012-09-29 2018-01-01 Microsoft Technology Licensing, LLC Use of chroma quantization parameter offsets in deblocking
CN103188496B (en) * 2013-03-26 2016-03-09 Beijing University of Technology Based on the method for coding quick movement estimation video of motion vector distribution prediction
CN105379277B (en) * 2013-07-15 2019-12-17 KT Corporation Method and apparatus for encoding/decoding scalable video signal
US9762927B2 (en) * 2013-09-26 2017-09-12 Qualcomm Incorporated Sub-prediction unit (PU) based temporal motion vector prediction in HEVC and sub-PU design in 3D-HEVC
US9667996B2 (en) * 2013-09-26 2017-05-30 Qualcomm Incorporated Sub-prediction unit (PU) based temporal motion vector prediction in HEVC and sub-PU design in 3D-HEVC
US9774881B2 (en) * 2014-01-08 2017-09-26 Microsoft Technology Licensing, Llc Representing motion vectors in an encoded bitstream
US10531116B2 (en) * 2014-01-09 2020-01-07 Qualcomm Incorporated Adaptive motion vector resolution signaling for video coding
US10432928B2 (en) * 2014-03-21 2019-10-01 Qualcomm Incorporated Using a current picture as a reference for video coding
WO2016048834A1 (en) * 2014-09-26 2016-03-31 Vid Scale, Inc. Intra block copy coding with temporal block vector prediction
CN114554199B (en) * 2014-09-30 2023-11-10 HFI Innovation Inc. Method for adaptive motion vector resolution for video coding
KR20170084251A (en) * 2014-11-20 2017-07-19 HFI Innovation Inc. Method of motion vector and block vector resolution control
US20160337662A1 (en) * 2015-05-11 2016-11-17 Qualcomm Incorporated Storage and signaling resolutions of motion vectors

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10986366B2 (en) 2016-06-30 2021-04-20 Interdigital Vc Holdings, Inc. Video coding with adaptive motion information refinement
WO2018110203A1 (en) * 2016-12-16 2018-06-21 Sharp Kabushiki Kaisha Moving image decoding apparatus and moving image encoding apparatus
EP3763122A4 (en) * 2018-03-07 2021-01-13 Tencent America LLC Method and apparatus for video coding
US11128870B2 (en) 2018-03-07 2021-09-21 Tencent America LLC Method and apparatus for video coding
US11627324B2 (en) 2018-03-07 2023-04-11 Tencent America LLC Adaptive block vector resolution in video coding
EP4300972A3 (en) * 2018-03-07 2024-04-10 Tencent America LLC Method and apparatus for video coding
US11431964B2 (en) 2018-11-22 2022-08-30 Beijing Bytedance Network Technology Co., Ltd. Coordination method for sub-block based inter prediction
US11632541B2 (en) 2018-11-22 2023-04-18 Beijing Bytedance Network Technology Co., Ltd. Using collocated blocks in sub-block temporal motion vector prediction mode
US11671587B2 (en) 2018-11-22 2023-06-06 Beijing Bytedance Network Technology Co., Ltd. Coordination method for sub-block based inter prediction
US11871025B2 (en) * 2019-08-13 2024-01-09 Beijing Bytedance Network Technology Co., Ltd. Motion precision in sub-block based inter prediction
US11695946B2 (en) 2019-09-22 2023-07-04 Beijing Bytedance Network Technology Co., Ltd. Reference picture resampling in video processing

Also Published As

Publication number Publication date
US10455231B2 (en) 2019-10-22
KR20200008063A (en) 2020-01-22
KR20170065542A (en) 2017-06-13
EP3189660A4 (en) 2018-03-21
US10880547B2 (en) 2020-12-29
KR102115715B1 (en) 2020-05-27
US20170295370A1 (en) 2017-10-12
EP3189660A1 (en) 2017-07-12
CA2961681C (en) 2022-08-09
CN107079164B (en) 2020-07-10
CN107079164A (en) 2017-08-18
CN114554199A (en) 2022-05-27
US20200007863A1 (en) 2020-01-02
CN111818334B (en) 2022-04-01
CA2961681A1 (en) 2016-04-07
CN114554199B (en) 2023-11-10
CN111818334A (en) 2020-10-23
KR102068828B1 (en) 2020-01-22
EP3189660B1 (en) 2023-07-12

Similar Documents

Publication Publication Date Title
US10880547B2 (en) Method of adaptive motion vector resolution for video coding
CA2964324C (en) Method of guided cross-component prediction for video coding
CA2986950C (en) Method and apparatus of error handling for video coding using intra block copy mode
US11284084B1 (en) Constraints on model-based reshaping in video processing
CN115244924A (en) Signaling across component adaptive loop filters
US20140078394A1 (en) Selective use of chroma interpolation filters in luma interpolation process
JP7322290B2 (en) Syntax for Subpicture Signaling in Video Bitstreams
CN117319645A (en) Method, apparatus and computer readable storage medium for processing video data
CN114641992B (en) Signaling of reference picture resampling
CN114223206A (en) Color palette mode using different division structures
US11595658B2 (en) Derivation of collocated motion vectors
CN115004707A (en) Interaction between adaptive color transform and quantization parameters
JP7401689B2 (en) Interaction between in-loop filtering and video tiles
WO2021219143A1 (en) Entropy coding for motion precision syntax
WO2021222871A1 (en) Methods and devices for prediction dependent residual scaling for video coding

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application. Ref document number: 15847504; Country of ref document: EP; Kind code of ref document: A1
ENP Entry into the national phase. Ref document number: 2961681; Country of ref document: CA
WWE WIPO information: entry into national phase. Ref document number: 15514129; Country of ref document: US
NENP Non-entry into the national phase. Ref country code: DE
REEP Request for entry into the European phase. Ref document number: 2015847504; Country of ref document: EP
WWE WIPO information: entry into national phase. Ref document number: 2015847504; Country of ref document: EP
ENP Entry into the national phase. Ref document number: 20177010070; Country of ref document: KR; Kind code of ref document: A