US20160182884A1 - Method of Deriving Default Disparity Vector in 3D and Multiview Video Coding - Google Patents

Method of Deriving Default Disparity Vector in 3D and Multiview Video Coding Download PDF

Info

Publication number
US20160182884A1
US20160182884A1 US14/908,273 US201414908273A US2016182884A1 US 20160182884 A1 US20160182884 A1 US 20160182884A1 US 201414908273 A US201414908273 A US 201414908273A US 2016182884 A1 US2016182884 A1 US 2016182884A1
Authority
US
United States
Prior art keywords
view
default
inter
current block
index
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US14/908,273
Other versions
US10230937B2 (en
Inventor
Jian-Liang Lin
Na Zhang
Yi-Wen Chen
Jicheng An
Yu-Lin Chang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
HFI Innovation Inc
Original Assignee
MediaTek Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by MediaTek Inc filed Critical MediaTek Inc
Priority to US14/908,273 priority Critical patent/US10230937B2/en
Assigned to MEDIATEK INC. reassignment MEDIATEK INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: AN, JICHENG, CHANG, YU-LIN, CHEN, YI-WEN, LIN, JIAN-LIANG, ZHANG, NA
Publication of US20160182884A1 publication Critical patent/US20160182884A1/en
Assigned to HFI INNOVATION INC. reassignment HFI INNOVATION INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MEDIATEK INC.
Application granted granted Critical
Publication of US10230937B2 publication Critical patent/US10230937B2/en
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106Processing image signals
    • H04N13/161Encoding, multiplexing or demultiplexing different image signal components
    • H04N13/0048
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/513Processing of motion vectors
    • H04N19/517Processing of motion vectors by encoding
    • H04N19/52Processing of motion vectors by encoding by predictive encoding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/593Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/597Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N2013/0074Stereoscopic image analysis
    • H04N2013/0081Depth or disparity estimation from stereoscopic image signals

Definitions

  • the present invention relates to three-dimensional video coding.
  • the present invention relates to disparity vector derivation in 3 D video coding.
  • Three-dimensional (3D) television has been a technology trend in recent years that intends to bring viewers sensational viewing experience.
  • Various technologies have been developed to enable 3D viewing.
  • the multi-view video is a key technology for 3D TV application among others.
  • the traditional video is a two-dimensional (2D) medium that only provides viewers a single view of a scene from the perspective of the camera.
  • the multi-view video is capable of offering arbitrary viewpoints of dynamic scenes and provides viewers the sensation of realism.
  • the multi-view video is typically created by capturing a scene using multiple cameras simultaneously, where the multiple cameras are properly located so that each camera captures the scene from one viewpoint. Accordingly, the multiple cameras will capture multiple video sequences corresponding to multiple views. In order to provide more views, more cameras have been used to generate multi-view video with a large number of video sequences associated with the views. Accordingly, the multi-view video will require a large storage space to store and/or a high bandwidth to transmit. Therefore, multi-view video coding techniques have been developed in the field to reduce the required storage space or the transmission bandwidth.
  • a straightforward approach may be to simply apply conventional video coding techniques to each single-view video sequence independently and disregard any correlation among different views. Such coding system would be very inefficient.
  • multi-view video coding exploits inter-view redundancy.
  • Various 3D coding tools have been developed or being developed by extending existing video coding standard. For example, there are standard development activities to extend H.264/AVC (advanced video coding) and HEVC (high efficiency video coding) to multi-view video coding (MVC) and 3D coding. The corresponding new standards being developed are referred as 3D-HEVC (High Efficiency Video Coding) or 3D-AVC (Advanced Video Coding) coding respectively.
  • 3D HEVC High Efficiency Video Coding
  • 3D-AVC Advanced Video Coding
  • DCP Disparity-Compensated Prediction
  • 3D-HTM test Model for three-dimensional video coding based on HEVC (High Efficiency Video Coding)
  • MCP motion-compensated prediction
  • FIG. 1 illustrates an example of 3D video coding system incorporating MCP and DCP.
  • the vector ( 110 ) used for DCP is termed as disparity vector (DV), which is analog to the motion vector (MV) used in MCP.
  • DV disparity vector
  • the DV of a DCP block can also be predicted by the disparity vector predictor (DVP) candidate derived from neighboring blocks or the temporal collocated blocks that also use inter-view reference pictures.
  • DVP disparity vector predictor
  • 3D-HTM when deriving an inter-view Merge candidate for Merge/Skip modes, if the motion information of corresponding block is not available or not valid, the inter-view Merge candidate is replaced by a DV.
  • Inter-view motion prediction is used to share the previously encoded motion information of reference views.
  • a DV for current block is derived first, and then the prediction block in the already coded picture in the reference view is located by adding the DV to the location of current block. If the prediction block is coded using MCP, the associated motion parameters can be used as candidate motion parameters for the current block in the current view.
  • the derived DV can also be directly used as a candidate DV for DCP.
  • Inter-view residual prediction is another coding tool used in 3D-HTM.
  • the residual signal of the current prediction block i.e., PU
  • the corresponding blocks can be located by respective DVs.
  • the video pictures and depth maps corresponding to a particular camera position are indicated by a view identifier (i.e., V 0 , V 1 and V 2 ). All video pictures and depth maps that belong to the same camera position are associated with the same viewId (i.e., view identifier).
  • the view identifiers are used for specifying the coding order within the access units and detecting missing views in error-prone environments.
  • An access unit includes all video pictures and depth maps corresponding to the same time instant. Inside an access unit, the video picture and, when present, the associated depth map having viewId equal to 0 are coded first, followed by the video picture and depth map having viewId equal to 1, etc.
  • the view with viewId equal to 0 i.e., V 0
  • the base view video pictures can be coded using a conventional HEVC video coder without dependence on other views.
  • motion vector predictor MVP
  • disparity vector predictor DVP
  • inter-view blocks in inter-view picture may be abbreviated as inter-view blocks.
  • the derived candidate is termed as inter-view candidates, which can be inter-view MVPs or DVPs.
  • the coding tools that codes the motion information of a current block e.g., a current prediction unit, PU
  • inter-view motion parameter prediction e.g., a current prediction unit, PU
  • a corresponding block in a neighboring view is termed as an inter-view block and the inter-view block is located using the disparity vector derived from the depth information of current block in current picture.
  • VSP View Synthesis Prediction
  • NBDV Neighboring Block Disparity Vector
  • the derived disparity vector is then used to fetch a depth block in the depth image of the reference view.
  • the procedure to derive the virtual depth can be applied for VSP to locate the corresponding depth block in a coded view.
  • the fetched depth block may have the same size of the current prediction unit (PU), and it will then be used to do backward warping for the current PU.
  • the warping operation may be performed at a sub-PU level precision, such as 2 ⁇ 2 or 4 ⁇ 4 blocks.
  • VSP is only applied for texture component coding.
  • the VSP prediction is added as a new merging candidate to signal the use of VSP prediction.
  • a VSP block may be a skipped block without any residual, or a Merge block with residual information coded.
  • the VSP-based merging candidate may also be referred as VSP merging candidate for convenience in this disclosure.
  • VSP predicted When a picture is coded as B picture and the current block is signaled as VSP predicted, the following steps are applied to determine the prediction direction of VSP:
  • VSP When a picture is coded as a P picture and the current prediction block is using VSP, uni-direction VSP is applied.
  • the VSP flag is always set as true no matter if there is an inter-view reference picture with the view index equal to the view index of the inter-view reference picture pointed by the derived DV.
  • the DV is critical in 3D video coding for inter-view motion prediction, inter-view residual prediction, disparity-compensated prediction (DCP), view synthesis prediction (VSP) or any other tools which need to indicate the correspondence between inter-view pictures.
  • DCP disparity-compensated prediction
  • VSP view synthesis prediction
  • the DVs used for the other coding tools are derived using either the scheme of neighboring block disparity vector (NBDV) or the scheme of depth oriented neighboring block disparity vector (DoNBDV) as described below.
  • NBDV neighboring block disparity vector
  • DoNBDV depth oriented neighboring block disparity vector
  • Neighboring block disparity vector In the current 3D-HEVC, a disparity vector can be used as a DVP candidate for Inter mode or as a Merge candidate for Merge/Skip mode. A derived disparity vector can also be used as an offset vector for inter-view motion prediction and inter-view residual prediction.
  • the DV When used as an offset vector, the DV is derived from spatial and temporal neighboring blocks as shown in FIGS. 2A-2B . Multiple spatial and temporal neighboring blocks are determined and DV availability of the spatial and temporal neighboring blocks is checked according to a pre-determined order. This coding tool for DV derivation based on neighboring (spatial and temporal) blocks is termed as Neighboring Block DV (NBDV).
  • the temporal neighboring block set is searched first.
  • the temporal merging candidate set includes the location at the center of the current block (i.e., BCTR) and the location diagonally across from the lower-right corner of the current block (i.e., RB) in a temporal reference picture.
  • the temporal search order starts from RB to BCTR. Once a block is identified as having a DV, the checking process will be terminated.
  • the spatial neighboring block set includes the location diagonally across from the lower-left corner of the current block (i.e., A 0 ), the location next to the left-bottom side of the current block (i.e., A 1 ), the location diagonally across from the upper-left corner of the current block (i.e., B 2 ), the location diagonally across from the upper-right corner of the current block (i.e., B 0 ), and the location next to the top-right side of the current block (i.e., B 1 ) as shown in FIG. 2B .
  • the search order for the spatial neighboring blocks is (A 1 , B 1 , B 0 , A 0 , B 2 ).
  • the disparity information can be obtained from another coding tool, named DV-MCP.
  • DV-MCP another coding tool
  • the disparity vector used for the inter-view motion prediction represents a motion correspondence between the current and the inter-view reference picture. This type of motion vector is referred to as inter-view predicted motion vector and the blocks are referred to as DV-MCP blocks.
  • FIG. 3 illustrates an example of a DV-MCP block, where the motion information of the DV-MCP block ( 310 ) is predicted from a corresponding block ( 320 ) in the inter-view reference picture.
  • the location of the corresponding block ( 320 ) is specified by a disparity vector ( 330 ).
  • the disparity vector used in the DV-MCP block represents a motion correspondence between the current and inter-view reference picture.
  • the motion information ( 322 ) of the corresponding block ( 320 ) is used to predict motion information ( 312 ) of the current block ( 310 ) in the current view.
  • the dvMcpDisparity is set to indicate that the disparity vector is used for the inter-view motion parameter prediction.
  • the dvMcpFlag of the candidate is set to 1 if the candidate is generated by inter-view motion parameter prediction and is set to 0 otherwise. If neither DCP coded blocks nor DV-MCP coded blocks are found in the above mentioned spatial and temporal neighboring blocks, then a zero vector can be used as a default disparity vector.
  • DoNBDV Depth Oriented Neighboring Block Disparity Vector
  • HEVC High Efficiency Video Coding
  • AMVP adaptive motion vector prediction
  • Merge mode the number of motion hypotheses, the reference indices, the motion vector differences, and indications specifying the used motion vector predictors are coded in the bitstream.
  • Merge mode the second mode is referred to as Merge mode. For this mode, only an indication is coded, which signals the set of motion parameters that are used for the block.
  • the motion information of spatial neighbor is directly used as the motion hypothesis of the current PU.
  • the inter-view reference picture pointed by the derived DV may not be included in the reference picture lists of the current PU. Therefore, while the VSP mode may still be selected (i.e., VSP flag could be set as true), however, the VSP process cannot be performed if the inter-view reference picture pointed by the derived DV may not be included in the reference picture lists of the current PU. In this case, the VSP mode does not have any effective motion information if the VSP mode does get selected. As a result, a mismatch between encoder and decoder will occur.
  • the Neighboring Block Disparity Vector (NBDV) derivation process checks the availability of disparity vector (DV) associated with spatial and temporal neighboring blocks. If no DV can be derived from the neighboring blocks, a default DV with a zero-valued vector pointing to the base view (with a view index equal to 0) is used.
  • the DV derived by NBDV can be further used by the process of depth-oriented NBDV (DoNBDV) to derive a refined DV.
  • DoNBDV depth-oriented NBDV
  • An example of disparity vector derivation process of NBDV (steps 1-2) and DoNBDV (step 3) according to HTM-8.0 is illustrated as follows.
  • NBDV is used to derive a DV from the spatial or temporal neighboring blocks based on a predefined order.
  • a default DV with a zero vector pointing to the base view (with a view index equal to 0) is used.
  • the base view reference picture is not included in the reference picture list of a current image unit (e.g., a slice or a largest coding unit). Under this condition, the default DV may point to a non-existing reference picture and this may cause mismatch between an encoder and decoder due to this invalid view index.
  • a method and apparatus for a three-dimensional or multi-view video encoding or decoding system utilizing unified disparity vector (DV) derivation is disclosed.
  • a three-dimensional coding tool using a derived disparity vector is selected, embodiments according to the present invention will first obtain the derived DV from one or more neighboring blocks of the current block. If the derived DV is available, the selected three-dimensional coding tool is applied to the current block using the derived DV. If the derived DV is not available, the selected three-dimensional coding tool is applied to the current block using a default DV, where the default DV is set to point to the inter-view reference picture in a reference picture list of the current block.
  • the default DV can be set to a default view index corresponding to ⁇ 1, which means the selected three-dimensional coding tool should be disabled.
  • the default DV can be determined at each slice level or each picture level.
  • the derived DV is available if a first inter-view reference picture in a first view associated with the derived DV is in one reference picture list of the current block.
  • the derived DV for the current block can be derived from one or more spatial neighboring block of the current block, one or more temporal neighboring block of the current block, or one or more spatial and one or more temporal neighboring blocks of the current block.
  • the view index of the default DV may be set to the view index of the inter-view reference picture of the current slice or picture with a minimum view index.
  • the view index of the default DV may also be set to the view index of any inter-view reference picture of the current slice or picture.
  • the view index of the default DV is set to the view index of the inter-view reference picture of the current slice or picture having a nearest view index, where the nearest view index is measured based on view distance or view index difference with the current slice or picture.
  • the view index of the default DV can be set to the view index of the inter-view reference picture having smallest quantization parameters.
  • the view index of the default DV is set to the view index of an inter-view reference picture that is found firstly among a search set according to a search order.
  • the search set includes all inter-view reference pictures in one or two reference lists of the current block and the search order starts from a zero picture index to a maximum reference picture index.
  • the current block corresponds to a prediction unit (PU) in a B slice
  • the inter-view reference pictures in the reference list- 0 can be searched before or after the inter-view reference pictures in the reference list- 1 , or the inter-view reference pictures can be searched in an interleaved order between the reference list- 0 and the reference list- 1 .
  • the vector value of the default DV can be set to a zero vector or a default vector.
  • the default vector can be set to a converted disparity that is derived from a default depth value.
  • the default depth value can be explicitly signaled or implicitly determined for both an encoder and a decoder.
  • the default depth value can be determined based on a middle value, a mean value, or a medium value of valid depth values, or based on a dominant depth value.
  • the dominant depth value can be determined based on statistic of previously reconstructed depth values.
  • the default vector can also be set to a selected disparity from default disparity values.
  • the default disparity values can be explicitly signaled or implicitly determined for both an encoder and a decoder.
  • the selected disparity can determined based on a middle value, a mean value, or a medium value of the set of the default disparity values, or based on a dominant disparity value.
  • the dominant disparity value is determined based on statistic of previously reconstructed disparity vectors.
  • FIG. 1 illustrates an example of three-dimensional video coding incorporating disparity-compensated prediction (DCP) as an alternative to motion-compensated prediction (MCP).
  • DCP disparity-compensated prediction
  • MCP motion-compensated prediction
  • FIGS. 2A-2B illustrate an example of spatial neighboring blocks of the current block belonging to a set for derivation of the VSP merging candidate according to HEVC-based 3D test Model version 7.0 (HTM-7.0).
  • FIG. 3 illustrates an example of a disparity derivation from motion-compensated prediction (DV-MCP) block, where the location of the corresponding blocks is specified by a disparity vector.
  • DV-MCP motion-compensated prediction
  • FIGS. 4A-4B illustrate an example of temporal and spatial neighboring blocks of the current block belonging to a set for derivation of the VSP merging candidate according to HEVC-based 3D test Model version 8.0 (HTM-8.0).
  • FIG. 5 illustrates an exemplary flowchart of three-dimensional or multi-view video encoding or decoding that uses unified disparity vector derivation according to an embodiment of the present invention.
  • disparity vector is widely used in various coding tools for three-dimensional video coding system.
  • the inter-view reference picture pointed by the derived DV may not be included in the reference picture lists of current block (e.g. a prediction unit (PU)). If the VSP mode is selected in this case, the VSP process cannot be correctly performed.
  • the default DV corresponding to a zero-valued vector pointing to a base view may cause issue if the base view reference picture is not included in the reference picture list of the current image unit (e.g., a slice or a largest coding unit).
  • embodiments of the present invention disclose a disparity vector derivation process that derives a disparity vector or a default disparity vector free from the issues occurring in the conventional approach.
  • the VSP flag is set to “false”. In other words, the VSP mode is disabled for the current block.
  • the default view index is set to zero when no DV can be found according to a DV derivation process (e.g. NBDV).
  • NBDV DV derivation process
  • the inter-view reference picture with zero view index may not be in the reference picture lists. Therefore, the default DV with a zero view index becomes invalid in this case.
  • another set of embodiments of the present invention modify the reference view index used by the conventional approach to avoid the issue.
  • the vector of the default DV is set to a default value (e.g., a zero vector) and the reference view index is set to the minimum view index of the inter-view reference pictures in the reference picture lists of the current slice or picture.
  • the vector of the default DV is set to a default value (e.g., a zero vector) and the reference view index is set to the view index of the inter-view reference picture which is the nearest one in terms of the view distance. If more than one has the same nearest view distance, the one with smaller view index is selected.
  • the vector of the default DV is set to a default value (e.g., a zero vector) and the reference view index is set to the view index of any inter-view reference picture in the reference picture lists of the current PU. Accordingly, the inter-view reference picture pointed by the DV derived from NBDV always corresponds to an inter-view reference picture in the reference picture lists of the current picture.
  • the default view index is set to ⁇ 1 to indicate the invalidity of the derived DV.
  • the view index associated with the derived DV is equal to ⁇ 1
  • the VSP mode is not allowed for the current PU.
  • the reference view index of the default DV is set to the minimum view index of the inter-view reference pictures in the reference picture lists of the current slice/picture.
  • the vector of the default DV is set to the DV converted by a default depth value (e.g. the middle or mean value of the valid depth values or the dominant depth value).
  • the DV is converted by calculating the projection vector between the current view and the corresponding view which is identified by the view index assigned to the default DV with the help of the camera parameters.
  • the vector of the default DV is set to a default value (e.g., a zero vector) and the reference view index of the default DV is set to the nearest one in terms of the view distance or difference of the view index in the reference picture lists of the current slice/picture. If more than one has the same nearest view distance, the one with smaller view index is selected.
  • the vector of the default DV is set to the DV converted by a default depth value (e.g. middle or mean value of the valid depth values or dominant depth value). Specifically, the DV is converted by calculating the projection vector between the current view and the corresponding view which is identified by the view index assigned to the default DV with the help of the camera parameters.
  • the view index of the default DV is set to the view index of the first inter-view reference picture by searching the reference picture with the reference index from zero to the maximum reference index in the reference picture list 0 and list 1 .
  • the searching order for list 0 and list 1 may correspond to searching all reference pictures in list 0 first and then searching all reference pictures in list 1 .
  • the searching order for list 0 and list 1 may also be interleaved searching, i.e. alternatingly searching part of list and list 1 .
  • the vector of the default DV is set as a default vector (e.g. a zero vector).
  • the view index of the default DV is set to the first inter-view reference picture by searching the reference picture with the reference index from zero to the maximum reference index in the reference picture list 0 and list 1 .
  • the searching order for list 0 and list 1 may correspond to searching all reference pictures in list 0 first and then searching all reference pictures in list 1 .
  • the vector of the default DV is set to the DV converted by a default depth value (e.g. middle or mean value of the valid depth values or dominant depth value). Specifically, the DV is converted by calculating the projection vector between the current view and the corresponding view which is identified by the view index assigned to the default DV with the help of the camera parameters.
  • the coding tools that utilize the derived DV e.g. view synthesis prediction, inter-view residual prediction and advanced residual prediction (ARP)
  • ARP advanced residual prediction
  • the view index associated with the derived DV is set to ⁇ 1.
  • the view index of the default DV can be set to the view index of the inter-view reference picture having smaller QP parameters. If more than one inter-view reference picture having the same smaller QP parameters, the one with smaller view index is selected.
  • Another aspect of the present invention addresses syntax design to support the needed modification to overcome the issues in the conventional system. Accordingly, in the ninth embodiment, the exemplary syntax design to support the needed modification is illustrated as follows.
  • the reference view index i.e., refViewldx
  • the disparity vector i.e., mvDisp
  • a default value i.e., (0, 0)
  • the variable of the refined disparity DV i.e., mvRefinedDisp is set equal to mvDisp.
  • foundFlag is a flag indicating whether an inter-view reference picture has been found in the reference picture lists.
  • Loop (b) of the syntax is associated with the reference list. For B pictures, two reference lists are used and otherwise, one reference list is used. Also, loop (b) is terminated whenever an inter-view reference picture is found in the reference picture lists (as indicated by “!foundFlag”).
  • procedure (d) it checks whether the picture order count of an underlying reference picture in the reference list (i.e., RefPicListX[i]) is equal to the picture order count of the current picture. If so, the reference view index of the default DV (i.e., refViewldx) is set to the view index of the inter-view reference picture (i.e., Vid) and foundFlag is set to 1 to terminate the process.
  • the reference view index i.e., refViewIdx
  • the disparity vector i.e., mvDisp
  • a default value i.e., (0, 0)
  • the variable of the refined disparity DV i.e., mvRefinedDisp is set equal to mvDisp.
  • loop (a) through loop (c) are also used as in the previous case.
  • procedure (e) it checks whether the view index of an underlying reference picture in the reference list (i.e., RefPicListX[i]) is equal to the underlying view index. If so, the reference view index of the default DV (i.e., refViewIdx) is set to the underlying view index (i.e., Vid) and foundFlag is set to 1 to terminate the process.
  • Variable Vid starts from 0 and increments for each iteration.
  • the view index assigned to the default according to the above syntax corresponds to a smallest view index.
  • FIG. 5 illustrates an exemplary flowchart of three-dimensional or multi-view video encoding or decoding that uses unified disparity vector derivation according to an embodiment of the present invention.
  • the system receives input data associated with a current block in a dependent view as shown in step 510 .
  • the input data may correspond to un-coded or coded texture data.
  • the input data may be retrieved from storage such as a computer memory, buffer (RAM or DRAM) or other media.
  • the video bitstream may also be received from a processor such as a controller, a central processing unit, a digital signal processor or electronic circuits that produce the input data.
  • a selected three-dimensional coding tool is selected in step 520 , where the three-dimensional coding tool utilizes a derived DV (disparity vector).
  • the derived DV for the current block is derived in step 530 .
  • the availability of the derived DV is checked in step 540 . If the derived DV is available (i.e.,“Yes” path), the selected three-dimensional coding tool is applied to the current block using the derived DV in step 550 . If the derived DV is not available (i.e., “No” path), the selected three-dimensional coding tool is applied to the current block using a default DV in step 560 , where the default DV is set to point to a second inter-view reference picture in one reference picture list of the current block.
  • input data associated with a current block in a dependent view is received.
  • the input data may correspond to un-coded or coded texture data.
  • the input data may be retrieved from storage such as a computer memory, buffer (RAM or DRAM) or other media.
  • a video bitstream may also be received from a processor such as a controller, a central processing unit, a digital signal processor or electronic circuits that produce the input data.
  • a selected three-dimensional coding tool using a derived DV is determined, where the derived DV for a current block is obtained from one or more spatial neighboring block and one or more temporal neighboring block of the current block. The selected three-dimensional coding tool is applied to the current block using the derived DV if the derived DV is available.
  • the selected three-dimensional coding tool is applied to the current block using a default DV, wherein the default DV is set to point to an inter-view reference picture in one reference picture list of the current block.
  • the availability of the derived DV may be determined by checking if a first inter-view reference picture in a first view associated with the derived DV is in one reference picture list of the current block.
  • Embodiment of the present invention as described above may be implemented in various hardware, software codes, or a combination of both.
  • an embodiment of the present invention can be a circuit integrated into a video compression chip or program code integrated into video compression software to perform the processing described herein.
  • An embodiment of the present invention may also be program code to be executed on a Digital Signal Processor (DSP) to perform the processing described herein.
  • DSP Digital Signal Processor
  • the invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA). These processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention.
  • the software code or firmware code may be developed in different programming languages and different formats or styles.
  • the software code may also be compiled for different target platforms.
  • different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A method and apparatus for a three-dimensional or multi-view video encoding or decoding system utilizing unified disparity vector derivation is disclosed. When a three-dimensional coding tool using a derived disparity vector (DV) is selected, embodiments according to the present invention will first obtain the derived DV from one or more neighboring blocks. If the derived DV is available, the selected three-dimensional coding tool is applied to the current block using the derived DV. If the derived DV is not available, the selected three-dimensional coding tool is applied to the current block using a default DV, where the default DV is set to point to an inter-view reference picture in a reference picture list of the current block.

Description

    CROSS REFERENCE TO RELATED APPLICATIONS
  • The present invention claims priority to U.S. Provisional Patent Application, Ser. No. 61/865,346, filed on Aug. 13, 2013, entitled “Inter-view Reference Picture Selection and Its Exception Handler in 3D Video Coding” and U.S. Provisional Patent Application, Ser. No. 61/895,468, filed on Oct. 25, 2013, entitled “Methods of deriving the default disparity vector in multiview and 3D video coding”. The U.S. Provisional Patent Applications are hereby incorporated by reference in their entireties.
  • FIELD OF INVENTION
  • The present invention relates to three-dimensional video coding. In particular, the present invention relates to disparity vector derivation in 3D video coding.
  • BACKGROUND OF THE INVENTION
  • Three-dimensional (3D) television has been a technology trend in recent years that intends to bring viewers sensational viewing experience. Various technologies have been developed to enable 3D viewing. Among them, the multi-view video is a key technology for 3D TV application among others. The traditional video is a two-dimensional (2D) medium that only provides viewers a single view of a scene from the perspective of the camera. However, the multi-view video is capable of offering arbitrary viewpoints of dynamic scenes and provides viewers the sensation of realism.
  • The multi-view video is typically created by capturing a scene using multiple cameras simultaneously, where the multiple cameras are properly located so that each camera captures the scene from one viewpoint. Accordingly, the multiple cameras will capture multiple video sequences corresponding to multiple views. In order to provide more views, more cameras have been used to generate multi-view video with a large number of video sequences associated with the views. Accordingly, the multi-view video will require a large storage space to store and/or a high bandwidth to transmit. Therefore, multi-view video coding techniques have been developed in the field to reduce the required storage space or the transmission bandwidth.
  • A straightforward approach may be to simply apply conventional video coding techniques to each single-view video sequence independently and disregard any correlation among different views. Such coding system would be very inefficient. In order to improve efficiency of multi-view video coding, multi-view video coding exploits inter-view redundancy. Various 3D coding tools have been developed or being developed by extending existing video coding standard. For example, there are standard development activities to extend H.264/AVC (advanced video coding) and HEVC (high efficiency video coding) to multi-view video coding (MVC) and 3D coding. The corresponding new standards being developed are referred as 3D-HEVC (High Efficiency Video Coding) or 3D-AVC (Advanced Video Coding) coding respectively. Various 3D coding tools developed or being developed for 3D-HEVC and 3D-AVC are reviewed as follows.
  • To share the previously coded texture information of adjacent views, a technique known as Disparity-Compensated Prediction (DCP) has been included in 3D-HTM (test Model for three-dimensional video coding based on HEVC (High Efficiency Video Coding)) as an alternative coding tool to motion-compensated prediction (MCP). MCP refers to an inter-picture prediction that uses previously coded pictures of the same view, while DCP refers to an inter-picture prediction that uses previously coded pictures of other views in the same access unit. FIG. 1 illustrates an example of 3D video coding system incorporating MCP and DCP. The vector (110) used for DCP is termed as disparity vector (DV), which is analog to the motion vector (MV) used in MCP. FIG. 1 illustrates three MVs (120, 130 and 140) associated with MCP. Moreover, the DV of a DCP block can also be predicted by the disparity vector predictor (DVP) candidate derived from neighboring blocks or the temporal collocated blocks that also use inter-view reference pictures. In current 3D-HTM, when deriving an inter-view Merge candidate for Merge/Skip modes, if the motion information of corresponding block is not available or not valid, the inter-view Merge candidate is replaced by a DV.
  • Inter-view motion prediction is used to share the previously encoded motion information of reference views. For deriving candidate motion parameters for a current block in a dependent view, a DV for current block is derived first, and then the prediction block in the already coded picture in the reference view is located by adding the DV to the location of current block. If the prediction block is coded using MCP, the associated motion parameters can be used as candidate motion parameters for the current block in the current view. The derived DV can also be directly used as a candidate DV for DCP.
  • Inter-view residual prediction is another coding tool used in 3D-HTM. To share the previously coded residual information of adjacent views, the residual signal of the current prediction block (i.e., PU) can be predicted by the residual signals of the corresponding blocks in the inter-view pictures. The corresponding blocks can be located by respective DVs. The video pictures and depth maps corresponding to a particular camera position are indicated by a view identifier (i.e., V0, V1 and V2). All video pictures and depth maps that belong to the same camera position are associated with the same viewId (i.e., view identifier). The view identifiers are used for specifying the coding order within the access units and detecting missing views in error-prone environments. An access unit includes all video pictures and depth maps corresponding to the same time instant. Inside an access unit, the video picture and, when present, the associated depth map having viewId equal to 0 are coded first, followed by the video picture and depth map having viewId equal to 1, etc. The view with viewId equal to 0 (i.e., V0) is also referred to as the base view or the independent view. The base view video pictures can be coded using a conventional HEVC video coder without dependence on other views.
  • For the current block, motion vector predictor (MVP)/ disparity vector predictor (DVP) can be derived from the inter-view blocks in the inter-view pictures. In the following, inter-view blocks in inter-view picture may be abbreviated as inter-view blocks. The derived candidate is termed as inter-view candidates, which can be inter-view MVPs or DVPs. The coding tools that codes the motion information of a current block (e.g., a current prediction unit, PU) based on previously coded motion information in other views is termed as inter-view motion parameter prediction. Furthermore, a corresponding block in a neighboring view is termed as an inter-view block and the inter-view block is located using the disparity vector derived from the depth information of current block in current picture.
  • View Synthesis Prediction (VSP) is a technique to remove inter-view redundancies among video signal from different viewpoints, in which synthetic signal is used as references to predict a current picture. In 3D-HEVC test model, HTM-7.0, there exists a process to derive a disparity vector predictor, known as NBDV (Neighboring Block Disparity Vector). The derived disparity vector is then used to fetch a depth block in the depth image of the reference view. The procedure to derive the virtual depth can be applied for VSP to locate the corresponding depth block in a coded view. The fetched depth block may have the same size of the current prediction unit (PU), and it will then be used to do backward warping for the current PU. In addition, the warping operation may be performed at a sub-PU level precision, such as 2×2 or 4×4 blocks.
  • In current implementation, VSP is only applied for texture component coding. Also the VSP prediction is added as a new merging candidate to signal the use of VSP prediction. In such a way, a VSP block may be a skipped block without any residual, or a Merge block with residual information coded. The VSP-based merging candidate may also be referred as VSP merging candidate for convenience in this disclosure.
  • When a picture is coded as B picture and the current block is signaled as VSP predicted, the following steps are applied to determine the prediction direction of VSP:
      • Obtain the view index refViewIdxNBDV of the derived disparity vector from NBDV;
      • Obtain the reference picture list RefPicListNBDV (either RefPicList0 or RefPicList1) that is associated with the reference picture with view index refViewIdxNBDV;
      • Check the availability of an interview reference picture with view index refViewldx that is not equal to refViewIdxNBDV in the reference picture list other than RefPicListNBDV;
        • If such a different interview reference picture is found, bi-direction VSP is applied. The depth block from view index refViewIdxNBDV is used as the current block's depth information (in case of texture-first coding order), and the two different interview reference pictures (each from one reference picture list) are accessed via backward warping process and further weighted to achieve the final backward VSP predictor;
        • Otherwise, uni-direction VSP is applied with RefPicListNBDV as the reference picture list for prediction.
  • When a picture is coded as a P picture and the current prediction block is using VSP, uni-direction VSP is applied.
  • It is noted that, when adding the VSP Merge candidate, the VSP flag is always set as true no matter if there is an inter-view reference picture with the view index equal to the view index of the inter-view reference picture pointed by the derived DV.
  • The DV is critical in 3D video coding for inter-view motion prediction, inter-view residual prediction, disparity-compensated prediction (DCP), view synthesis prediction (VSP) or any other tools which need to indicate the correspondence between inter-view pictures. The DV derivation utilized in current test model of 3D-HEVC is described as follow.
  • DV Derivation in 3D-HEVC. Currently, except for the DV for DCP, the DVs used for the other coding tools are derived using either the scheme of neighboring block disparity vector (NBDV) or the scheme of depth oriented neighboring block disparity vector (DoNBDV) as described below.
  • Neighboring block disparity vector (NBDV). In the current 3D-HEVC, a disparity vector can be used as a DVP candidate for Inter mode or as a Merge candidate for Merge/Skip mode. A derived disparity vector can also be used as an offset vector for inter-view motion prediction and inter-view residual prediction. When used as an offset vector, the DV is derived from spatial and temporal neighboring blocks as shown in FIGS. 2A-2B. Multiple spatial and temporal neighboring blocks are determined and DV availability of the spatial and temporal neighboring blocks is checked according to a pre-determined order. This coding tool for DV derivation based on neighboring (spatial and temporal) blocks is termed as Neighboring Block DV (NBDV). The temporal neighboring block set, as shown in FIG. 2A, is searched first. The temporal merging candidate set includes the location at the center of the current block (i.e., BCTR) and the location diagonally across from the lower-right corner of the current block (i.e., RB) in a temporal reference picture. The temporal search order starts from RB to BCTR. Once a block is identified as having a DV, the checking process will be terminated. The spatial neighboring block set includes the location diagonally across from the lower-left corner of the current block (i.e., A0), the location next to the left-bottom side of the current block (i.e., A1), the location diagonally across from the upper-left corner of the current block (i.e., B2), the location diagonally across from the upper-right corner of the current block (i.e., B0), and the location next to the top-right side of the current block (i.e., B1) as shown in FIG. 2B. The search order for the spatial neighboring blocks is (A1, B1, B0, A0, B2).
  • If a DCP coded block is not found in the neighboring block set (i.e., spatial and temporal neighboring blocks as shown in FIGS. 2A and 2B), the disparity information can be obtained from another coding tool, named DV-MCP. In this case, when a spatial neighboring block is MCP coded block and its motion is predicted by the inter-view motion prediction, as shown in FIG. 3, the disparity vector used for the inter-view motion prediction represents a motion correspondence between the current and the inter-view reference picture. This type of motion vector is referred to as inter-view predicted motion vector and the blocks are referred to as DV-MCP blocks. FIG. 3 illustrates an example of a DV-MCP block, where the motion information of the DV-MCP block (310) is predicted from a corresponding block (320) in the inter-view reference picture. The location of the corresponding block (320) is specified by a disparity vector (330). The disparity vector used in the DV-MCP block represents a motion correspondence between the current and inter-view reference picture. The motion information (322) of the corresponding block (320) is used to predict motion information (312) of the current block (310) in the current view.
  • To indicate whether a MCP block is DV-MCP coded and to store the disparity vector for the inter-view motion parameters prediction, two variables are used to represent the motion vector information for each block:
      • dvMcpFlag, and
      • dvMcpDisparity.
  • When dvMcpFlag is equal to 1, the dvMcpDisparity is set to indicate that the disparity vector is used for the inter-view motion parameter prediction. In the construction process for the AMVP mode and Merge candidate list, the dvMcpFlag of the candidate is set to 1 if the candidate is generated by inter-view motion parameter prediction and is set to 0 otherwise. If neither DCP coded blocks nor DV-MCP coded blocks are found in the above mentioned spatial and temporal neighboring blocks, then a zero vector can be used as a default disparity vector.
  • Depth Oriented Neighboring Block Disparity Vector (DoNBDV). A method to enhance the NBDV by extracting a more accurate disparity vector from the depth map is utilized in current 3D-HEVC. A depth block from coded depth map in the same access unit is first retrieved and used as a virtual depth of the current block. To be specific, the refined DV is converted from the maximum disparity of the pixel subset in the virtual depth block which is located by the DV derived using NBDV. This coding tool for DV derivation is termed as Depth-oriented NBDV (DoNBDV).
  • In HEVC, two different modes for signaling the motion parameters for a block are specified. In the first mode, which is referred to as adaptive motion vector prediction (AMVP) mode, the number of motion hypotheses, the reference indices, the motion vector differences, and indications specifying the used motion vector predictors are coded in the bitstream. The second mode is referred to as Merge mode. For this mode, only an indication is coded, which signals the set of motion parameters that are used for the block. In the current 3D-HEVC, during the process of collecting motion hypotheses for AMVP, if the reference picture type of spatial neighbor is the same as the reference picture type of current PU (inter-view or temporal) and the picture order count (POC) of the reference picture of spatial neighbor is equal to the POC of the reference picture of the current PU, the motion information of spatial neighbor is directly used as the motion hypothesis of the current PU.
  • In the conventional scheme, the inter-view reference picture pointed by the derived DV may not be included in the reference picture lists of the current PU. Therefore, while the VSP mode may still be selected (i.e., VSP flag could be set as true), however, the VSP process cannot be performed if the inter-view reference picture pointed by the derived DV may not be included in the reference picture lists of the current PU. In this case, the VSP mode does not have any effective motion information if the VSP mode does get selected. As a result, a mismatch between encoder and decoder will occur.
  • Furthermore, in the conventional 3D-HEVC, the Neighboring Block Disparity Vector (NBDV) derivation process checks the availability of disparity vector (DV) associated with spatial and temporal neighboring blocks. If no DV can be derived from the neighboring blocks, a default DV with a zero-valued vector pointing to the base view (with a view index equal to 0) is used. The DV derived by NBDV can be further used by the process of depth-oriented NBDV (DoNBDV) to derive a refined DV. An example of disparity vector derivation process of NBDV (steps 1-2) and DoNBDV (step 3) according to HTM-8.0 is illustrated as follows.
      • 1. The disparity vector (DV) is set to (0, 0) initially.
      • 2. The NBDV derivation is performed as follows.
        • a) Search the temporal neighboring blocks to determine if the disparity vector can be found in these temporal neighbouring blocks. Once a DV is found, the DV found is used as the output of the NBDV process and the process is terminated. In HTM-8.0, two temporal neighboring blocks are used, including a co-located block in co-located picture and a co-located block in RAP (Random Access Point) picture, where the two co-located blocks correspond to a central block in the co-located picture and the RAP picture respectively as shown in FIG. 4A.
        • b) Search the spatial neighbouring blocks (i.e., blocks A1 and B1 as shown in FIG. 4B) to determine if a disparity vector can be found in these spatial neighbouring blocks. Once a DV is found, the DV found is used as the output of the NBDV process and the process is terminated.
        • c) Search the spatial neighbouring blocks (i.e., blocks A1 and B1 as shown in FIG. 4B) to determine if an intrinsic disparity vector can be found in these spatial neighbouring blocks. The intrinsic disparity vector is the disparity information obtained from spatial neighboring DV-MCP blocks whose motion is predicted from a corresponding block in the inter-view reference picture where the location of the corresponding blocks is specified by a disparity vector as shown in FIG. 3. The disparity vector used in the DV-MCP block represents a motion correspondence between the current and inter-view reference pictures. Once an intrinsic DV is found, the found DV is used as the output of the NBDV process and the process is terminated.
        • d) If there is still no DV found, a zero vector with a zero view index is used as a default output for the NBDV process.
      • 3. If a flag (i.e., depth_refinement_flag) indicating whether NBDV is further refined from the depth map, is equal to 1, then a refined NBDV, DVref is derived as follows.
        • a) Find the corresponding depth block of the reference view by using NBDV,
        • b) Select the representative depth value in the corresponding depth block, and
        • c) Convert the representative depth value to the disparity vector.
  • In current 3D-HEVC, NBDV is used to derive a DV from the spatial or temporal neighboring blocks based on a predefined order. When no DV can be derived from the neighboring blocks, a default DV with a zero vector pointing to the base view (with a view index equal to 0) is used. However, there may be cases that the base view reference picture is not included in the reference picture list of a current image unit (e.g., a slice or a largest coding unit). Under this condition, the default DV may point to a non-existing reference picture and this may cause mismatch between an encoder and decoder due to this invalid view index.
  • SUMMARY OF THE INVENTION
  • A method and apparatus for a three-dimensional or multi-view video encoding or decoding system utilizing unified disparity vector (DV) derivation is disclosed. When a three-dimensional coding tool using a derived disparity vector is selected, embodiments according to the present invention will first obtain the derived DV from one or more neighboring blocks of the current block. If the derived DV is available, the selected three-dimensional coding tool is applied to the current block using the derived DV. If the derived DV is not available, the selected three-dimensional coding tool is applied to the current block using a default DV, where the default DV is set to point to the inter-view reference picture in a reference picture list of the current block. If the derived DV is not available and no inter-view reference picture can be found in any reference picture list of the current block, the default DV can be set to a default view index corresponding to −1, which means the selected three-dimensional coding tool should be disabled. The default DV can be determined at each slice level or each picture level. In some embodiments, the derived DV is available if a first inter-view reference picture in a first view associated with the derived DV is in one reference picture list of the current block. The derived DV for the current block can be derived from one or more spatial neighboring block of the current block, one or more temporal neighboring block of the current block, or one or more spatial and one or more temporal neighboring blocks of the current block.
  • One aspect of the present invention addresses the view index selection for the default DV. The view index of the default DV may be set to the view index of the inter-view reference picture of the current slice or picture with a minimum view index. The view index of the default DV may also be set to the view index of any inter-view reference picture of the current slice or picture. In yet another example, the view index of the default DV is set to the view index of the inter-view reference picture of the current slice or picture having a nearest view index, where the nearest view index is measured based on view distance or view index difference with the current slice or picture. Furthermore, the view index of the default DV can be set to the view index of the inter-view reference picture having smallest quantization parameters.
  • In another embodiment, the view index of the default DV is set to the view index of an inter-view reference picture that is found firstly among a search set according to a search order. The search set includes all inter-view reference pictures in one or two reference lists of the current block and the search order starts from a zero picture index to a maximum reference picture index. When the current block corresponds to a prediction unit (PU) in a B slice, the inter-view reference pictures in the reference list-0 can be searched before or after the inter-view reference pictures in the reference list-1, or the inter-view reference pictures can be searched in an interleaved order between the reference list-0 and the reference list-1.
  • Another aspect of the present invention addresses the vector value for the default DV. The vector value of the default DV can be set to a zero vector or a default vector. The default vector can be set to a converted disparity that is derived from a default depth value. The default depth value can be explicitly signaled or implicitly determined for both an encoder and a decoder. The default depth value can be determined based on a middle value, a mean value, or a medium value of valid depth values, or based on a dominant depth value. The dominant depth value can be determined based on statistic of previously reconstructed depth values.
  • The default vector can also be set to a selected disparity from default disparity values. The default disparity values can be explicitly signaled or implicitly determined for both an encoder and a decoder.
  • The selected disparity can determined based on a middle value, a mean value, or a medium value of the set of the default disparity values, or based on a dominant disparity value. The dominant disparity value is determined based on statistic of previously reconstructed disparity vectors.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 illustrates an example of three-dimensional video coding incorporating disparity-compensated prediction (DCP) as an alternative to motion-compensated prediction (MCP).
  • FIGS. 2A-2B illustrate an example of spatial neighboring blocks of the current block belonging to a set for derivation of the VSP merging candidate according to HEVC-based 3D test Model version 7.0 (HTM-7.0).
  • FIG. 3 illustrates an example of a disparity derivation from motion-compensated prediction (DV-MCP) block, where the location of the corresponding blocks is specified by a disparity vector.
  • FIGS. 4A-4B illustrate an example of temporal and spatial neighboring blocks of the current block belonging to a set for derivation of the VSP merging candidate according to HEVC-based 3D test Model version 8.0 (HTM-8.0).
  • FIG. 5 illustrates an exemplary flowchart of three-dimensional or multi-view video encoding or decoding that uses unified disparity vector derivation according to an embodiment of the present invention.
  • DETAILED DESCRIPTION
  • As described above, disparity vector (DV) is widely used in various coding tools for three-dimensional video coding system. However, the inter-view reference picture pointed by the derived DV may not be included in the reference picture lists of current block (e.g. a prediction unit (PU)). If the VSP mode is selected in this case, the VSP process cannot be correctly performed. Furthermore, the default DV corresponding to a zero-valued vector pointing to a base view may cause issue if the base view reference picture is not included in the reference picture list of the current image unit (e.g., a slice or a largest coding unit).
  • Accordingly, embodiments of the present invention disclose a disparity vector derivation process that derives a disparity vector or a default disparity vector free from the issues occurring in the conventional approach. In the first embodiment, if there is no available DV or no inter-view reference picture with the view index equal to the view index of the derived DV in the reference picture lists of current (e.g., a prediction unit (PU)), the VSP flag is set to “false”. In other words, the VSP mode is disabled for the current block.
  • In current 3D-HEVC, the default view index is set to zero when no DV can be found according to a DV derivation process (e.g. NBDV). However, the inter-view reference picture with zero view index may not be in the reference picture lists. Therefore, the default DV with a zero view index becomes invalid in this case. Accordingly, another set of embodiments of the present invention modify the reference view index used by the conventional approach to avoid the issue. In the second embodiment of the present invention, the vector of the default DV is set to a default value (e.g., a zero vector) and the reference view index is set to the minimum view index of the inter-view reference pictures in the reference picture lists of the current slice or picture. In the third embodiment, the vector of the default DV is set to a default value (e.g., a zero vector) and the reference view index is set to the view index of the inter-view reference picture which is the nearest one in terms of the view distance. If more than one has the same nearest view distance, the one with smaller view index is selected. In the fourth embodiment, the vector of the default DV is set to a default value (e.g., a zero vector) and the reference view index is set to the view index of any inter-view reference picture in the reference picture lists of the current PU. Accordingly, the inter-view reference picture pointed by the DV derived from NBDV always corresponds to an inter-view reference picture in the reference picture lists of the current picture. When no inter-view reference pictures exist in the reference picture lists of the current PU, the default view index is set to −1 to indicate the invalidity of the derived DV. In this case when the view index associated with the derived DV is equal to −1, the VSP mode is not allowed for the current PU.
  • In the fifth embodiment, the reference view index of the default DV is set to the minimum view index of the inter-view reference pictures in the reference picture lists of the current slice/picture. The vector of the default DV is set to the DV converted by a default depth value (e.g. the middle or mean value of the valid depth values or the dominant depth value). Specifically, the DV is converted by calculating the projection vector between the current view and the corresponding view which is identified by the view index assigned to the default DV with the help of the camera parameters.
  • In the sixth embodiment, the vector of the default DV is set to a default value (e.g., a zero vector) and the reference view index of the default DV is set to the nearest one in terms of the view distance or difference of the view index in the reference picture lists of the current slice/picture. If more than one has the same nearest view distance, the one with smaller view index is selected. The vector of the default DV is set to the DV converted by a default depth value (e.g. middle or mean value of the valid depth values or dominant depth value). Specifically, the DV is converted by calculating the projection vector between the current view and the corresponding view which is identified by the view index assigned to the default DV with the help of the camera parameters.
  • In the seventh embodiment, the view index of the default DV is set to the view index of the first inter-view reference picture by searching the reference picture with the reference index from zero to the maximum reference index in the reference picture list 0 and list 1. The searching order for list 0 and list 1 may correspond to searching all reference pictures in list 0 first and then searching all reference pictures in list 1. The searching order for list 0 and list 1 may also be interleaved searching, i.e. alternatingly searching part of list and list 1. The vector of the default DV is set as a default vector (e.g. a zero vector).
  • In the eighth embodiment, the view index of the default DV is set to the first inter-view reference picture by searching the reference picture with the reference index from zero to the maximum reference index in the reference picture list 0 and list 1. The searching order for list 0 and list 1 may correspond to searching all reference pictures in list 0 first and then searching all reference pictures in list 1. The vector of the default DV is set to the DV converted by a default depth value (e.g. middle or mean value of the valid depth values or dominant depth value). Specifically, the DV is converted by calculating the projection vector between the current view and the corresponding view which is identified by the view index assigned to the default DV with the help of the camera parameters. When no inter-view reference pictures is included in the reference picture lists of the current slice/picture, the coding tools that utilize the derived DV (e.g. view synthesis prediction, inter-view residual prediction and advanced residual prediction (ARP)) will not be allowed. In this case, the view index associated with the derived DV is set to −1.
  • Multiple examples of selecting a valid view index for the default DV have been illustrated above. However, these examples are not meant for providing an exhaustive list of valid view index selection. A skilled person may select other valid view index for the default DV. For example, the view index of the default DV can be set to the view index of the inter-view reference picture having smaller QP parameters. If more than one inter-view reference picture having the same smaller QP parameters, the one with smaller view index is selected.
  • Another aspect of the present invention addresses syntax design to support the needed modification to overcome the issues in the conventional system. Accordingly, in the ninth embodiment, the exemplary syntax design to support the needed modification is illustrated as follows.
  • When no DV is available, the reference view index (i.e., refViewldx) is set equal to 0, and the disparity vector is (i.e., mvDisp) is set equal to a default value, (i.e., (0, 0)). The variable of the refined disparity DV (i.e., mvRefinedDisp) is set equal to mvDisp.

  • for (Vid=0, foundFlag=0; Vid<ViewIdx && !foundFlag; Vid++)   (a)

  • for (X=0; X<(the current slice is a B slice? 2:1) && !foundFlag; X++)   (b)

  • for (i=0; i<NumRefPicsLX && !foundFlag; i++)   (c)

  • When Viewldx(RefPicListX[i]) is equal to Vid and PicOrderCnt(RefPicListX[i])==PicOrderCnt of the current picture, refViewldx is set equal to Vid and foundFlag is set equal to 1.   (d)
  • In the above syntax, foundFlag is a flag indicating whether an inter-view reference picture has been found in the reference picture lists. Loop (a) of the syntax is associated with the search through all views (i.e., from Vid=0 to Vid=ViewIdx −1, where Viewldx is the number of views). Loop (a) is terminated whenever an inter-view reference picture is found in the reference picture lists (as indicated by “!foundFlag”). Loop (b) of the syntax is associated with the reference list. For B pictures, two reference lists are used and otherwise, one reference list is used. Also, loop (b) is terminated whenever an inter-view reference picture is found in the reference picture lists (as indicated by “!foundFlag”). Loop (c) of the syntax is associated with the search through all reference pictures in the corresponding reference list (i.e., from i=0 to i=NumRefPicsLX −1, where NumRefPicsLX is the number of reference picture in reference list LX). Loop (c) is terminated whenever an inter-view reference picture is found in the reference picture lists (as indicated by “!foundFlag”). In procedure (d), it checks whether the picture order count of an underlying reference picture in the reference list (i.e., RefPicListX[i]) is equal to the picture order count of the current picture. If so, the reference view index of the default DV (i.e., refViewldx) is set to the view index of the inter-view reference picture (i.e., Vid) and foundFlag is set to 1 to terminate the process.
  • In the tenth embodiment, another exemplary syntax design to support the needed modification is illustrated as follows.
  • When no DV is available, the reference view index (i.e., refViewIdx) is set equal to 0, and the disparity vector is (i.e., mvDisp) is set equal to a default value, (i.e., (0, 0)). The variable of the refined disparity DV (i.e., mvRefinedDisp) is set equal to mvDisp.

  • for (Vid=0, foundFlag=0; Vid<ViewIdx && !foundFlag; Vid++)   (a)

  • for (X=0; X<(the current slice is a B slice? 2:1) && !foundFlag; X++)   (b)

  • for (i=0; i<NumRefPicsLX && !foundFlag; i++)   (c)

  • When ViewIdx(RefPicListX[i]) is equal to Vid and PicOrderCnt(RefPicListX[i])==PicOrderCnt of the current picture, refViewIdx is set equal to Vid and foundFlag is set equal to 1.   (e)
  • In the above syntax, loop (a) through loop (c) are also used as in the previous case. In procedure (e), it checks whether the view index of an underlying reference picture in the reference list (i.e., RefPicListX[i]) is equal to the underlying view index. If so, the reference view index of the default DV (i.e., refViewIdx) is set to the underlying view index (i.e., Vid) and foundFlag is set to 1 to terminate the process. Variable Vid starts from 0 and increments for each iteration. The view index assigned to the default according to the above syntax corresponds to a smallest view index.
  • FIG. 5 illustrates an exemplary flowchart of three-dimensional or multi-view video encoding or decoding that uses unified disparity vector derivation according to an embodiment of the present invention. The system receives input data associated with a current block in a dependent view as shown in step 510. The input data may correspond to un-coded or coded texture data. The input data may be retrieved from storage such as a computer memory, buffer (RAM or DRAM) or other media. The video bitstream may also be received from a processor such as a controller, a central processing unit, a digital signal processor or electronic circuits that produce the input data. A selected three-dimensional coding tool is selected in step 520, where the three-dimensional coding tool utilizes a derived DV (disparity vector). The derived DV for the current block is derived in step 530. The availability of the derived DV is checked in step 540. If the derived DV is available (i.e.,“Yes” path), the selected three-dimensional coding tool is applied to the current block using the derived DV in step 550. If the derived DV is not available (i.e., “No” path), the selected three-dimensional coding tool is applied to the current block using a default DV in step 560, where the default DV is set to point to a second inter-view reference picture in one reference picture list of the current block.
  • The flowchart shown above is intended to illustrate examples of unified disparity vector derivation. A person skilled in the art may modify each step, re-arranges the steps, split a step, or combine steps to practice the present invention without departing from the spirit of the present invention.
  • In an embodiment of the present invention, input data associated with a current block in a dependent view is received. The input data may correspond to un-coded or coded texture data. The input data may be retrieved from storage such as a computer memory, buffer (RAM or DRAM) or other media. A video bitstream may also be received from a processor such as a controller, a central processing unit, a digital signal processor or electronic circuits that produce the input data. A selected three-dimensional coding tool using a derived DV is determined, where the derived DV for a current block is obtained from one or more spatial neighboring block and one or more temporal neighboring block of the current block. The selected three-dimensional coding tool is applied to the current block using the derived DV if the derived DV is available. If the derived DV is not available, the selected three-dimensional coding tool is applied to the current block using a default DV, wherein the default DV is set to point to an inter-view reference picture in one reference picture list of the current block. The availability of the derived DV may be determined by checking if a first inter-view reference picture in a first view associated with the derived DV is in one reference picture list of the current block.
  • The above description is presented to enable a person of ordinary skill in the art to practice the present invention as provided in the context of a particular application and its requirement. Various modifications to the described embodiments will be apparent to those with skill in the art, and the general principles defined herein may be applied to other embodiments. Therefore, the present invention is not intended to be limited to the particular embodiments shown and described, but is to be accorded the widest scope consistent with the principles and novel features herein disclosed. In the above detailed description, various specific details are illustrated in order to provide a thorough understanding of the present invention. Nevertheless, it will be understood by those skilled in the art that the present invention may be practiced.
  • Embodiment of the present invention as described above may be implemented in various hardware, software codes, or a combination of both. For example, an embodiment of the present invention can be a circuit integrated into a video compression chip or program code integrated into video compression software to perform the processing described herein. An embodiment of the present invention may also be program code to be executed on a Digital Signal Processor (DSP) to perform the processing described herein. The invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA). These processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention. The software code or firmware code may be developed in different programming languages and different formats or styles. The software code may also be compiled for different target platforms. However, different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.
  • The invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described examples are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.

Claims (22)

1. A method of video coding for a three-dimensional or multi-view video encoding or decoding system, the method comprising:
receiving input data associated with a current block in a dependent view;
determining a selected three-dimensional coding tool, wherein the three-dimensional coding tool utilizes a derived DV (disparity vector);
deriving the derived DV for the current block from one or more neighboring block of the current block;
applying the selected three-dimensional coding tool to the current block using the derived DV if the derived DV is available; and
if the derived DV is not available, applying the selected three-dimensional coding tool to the current block using a default DV, wherein the default DV is set to point to an inter-view reference picture in one reference picture list of the current block.
2. The method of claim 1, wherein the derived DV is available if a first inter-view reference picture in a first view associated with the derived DV is in one reference picture list of the current block.
3. The method of claim 1, wherein the selected three-dimensional coding tool corresponds to VSP (view synthesis prediction).
4. The method of claim 1, wherein a default view index of the default DV is set to a view index of one inter-view reference picture of a current slice or picture with a minimum view index, and wherein the current slice or picture contains the current block.
5. The method of claim 1, wherein a default view index of the default DV is set to a view index of any inter-view reference picture in one reference picture list of the current block.
6. The method of claim 1, wherein a default view index of the default DV is set to a view index of one inter-view reference picture of a current slice or picture having a nearest view index, wherein the nearest view index is measured based on view distance or view index difference with the current slice or picture, and the current slice or picture contains the current block.
7. The method of claim 1, wherein if the derived DV is not available and if no inter-view reference picture can be found in any reference picture list of the current block, the default DV is set to a default view index corresponding to −1.
8. The method of claim 1, wherein if the derived DV is not available and if no inter-view reference picture can be found in any reference picture list of the current block, the selected three-dimensional coding tool is disabled.
9. The method of claim 1, wherein a default view index of the default DV is set to a view index of one inter-view reference picture having smallest quantization parameters.
10. The method of claim 1, wherein a default view index of the default DV is set to a view index of one inter-view reference picture that is found firstly among a search set according to a search order, wherein the search set includes all inter-view reference pictures in one or two reference lists of the current block, and wherein the search order starts from a zero picture index to a maximum reference picture index.
11. The method of claim 10, wherein the search set includes all inter-view reference pictures in reference list-0 and reference list-1 of the current block when the current block corresponds to a prediction unit (PU) in a B slice, and wherein the inter-view reference pictures in the reference list-0 are searched before or after the inter-view reference pictures in the reference list-1, or the inter-view reference pictures are searched in an interleaved order between the reference list-0 and the reference list-1.
12. The method of claim 1, wherein a default view index of the default DV is set to a view index of one inter-view reference picture that is found firstly among a search set according to a search order, wherein the search set includes all inter-view reference pictures in one or two reference lists of the current block, and wherein the search order starts from a zero view index to current view index minus 1.
13. The method of claim 12, wherein the search set includes all inter-view reference pictures in reference list-0 and reference list-1 of the current block when the current block corresponds to a prediction unit (PU) in a B slice, and wherein the search order starts from a zero picture index to a maximum reference picture index and wherein the inter-view reference pictures in the reference list-0 are searched before or after the inter-view reference pictures in the reference list-1, or the inter-view reference pictures are searched in an interleaved order between the reference list-0 and the reference list-1.
14. The method of claim 1, wherein a vector value of the default DV is set to a zero vector or a default vector.
15. The method of claim 14, wherein the default vector is derived from a converted disparity that is converted from a default depth value.
16. The method of claim 15, wherein the default depth value is explicitly signaled or implicitly determined for both an encoder and a decoder.
17. The method of claim 15, wherein the default depth value is determined based on a middle value, a mean value, or a medium value of valid depth values, or based on a dominant depth value, and wherein the dominant depth value is determined based on statistic of previously reconstructed depth values.
18. (canceled)
19. The method of claim 14, wherein the default vector is set to a selected disparity from default disparity values.
20. The method of claim 1, wherein the default DV is determined at each slice level or each picture level.
21. The method of claim 1, wherein the derived DV for the current block is derived from one or more spatial neighboring block of the current block, one or more temporal neighboring block of the current block, or one or more spatial and one or more temporal neighboring blocks of the current block.
22. (canceled)
US14/908,273 2013-08-13 2014-08-13 Method of deriving default disparity vector in 3D and multiview video coding Active 2034-11-11 US10230937B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/908,273 US10230937B2 (en) 2013-08-13 2014-08-13 Method of deriving default disparity vector in 3D and multiview video coding

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201361865346P 2013-08-13 2013-08-13
US201361895468P 2013-10-25 2013-10-25
US14/908,273 US10230937B2 (en) 2013-08-13 2014-08-13 Method of deriving default disparity vector in 3D and multiview video coding
PCT/CN2014/084240 WO2015021914A1 (en) 2013-08-13 2014-08-13 Method of deriving default disparity vector in 3d and multiview video coding

Publications (2)

Publication Number Publication Date
US20160182884A1 true US20160182884A1 (en) 2016-06-23
US10230937B2 US10230937B2 (en) 2019-03-12

Family

ID=52468061

Family Applications (1)

Application Number Title Priority Date Filing Date
US14/908,273 Active 2034-11-11 US10230937B2 (en) 2013-08-13 2014-08-13 Method of deriving default disparity vector in 3D and multiview video coding

Country Status (5)

Country Link
US (1) US10230937B2 (en)
EP (1) EP3025498B1 (en)
CN (1) CN105453561B (en)
CA (1) CA2920413C (en)
WO (1) WO2015021914A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150189323A1 (en) * 2014-01-02 2015-07-02 Mediatek Singapore Pte. Ltd. Method of Three-Dimensional and Multiview Video Coding Using a Disparity Vector
US20150304681A1 (en) * 2012-07-03 2015-10-22 Mediatek Singapore Pte. Ltd. Method and apparatus of inter-view motion vector prediction and disparity vector prediction in 3d video coding
US20150339826A1 (en) * 2014-05-22 2015-11-26 Brain Corporation Apparatus and methods for robotic operation using video imagery
US9939253B2 (en) 2014-05-22 2018-04-10 Brain Corporation Apparatus and methods for distance estimation using multiple image sensors
US10032280B2 (en) 2014-09-19 2018-07-24 Brain Corporation Apparatus and methods for tracking salient features
US10194163B2 (en) 2014-05-22 2019-01-29 Brain Corporation Apparatus and methods for real time estimation of differential motion in live video
US10197664B2 (en) 2015-07-20 2019-02-05 Brain Corporation Apparatus and methods for detection of objects using broadband signals
US10587894B2 (en) * 2014-10-08 2020-03-10 Lg Electronics Inc. Method and device for encoding/decoding 3D video
CN113170153A (en) * 2018-11-20 2021-07-23 交互数字Vc控股公司 Initializing current picture reference block vectors based on binary trees

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105453561B (en) 2013-08-13 2018-10-26 寰发股份有限公司 The method of export acquiescence disparity vector in three-dimensional and multi-view video coding
BR112021022174A2 (en) 2019-05-11 2021-12-21 Beijing Bytedance Network Tech Co Ltd Method for processing video data, apparatus for processing video data, storage medium and recording medium
CN117560490A (en) 2019-07-27 2024-02-13 北京字节跳动网络技术有限公司 Restriction of use of tools according to reference picture types
EP4029245A4 (en) 2019-10-12 2022-11-23 Beijing Bytedance Network Technology Co., Ltd. High level syntax for video coding tools

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110216833A1 (en) * 2008-10-17 2011-09-08 Nokia Corporation Sharing of motion vector in 3d video coding
US20130114724A1 (en) * 2011-11-07 2013-05-09 Canon Kabushiki Kaisha Image encoding method, image encoding apparatus, and related encoding medium, image decoding method, image decoding apparatus, and related decoding medium
US20130229485A1 (en) * 2011-08-30 2013-09-05 Nokia Corporation Apparatus, a Method and a Computer Program for Video Coding and Decoding

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6055274A (en) * 1997-12-30 2000-04-25 Intel Corporation Method and apparatus for compressing multi-view video
CN101056398A (en) * 2006-03-29 2007-10-17 清华大学 A method and decoding and encoding method for capturing the video difference vector in the multi-video coding process
CN101222639B (en) 2007-01-09 2010-04-21 华为技术有限公司 Inter-view prediction method, encoder and decoder of multi-viewpoint video technology
KR20080066522A (en) 2007-01-11 2008-07-16 삼성전자주식회사 Method and apparatus for encoding and decoding multi-view image
CN101415115B (en) 2007-10-15 2011-02-02 华为技术有限公司 Method for encoding and decoding video based on movement dancing mode, and encoder and decoder thereof
CN105453561B (en) 2013-08-13 2018-10-26 寰发股份有限公司 The method of export acquiescence disparity vector in three-dimensional and multi-view video coding

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110216833A1 (en) * 2008-10-17 2011-09-08 Nokia Corporation Sharing of motion vector in 3d video coding
US20130229485A1 (en) * 2011-08-30 2013-09-05 Nokia Corporation Apparatus, a Method and a Computer Program for Video Coding and Decoding
US20130114724A1 (en) * 2011-11-07 2013-05-09 Canon Kabushiki Kaisha Image encoding method, image encoding apparatus, and related encoding medium, image decoding method, image decoding apparatus, and related decoding medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Tech et al. ("3D-HEVC Test Model 1", ITU-T July 2012) *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150304681A1 (en) * 2012-07-03 2015-10-22 Mediatek Singapore Pte. Ltd. Method and apparatus of inter-view motion vector prediction and disparity vector prediction in 3d video coding
US9621920B2 (en) * 2014-01-02 2017-04-11 Hfi Innovation Inc. Method of three-dimensional and multiview video coding using a disparity vector
US20150189323A1 (en) * 2014-01-02 2015-07-02 Mediatek Singapore Pte. Ltd. Method of Three-Dimensional and Multiview Video Coding Using a Disparity Vector
US10194163B2 (en) 2014-05-22 2019-01-29 Brain Corporation Apparatus and methods for real time estimation of differential motion in live video
US20150339826A1 (en) * 2014-05-22 2015-11-26 Brain Corporation Apparatus and methods for robotic operation using video imagery
US9713982B2 (en) * 2014-05-22 2017-07-25 Brain Corporation Apparatus and methods for robotic operation using video imagery
US9939253B2 (en) 2014-05-22 2018-04-10 Brain Corporation Apparatus and methods for distance estimation using multiple image sensors
US10032280B2 (en) 2014-09-19 2018-07-24 Brain Corporation Apparatus and methods for tracking salient features
US10055850B2 (en) 2014-09-19 2018-08-21 Brain Corporation Salient features tracking apparatus and methods using visual initialization
US10268919B1 (en) 2014-09-19 2019-04-23 Brain Corporation Methods and apparatus for tracking objects using saliency
US10587894B2 (en) * 2014-10-08 2020-03-10 Lg Electronics Inc. Method and device for encoding/decoding 3D video
US10197664B2 (en) 2015-07-20 2019-02-05 Brain Corporation Apparatus and methods for detection of objects using broadband signals
CN113170153A (en) * 2018-11-20 2021-07-23 交互数字Vc控股公司 Initializing current picture reference block vectors based on binary trees
US11979585B2 (en) 2018-11-20 2024-05-07 Interdigital Madison Patent Holdings, Sas Current picture referencing block vector initialization with dual tree

Also Published As

Publication number Publication date
EP3025498A1 (en) 2016-06-01
CA2920413A1 (en) 2015-02-19
CN105453561A (en) 2016-03-30
WO2015021914A1 (en) 2015-02-19
EP3025498A4 (en) 2017-01-04
EP3025498B1 (en) 2019-01-16
CA2920413C (en) 2019-05-14
US10230937B2 (en) 2019-03-12
CN105453561B (en) 2018-10-26

Similar Documents

Publication Publication Date Title
US10230937B2 (en) Method of deriving default disparity vector in 3D and multiview video coding
US10021367B2 (en) Method and apparatus of inter-view candidate derivation for three-dimensional video coding
US20160309186A1 (en) Method of constrain disparity vector derivation in 3d video coding
EP2944087B1 (en) Method of disparity vector derivation in three-dimensional video coding
CA2896905C (en) Method and apparatus of view synthesis prediction in 3d video coding
US20160073132A1 (en) Method of Simplified View Synthesis Prediction in 3D Video Coding
US20140078254A1 (en) Method and Apparatus of Motion and Disparity Vector Prediction and Compensation for 3D Video Coding
US9621920B2 (en) Method of three-dimensional and multiview video coding using a disparity vector
US20150365649A1 (en) Method and Apparatus of Disparity Vector Derivation in 3D Video Coding
US20150304681A1 (en) Method and apparatus of inter-view motion vector prediction and disparity vector prediction in 3d video coding
US10110923B2 (en) Method of reference view selection for 3D video coding
CA2904424C (en) Method and apparatus of camera parameter signaling in 3d video coding
JP2015533038A (en) Method and apparatus for virtual depth value of 3D video encoding
US10075690B2 (en) Method of motion information prediction and inheritance in multi-view and three-dimensional video coding

Legal Events

Date Code Title Description
AS Assignment

Owner name: MEDIATEK INC., TAIWAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:LIN, JIAN-LIANG;ZHANG, NA;CHEN, YI-WEN;AND OTHERS;REEL/FRAME:037609/0629

Effective date: 20160118

AS Assignment

Owner name: HFI INNOVATION INC., TAIWAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MEDIATEK INC.;REEL/FRAME:039609/0864

Effective date: 20160628

STCF Information on status: patent grant

Free format text: PATENTED CASE

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 4