WO2014051133A1 - Method for processing one or more videos of a 3d-scene - Google Patents
Method for processing one or more videos of a 3d-scene Download PDFInfo
- Publication number
- WO2014051133A1 WO2014051133A1 PCT/JP2013/076448 JP2013076448W WO2014051133A1 WO 2014051133 A1 WO2014051133 A1 WO 2014051133A1 JP 2013076448 W JP2013076448 W JP 2013076448W WO 2014051133 A1 WO2014051133 A1 WO 2014051133A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- block
- depth
- motion
- current
- neighboring blocks
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 42
- 239000013598 vector Substances 0.000 claims abstract description 73
- 208000037170 Delayed Emergence from Anesthesia Diseases 0.000 claims description 2
- 230000002194 synthesizing effect Effects 0.000 claims 4
- 230000015572 biosynthetic process Effects 0.000 abstract description 12
- 238000003786 synthesis reaction Methods 0.000 abstract description 12
- 238000013507 mapping Methods 0.000 abstract 1
- 230000002123 temporal effect Effects 0.000 description 12
- 238000010586 diagram Methods 0.000 description 7
- 230000008569 process Effects 0.000 description 6
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 238000009795 derivation Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000008570 general process Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
- H04N13/10—Processing, recording or transmission of stereoscopic or multi-view image signals
- H04N13/106—Processing image signals
- H04N13/161—Encoding, multiplexing or demultiplexing different image signal components
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/56—Motion estimation with initialisation of the vector search, e.g. estimating a good candidate to initiate a search
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/597—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
- H04N2013/0074—Stereoscopic image analysis
- H04N2013/0081—Depth or disparity estimation from stereoscopic image signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
- H04N2013/0074—Stereoscopic image analysis
- H04N2013/0085—Motion estimation from stereoscopic image signals
Definitions
- This invention relates generally to coding multiview videos, and more particularly to coding 3D multiview videos for predictive compression.
- Multiview video coding is essential for applications such as three dimensional television (3DTV), free viewpoint television (FTV), and multi- camera surveillance. Multiview video coding is also known as dynamic light field compression. As used herein, coding can include encoding, decoding or both in a codec, for example.
- VSP view synthesis prediction
- motion information from neighboring blocks is used to derive a motion vector.
- the derived motion vector is then used as a motion vector predictor (MVP) to predict the motion vector for the current block.
- MVP motion vector predictor
- MVP motion vector difference
- Fig. 1 shows a conventional method to code a current block.
- Step 110 derives a motion or disparity vector from a neighboring block, referred to as MotionDerive.
- Step 120 determines a motion or disparity vector, referred to as MotionCurrent, for the current block by applying motion estimation
- Fig. 2 shows the corresponding prior-art encoder.
- Element 201 shows blocks in a portion of a picture.
- a current block is denoted by a star "*"
- a neighboring block is denoted by a dot
- derive 202 a motion vector or disparity vector.
- the derived motion or disparity vector from 202 serves as motion vector predictor (MVP) 203.
- MVP motion vector predictor
- motion estimation is performed 205 for the current block to produce a motion vector (MotionCurrent) 206 for the current block.
- a motion vector difference (MVD) 208 is obtained, which is encoded 209 into the bitstream 210.
- Another output from motion estimation 205 is the reference picture, which serves as texture predictor 211.
- the texture residual 213 is obtained by performing 212 texture prediction based on the texture predictor 211 and the current picture 215.
- the texture residual 213 is also encoded 214 as part of the bitstream.
- Fig. 3 shows the decoder. From the neighboring blocks as shown in element 301, derive 302 a motion vector or disparity vector. The derived motion or disparity vector serves as the motion vector predictor (MVP) 303.
- MVP motion vector predictor
- motion vector difference (MVD) 308 is decoded 309 and fed to an adder 307.
- the motion vector predictor 303 and motion vector difference 308 are added 307, and the motion vector used for the current block MotionCurrent 306 is then obtained.
- the texture residual picture 313 is decoded 314.
- the current motion vector 306 and the texture residual picture are inputs to the motion compensation module 305. Together with texture reference buffer 304, the motion compensation is performed, and finally the decoded picture is outputted 315.
- VSP view synthesis prediction
- Conventional, forward warping based view synthesis prediction typically uses the texture and depth components from a reference view.
- forward warping normally provides a high fidelity
- a major disadvantage of forward warping based VSP is a significant increase in the complexity of the codec.
- backward warping based view synthesis prediction (B-VSP), as disclosed herein, is less complex. Therefore, backward warping based VSP is desired at certain applications. In addition, backward warping based VSP facilitates block based VSP reference generation.
- depth-first coding prevents the use of any depth coding tools that need to access the corresponding texture component.
- the embodiments use a motion or disparity vector from neighboring blocks to derive a depth block from previously coded depth images, such as those depth images coded in the previous access unit, or the same access unit but from other viewpoints.
- the derived depth block is assumed as an estimated depth block for the current block to be coded, which is finally used for backward warping based VSP.
- forward warping based view synthesis prediction typically uses the texture and depth components from a reference view.
- the forward warping normally provides a high fidelity at a higher cost.
- our view synthesis prediction uses backward warping for texture-coding, and does not require the depth component to be coded before the texture component.
- backward warping based VSP facilitates block based VSP reference generation, which is less complex and has implementation advantages.
- the invention uses a motion or disparity vector from neighboring blocks to derive a depth block from previously coded depth images, such as those depth images coded in the previous access unit, or the same access unit but from other viewpoints.
- Fig. 1 is a flowchart of prior-art method that uses neighboring motion information to code motion information of -a current block;
- Fig. 2 is a block diagram of prior-art encoder
- Fig. 3 is a block diagram of prior-art decoder
- Fig. 4 is a flowchart of a method that uses neighboring motion or disparity information to code a current block with backward VSP prediction according to embodiments of the invention
- FIG. 5A is a block diagram of an encoder using spatial prediction according to embodiments of the invention.
- FIG. 5B is a block diagram of an encoder using spatial prediction according to embodiments of the invention.
- Fig. 5B is a block diagram of an encoder using temporal prediction according to embodiments of the invention.
- Fig. 6A is a block diagram of a decoder using spatial prediction according to embodiments of the invention.
- Fig. 6B is a block diagram of a decoder using temporal prediction according to embodiments of the invention.
- Fig. 7 is a flowchart of an encoding method according to embodiments of the invention.
- Fig. 8 is a flowchart of a decoding method according to embodiments of the invention.
- Fig. 9 is an example to estimate the depth values from neighboring blocks according to embodiments of the invention.
- Fig. 10 is another example to estimate the depth values from
- Fig. 11 is a flowchart showing the decoding process of a block in a depth anchor picture using the B-VSP mode according to embodiments of the invention.
- the embodiments of our invention provide a method and system for generating a view synthesis prediction (VSP) reference picture based on a backward warping technique.
- the method does not use depth-first coding.
- the method uses motion or disparity information from neighboring blocks to a current to derive a depth block for the current block to be coded. Then, the derived depth block is used to perform backward warping.
- view synthesis generates a synthetic video from one or more videos of a scene, in which each video of a 3D scene is acquired by a corresponding camera arranged at a particular pose, and in which a view of each camera overlaps with the view of at least one other camera.
- the embodiments are characterized in deriving a depth block and forming a sample block to predict the current block based on a derived depth block.
- the embodiments disclose different ways to use the motion or disparity information from the neighboring blocks. Specifically, the derived motion or disparity information is directly used to derive a depth block, DepthDerive, which is further used to form a predictor for the sample values in the current block.
- Fig. 4 shows a method for coding according to embodiments of the invention. The steps are performed for each current block 401.
- Step 410 obtaining motion or disparity vector from neighboring blocks.
- Step 420 determining a depth block based on a corresponding reference depth image and the motion or disparity vector.
- Step 430 generating a prediction block based on the depth values using backward warping
- Step 440 performing predictive coding for the current block using the prediction block.
- Fig. 5A shows a comparable encoder block diagram. From the neighboring blocks shown in element 501, derive 502 a motion vector or disparity vector MotionDerive (MVP) 503. Optionally, MotionDerive or side information that would assist the decoder to determine MotionDerive can be coded and signaled 550, and is part of the coded bitstream 510.
- MVP disparity vector MotionDerive
- a depth block DepthDerive 505 is estimated 504 for the current block.
- a backward warping is performed 506 to obtain a texture predictor 507.
- texture prediction 508 obtains the texture residual 509 based on the texture predictor 507 and the current picture 515, which is encoded 511 into the bitstream.
- Fig. 6A shows the decoder. From the coded bitstream 610, side
- a depth block DepthDerive 605 is estimated 604 for the current block.
- DepthDerive 605 a backward warping 612 is performed 606 to obtain a texture predictor 607.
- the texture residual 609 is decoded 613 to obtain the texture predictor 607 and texture residual 609 for texture prediction 608 to output the decoded texture picture 615.
- the technique used by the embodiments can be referred to as depth compensated prediction, in contrast to motion or disparity compensated prediction, as in the prior art.
- Embodiment 1 anchor picture, depth from reference views are available
- texture and depth components from a reference picture are both coded before the current texture and depth.
- the coding order for the two view case is T0D0T1D1, where T and D
- 0 and 1 represent two views. Note, the texture and depth from view 0 are coded before the texture and depth for view 1.
- all blocks are either coded in Intra modes or Interview prediction modes.
- there are three neighboring blocks for the current block which are at the left (A), top (B) and top-right (C). The description can be easily expanded to use additional neighboring blocks.
- each block is associated with a disparity vector, denoted by Disparity A, DisparityB and DisparityC.
- a disparity vector for the current block can be derived from the neighboring blocks, denoted by Disaprity Derive.
- DisparityDerive Mean (Disparity, DisparityB, DisparityC).
- the derivation can be signaled as for merging a candidate list and index as defined in the emerging HEVC standard. That is, we construct a candidate list from the neighboring blocks. Then, an index is signaled in the bitstream to indicate the specific candidate used for
- a depth block DepthDerive in the depth image from the reference view can be identified and accessed. We use DepthDerive to do the backward warping for the current block.
- the general process uses the neighboring blocks are described as above when the neighboring blocks are all coded using Inter-view prediction modes. Below, we describe the cases when a neighboring block is not coded in any Inter-view prediction modes.
- the corresponding disparity vector is zero.
- a disparity vector can be determined from the depth block used for its VSP prediction.
- the disparity vector can be the average value of the entire depth block, or giving more weights to the depth pixels that are nearer to the current block.
- disparity vector DisparityDerive we first derive the disparity vector DisparityDerive and then use a single disparity to access the depth block DepthDerive.
- the multiple derived depth blocks are determined according to a weighted average to obtain a single depth block to be used for the backward warping.
- Fig. 7 shows the encoding process for a block in an anchor picture using the B-VSP as a candidate coding mode.
- Step 701 Let A, B and C be the neighboring blocks, DisparityX be their disparity vector (with X replaced by A, B or C).
- Step 702 If any block in A, B and C are coded in Intra mode, set its disparity vector to zero.
- Step 703 If any block in A, B and C are coded in VSP prediction modes, then set its disparity vector as the mean value of the depth block used for its VSP prediction.
- Step 704 Set DispairityDerive to be the mean of the DisparityX.
- Step 705 Obtain the depth block in the corresponding depth image, DepthDerive.
- Step 706 Use DepthDerive to perform backward warping based VSP.
- Step 707 The B-VSP mode is compared to other coding modes and is selected if the rate distortion cost is less.
- Fig. 8 is the decoding process of a block in an anchor picture that uses the proposed B-VSP mode.
- Step 801 Check if B-VSP mode is signaled for the current block. If yes, go to Step 802. Otherwise, go to Step 808.
- Step 802 Let A, B and C be the neighboring blocks, DisparityX be their disparity vector (with X replaced by A, B or C).
- Step 803 If any block in A, B and C are coded in Intra mode, then set the disparity vector to zero.
- Step 804 If any block in A, B and C are coded in VSP prediction modes, then set its disparity vector as the mean value of the depth block used for its VSP prediction.
- Step 805 Set DispairityDerive to be the mean of the DisparityX.
- Step 806 Obtain the depth block DepthDerive in the corresponding depth image.
- Step 807 Use DepthDerive to perform backward warping based VSP. Go to Step 809.
- Step 808 Decode the current block in other coding modes.
- Step 809 End of the procedure.
- Embodiment 2 anchor or non-anchor picture, depth from reference views are not available
- the texture components in an access unit are coded before all the depth components.
- the coding order for the two view case is T0T1D0D1, where both the texture from view 0 and view 1 are coded before depth.
- This embodiment can also apply when there is no texture dependency on the coding of depth.
- the disparity DisparityDerive can be derived from the neighboring blocks.
- a per-pixel depth block cannot be accessed.
- a linear interpolation is being used. Other interpolations can also be used.
- Fig. 9 shows an example interpolation, where a single depth value (X, Y and Z) for a neighboring block (A, B and C) are set from their disparity value DisparityA, DisparityB and DisparityC.
- the depth values for the current 4x4 block are interpolated as follows:
- a, b, c, d, e, f, and g are pixels in the current block.
- the depth value Z from block C is not used.
- the depth value that is nearer to the pixel to be interpolated is given more weights. Other interpolation methods are possible to fill the depth block.
- the depth value from a neighboring block can vary when the block is coded using B-VSP modes. In that case, the interpolation process for each pixel can be different.
- the depth value of a pixel at row r and column c, (r, c) can be interpolated as
- Depth(r, c) (Xc + Yr) / 2.
- Embodiment 3 non-anchor, depth from reference views are available
- the neighboring blocks use temporal prediction, other than inter-view prediction modes. However, it is not necessary to differentiate temporal prediction and inter-view prediction. Instead, in this embodiment, the associated reference texture of the motion or disparity vector has an available corresponding depth reference picture.
- the motion or disparity vectors from neighboring blocks are used to derive a motion or disparity vector, which is used to identify and access a depth block from the corresponding reference depth image.
- a zero disparity vector relative to the inter-view reference is used, rather than a zero motion vector relative to the temporal reference, when there is no mechanism in the bitstream to signal the neighboring block to be used to derive the motion or disparity vector. Otherwise, when there is a candidate list, an index mechanism is implemented. We add both zero disparity vector and a zero motion vector to the candidate list when there is an Intra coded neighbor block. Then, the encoder selects the optimal candidate based on the rate distortion cost.
- Embodiment 4 backward warping based VSP for depth image
- the above embodiments are also applicable to code the depth picture from a dependent view. It is impossible to implement a backward warping based VSP prediction for depth in the conventional VSP because the depth image is required to do backward warping. Our embodiments do not require the current block.
- the motion or disparity vectors of the neighboring blocks are utilized to get a depth block to do backward VSP, we use the sample values of the neighboring block to estimate a depth block. Because the current picture is a depth image, the neighboring blocks have per- pixel depths. It is more accurate to use the sample values, instead of the block level motion or disparity vector of those blocks to determine out the depth block to do B-VSP.
- Fig. 11 shows the corresponding decoding. The difference from the embodiment shown in Fig. 8 is that the motion or disparity vectors from neighboring blocks are not used.
- Step 1101 Check if B-VSP mode is signaled for the current block. If yes, go to Step 1102. Otherwise, go to Step 1104.
- Step 1102 Use the neighboring block (which are depth blocks) to estimate the depth DepthDerive of the current block.
- Step 1103 Use DepthDerive to perform backward warping based VSP. Go to Step 1105.
- Step 1104 Decode the current block in other coding modes.
- Step 1105 End of the procedure.
- Embodiment 5 warping in motion domain
- the warping was performed in the pixel or spatial domain.
- predictor for the texture of pixels in the current view are derived by warping the pixel values from the reference view to the current view, which is done in blocks 506 and 611 of Fig. 5 A and Fig. 6 A, respectively.
- the warping on the motion field can be performed at the pixel level, or the block level, similar to the texture warping process.
- the largest warping unit is at the prediction unit (PU) level.
- a maximal depth from the derived depth block is converted to the disparity vector, and finally the derived disparity vector are assumed as a translation offset for the current block to access the motion information.
- disparity vector derivation by referencing a depth block in depth reference picture can be further extended to other regular spatial or temporal neighboring blocks, as long as a depth block can be accessed from the depth image.
- the warping can be in the spatial domain as described in the related Application, the temporal domain as described in this Application, or in both the spatial and the temporal domains.
- backward warping based VSP facilitates block based VSP reference generation, which is less complex and has implementation advantages.
- the invention uses a motion or disparity vector from neighboring blocks to derive a depth block from previously coded depth images, such as those depth images coded in the previous access unit, or the same access unit but from other viewpoints.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
- Studio Devices (AREA)
Abstract
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP13774831.5A EP2901695B1 (en) | 2012-09-28 | 2013-09-20 | Method for processing one or more videos of a 3d-scene |
CN201380050841.7A CN104685882B (en) | 2012-09-28 | 2013-09-20 | For handling the method for one or more videos of 3D scenes |
KR1020157010892A KR101653118B1 (en) | 2012-09-28 | 2013-09-20 | Method for processing one or more videos of a 3d-scene |
JP2015506961A JP5976197B2 (en) | 2012-09-28 | 2013-09-20 | Method for processing one or more videos of a 3D scene |
ES13774831.5T ES2606161T3 (en) | 2012-09-28 | 2013-09-20 | Method to process one or more videos of a 3D scene |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/631,852 | 2012-09-28 | ||
US13/631,852 US9264691B2 (en) | 2012-09-28 | 2012-09-28 | Method and system for backward 3D-view synthesis prediction using neighboring blocks |
US13/742,461 US9253486B2 (en) | 2012-09-28 | 2013-01-16 | Method and system for motion field backward warping using neighboring blocks in videos |
US13/742,461 | 2013-01-16 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014051133A1 true WO2014051133A1 (en) | 2014-04-03 |
Family
ID=49328602
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2013/076448 WO2014051133A1 (en) | 2012-09-28 | 2013-09-20 | Method for processing one or more videos of a 3d-scene |
Country Status (8)
Country | Link |
---|---|
US (1) | US9253486B2 (en) |
EP (1) | EP2901695B1 (en) |
JP (1) | JP5976197B2 (en) |
KR (1) | KR101653118B1 (en) |
CN (1) | CN104685882B (en) |
ES (1) | ES2606161T3 (en) |
PL (1) | PL2901695T3 (en) |
WO (1) | WO2014051133A1 (en) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014000664A1 (en) * | 2012-06-28 | 2014-01-03 | Mediatek Inc. | Method and apparatus of disparity vector derivation in 3d video coding |
WO2014005548A1 (en) * | 2012-07-05 | 2014-01-09 | Mediatek Inc. | Method and apparatus of unified disparity vector derivation for 3d video coding |
US9781416B2 (en) * | 2013-02-26 | 2017-10-03 | Qualcomm Incorporated | Neighboring block disparity vector derivation in 3D video coding |
FR3002716A1 (en) * | 2013-02-26 | 2014-08-29 | France Telecom | DERIVATION OF MOTION VECTOR OF DISPARITY, 3D VIDEO CODING AND DECODING USING SUCH DERIVATION |
WO2014166068A1 (en) * | 2013-04-09 | 2014-10-16 | Mediatek Inc. | Refinement of view synthesis prediction for 3-d video coding |
US9800895B2 (en) * | 2013-06-27 | 2017-10-24 | Qualcomm Incorporated | Depth oriented inter-view motion vector prediction |
KR101770211B1 (en) * | 2013-07-18 | 2017-09-05 | 엘지전자 주식회사 | Method and apparatus for processing video signal |
US10911779B2 (en) * | 2013-10-17 | 2021-02-02 | Nippon Telegraph And Telephone Corporation | Moving image encoding and decoding method, and non-transitory computer-readable media that code moving image for each of prediction regions that are obtained by dividing coding target region while performing prediction between different views |
TWI517096B (en) * | 2015-01-12 | 2016-01-11 | 國立交通大學 | Backward depth mapping method for stereoscopic image synthesis |
CN107770511A (en) * | 2016-08-15 | 2018-03-06 | ***通信集团山东有限公司 | A kind of decoding method of multi-view point video, device and relevant device |
GB2553315A (en) | 2016-09-01 | 2018-03-07 | Nokia Technologies Oy | Determining inter-view prediction areas |
CN107027025B (en) * | 2017-03-28 | 2019-06-28 | 清华大学深圳研究生院 | A kind of light field image compression method based on macro block of pixels adaptive prediction |
US10244164B1 (en) | 2017-09-11 | 2019-03-26 | Qualcomm Incorporated | Systems and methods for image stitching |
US11348265B1 (en) | 2017-09-15 | 2022-05-31 | Snap Inc. | Computing a point cloud from stitched images |
KR102155711B1 (en) * | 2018-11-22 | 2020-09-14 | 한양대학교 산학협력단 | Method and apparatus for performing fast motion estimation on point cloud data |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1913633B (en) * | 2001-11-06 | 2011-06-01 | 松下电器产业株式会社 | Moving picture decoding method |
US7728877B2 (en) * | 2004-12-17 | 2010-06-01 | Mitsubishi Electric Research Laboratories, Inc. | Method and system for synthesizing multiview videos |
US8284237B2 (en) * | 2009-09-09 | 2012-10-09 | Nokia Corporation | Rendering multiview content in a 3D video system |
EP2559256A4 (en) * | 2010-04-14 | 2015-08-26 | Ericsson Telefon Ab L M | Methods and arrangements for 3d scene representation |
CN101917619B (en) * | 2010-08-20 | 2012-05-09 | 浙江大学 | Quick motion estimation method of multi-view video coding |
US9247249B2 (en) * | 2011-04-20 | 2016-01-26 | Qualcomm Incorporated | Motion vector prediction in video coding |
CN104662910B (en) * | 2012-09-21 | 2018-04-10 | 寰发股份有限公司 | The method and apparatus of virtual depth value in 3D Video codings |
-
2013
- 2013-01-16 US US13/742,461 patent/US9253486B2/en active Active
- 2013-09-20 PL PL13774831T patent/PL2901695T3/en unknown
- 2013-09-20 EP EP13774831.5A patent/EP2901695B1/en active Active
- 2013-09-20 KR KR1020157010892A patent/KR101653118B1/en active IP Right Grant
- 2013-09-20 CN CN201380050841.7A patent/CN104685882B/en active Active
- 2013-09-20 WO PCT/JP2013/076448 patent/WO2014051133A1/en active Application Filing
- 2013-09-20 JP JP2015506961A patent/JP5976197B2/en active Active
- 2013-09-20 ES ES13774831.5T patent/ES2606161T3/en active Active
Non-Patent Citations (6)
Title |
---|
CHI-LING WU ET AL: "3D-CE1.a related: Interview Skip mode with sub-partition scheme", 1. JCT-3V MEETING; 101. MPEG MEETING; 16-7-2012 - 20-7-2012; STOCKHOLM; (THE JOINT COLLABORATIVE TEAM ON 3D VIDEO CODING EXTENSION DEVELOPMENT OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ); URL: HTTP://PHENIX.INT-EVRY.FR/JCT2/,, no. JCT3V-A0050, 10 July 2012 (2012-07-10), XP030130049 * |
DONG TIAN ET AL: "Mitsubishi Response to MPEG Call for Proposal on 3D Video Coding Technology", 98. MPEG MEETING; 28-11-2011 - 2-12-2011; GENEVA; (MOTION PICTURE EXPERT GROUP OR ISO/IEC JTC1/SC29/WG11),, no. m22663, 24 November 2011 (2011-11-24), XP030051226 * |
FENG ZOU ET AL: "3D-CE1.a: Cross check on Nokia Proposal on Block based View Synthesis Prediction JCT2-A0107", 1. JCT-3V MEETING; 101. MPEG MEETING; 16-7-2012 - 20-7-2012; STOCKHOLM; (THE JOINT COLLABORATIVE TEAM ON 3D VIDEO CODING EXTENSION DEVELOPMENT OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ); URL: HTTP://PHENIX.INT-EVRY.FR/JCT2/,, no. JCT3V-A0019, 10 July 2012 (2012-07-10), XP030130018 * |
KIRAN NANJUNDA IYER ET AL: "Multiview video coding using depth based 3D warping", MULTIMEDIA AND EXPO (ICME), 2010 IEEE INTERNATIONAL CONFERENCE ON, IEEE, PISCATAWAY, NJ, USA, 19 July 2010 (2010-07-19), pages 1108 - 1113, XP031761757, ISBN: 978-1-4244-7491-2 * |
TIAN D ET AL: "CE1.h: Backward View Synthesis Prediction using Neighbouring Blocks", 3. JCT-3V MEETING; 103. MPEG MEETING; 17-1-2013 - 23-1-2013; GENEVA; (THE JOINT COLLABORATIVE TEAM ON 3D VIDEO CODING EXTENSION DEVELOPMENT OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ); URL: HTTP://PHENIX.INT-EVRY.FR/JCT2/,, no. JCT3V-C0152, 10 January 2013 (2013-01-10), XP030130568 * |
WENYI SU (USTC) ET AL: "3DV-CE1.a: Block-based View Synthesis Prediction for 3DV-ATM", 1. JCT-3V MEETING; 101. MPEG MEETING; 16-7-2012 - 20-7-2012; STOCKHOLM; (THE JOINT COLLABORATIVE TEAM ON 3D VIDEO CODING EXTENSION DEVELOPMENT OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ); URL: HTTP://PHENIX.INT-EVRY.FR/JCT2/,, no. JCT3V-A0107, 10 July 2012 (2012-07-10), XP030130106 * |
Also Published As
Publication number | Publication date |
---|---|
KR20150063124A (en) | 2015-06-08 |
US20140092210A1 (en) | 2014-04-03 |
CN104685882B (en) | 2018-05-18 |
EP2901695B1 (en) | 2016-11-02 |
JP5976197B2 (en) | 2016-08-23 |
US9253486B2 (en) | 2016-02-02 |
CN104685882A (en) | 2015-06-03 |
EP2901695A1 (en) | 2015-08-05 |
ES2606161T3 (en) | 2017-03-23 |
JP2015528649A (en) | 2015-09-28 |
PL2901695T3 (en) | 2017-02-28 |
KR101653118B1 (en) | 2016-08-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2901695B1 (en) | Method for processing one or more videos of a 3d-scene | |
US9264691B2 (en) | Method and system for backward 3D-view synthesis prediction using neighboring blocks | |
JP6633694B2 (en) | Multi-view signal codec | |
JP5970609B2 (en) | Method and apparatus for unified disparity vector derivation in 3D video coding | |
US9819959B2 (en) | Method and apparatus for residual prediction in three-dimensional video coding | |
US10085039B2 (en) | Method and apparatus of virtual depth values in 3D video coding | |
US20150172714A1 (en) | METHOD AND APPARATUS of INTER-VIEW SUB-PARTITION PREDICTION in 3D VIDEO CODING | |
US20150365649A1 (en) | Method and Apparatus of Disparity Vector Derivation in 3D Video Coding | |
JP6042556B2 (en) | Method and apparatus for constrained disparity vector derivation in 3D video coding | |
US20150382019A1 (en) | Method and Apparatus of View Synthesis Prediction in 3D Video Coding | |
KR20060108952A (en) | Method for coding and inter-view balanced disparity estimation in multiview animation coding/decoding system | |
KR20130103568A (en) | Multiview image encodng method, multiview image decodng method, multiview image encodlng device, multiview lnlage decodlng device, and programs of same | |
EP1929783A1 (en) | Method of estimating disparity vector using camera parameters, apparatus for encoding and decoding multi-view picture using the disparity vectors estimation method, and computer-redadable recording medium storing a program for executing the method | |
WO2012177063A2 (en) | Encoding/decoding method and apparatus using a skip mode | |
Zou et al. | View synthesis prediction using skip and merge candidates for HEVC-based 3D video coding | |
CN105144714A (en) | Method and apparatus of disparity vector derivation in 3d video coding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13774831 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2015506961 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
REEP | Request for entry into the european phase |
Ref document number: 2013774831 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2013774831 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 20157010892 Country of ref document: KR Kind code of ref document: A |