CN102137259A - Method and device for coding two paths of three-dimensional videos - Google Patents
Method and device for coding two paths of three-dimensional videos Download PDFInfo
- Publication number
- CN102137259A CN102137259A CN 201110091938 CN201110091938A CN102137259A CN 102137259 A CN102137259 A CN 102137259A CN 201110091938 CN201110091938 CN 201110091938 CN 201110091938 A CN201110091938 A CN 201110091938A CN 102137259 A CN102137259 A CN 102137259A
- Authority
- CN
- China
- Prior art keywords
- sampling
- coding
- sequence
- way
- oeuveres
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 25
- 238000005457 optimization Methods 0.000 claims abstract description 13
- 238000005070 sampling Methods 0.000 claims description 52
- 230000008569 process Effects 0.000 claims description 7
- 230000000694 effects Effects 0.000 abstract description 4
- 238000005259 measurement Methods 0.000 abstract 1
- 230000008901 benefit Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 208000003164 Diplopia Diseases 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000000205 computational method Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 208000029444 double vision Diseases 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Landscapes
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
The invention provides a method and device for coding two paths of three-dimensional videos, wherein the method comprises the following steps of: splicing two paths of three-dimensional videos to obtain a binary syllabification sequence; in the term of the two paths of three-dimensional videos, carrying out rate distortion optimization coding on the binary syllabification sequence; and reconfiguring the coded binary syllabification sequence to obtain a coding result of each path of video. Through changing a distortion measurement selected by a macro block mode, the rate distortion performance of coding the two paths of three-dimensional videos is improved, and an optimal image coding effect is obtained.
Description
Technical field
The present invention relates to the three-dimensional video-frequency processing technology field, particularly a kind of coding method of two-way three-dimensional video-frequency and device.
Background technology
In recent years, along with popularizing that 3 D stereo is used, the research of stereoscopic image and video begins to become focus, and the application relevant with image and video also constantly widened, for example digital television broadcasting, video request program, long-distance education and medical treatment, wireless multimedia communication etc.Because the data volume of original image and video is very big, and bandwidth can not unconfinedly increase, and finishes the image transmission of big data quantity in order to utilize limited bandwidth, needs to use the effective video compress technique.
A kind of double vision point splicing of three-dimensional video-frequency has been proposed at present.Compare two-path video and transmit respectively, the encoding scheme of three-dimensional video-frequency splicing only need be transmitted half or data still less, therefore can reduce transmission bandwidth and reduce decoder complexity.In addition, spliced image can utilize traditional single channel encoder directly to compress, and is compatible strong.
The shortcoming of the coding method of existing Two bors d's oeuveres three-dimensional video-frequency is that the framework that adopts traditional single view to encode is handled, and does not consider the characteristics of the Two bors d's oeuveres attribute of signal source, therefore can't obtain optimum coding effect.
Summary of the invention
Purpose of the present invention is intended to solve at least one of above-mentioned technological deficiency.
For achieving the above object, one aspect of the present invention proposes a kind of coding method of two-way three-dimensional video-frequency, may further comprise the steps: A: the two-way three-dimensional video-frequency is spliced to obtain the Two bors d's oeuveres sequence; B: with described two-way three-dimensional video-frequency is reference, and described Two bors d's oeuveres sequence is carried out the rate-distortion optimization coding; And C: the Two bors d's oeuveres sequence of having encoded is reconstructed to obtain the coding result of each road video.
In one embodiment of the invention, described steps A further comprises: respectively described two-way video is carried out every row ground or interlacing ground down-sampling; And the two-path video behind the described down-sampling spliced to obtain the Two bors d's oeuveres sequence.
In one embodiment of the invention, described step B further comprises: B1: the macro block in the described Two bors d's oeuveres sequence is carried out up-sampling; B2: obtain the correspondence image zone of macro block in described two-way three-dimensional video-frequency behind the described up-sampling; B3: the coding distortion between the macro block behind the calculating up-sampling and the correspondence image zone of described two-way three-dimensional video-frequency; And B4:, determine described macroblock encoding pattern according to described coding distortion.
In one embodiment of the invention, described macro block is being carried out in the process of up-sampling, keeping the width of described macro block or highly constant.
In one embodiment of the invention, according to described coding distortion and formula min{J (λ)=D (x)+λ R (x) }, determine described macroblock encoding pattern, wherein, D (x) is described coding distortion, and R (x) is a code check, and λ is the Lagrangian factor.
The present invention also proposes a kind of code device of two-way three-dimensional video-frequency on the other hand, comprising: concatenation module is used for the two-way three-dimensional video-frequency is spliced to obtain the Two bors d's oeuveres sequence; Coding module, being used for described two-way three-dimensional video-frequency is reference, and described Two bors d's oeuveres sequence is carried out the rate-distortion optimization coding; And reconstructed module, be used for the Two bors d's oeuveres sequence of having encoded is reconstructed to obtain the coding result of each road video.
In one embodiment of the invention, described concatenation module further comprises: downsampling unit is used for respectively described two-way video being carried out every row ground or interlacing ground down-sampling; And concatenation unit, be used for the two-path video behind the described down-sampling is spliced to obtain the Two bors d's oeuveres sequence.
In one embodiment of the invention, described coding module further comprises: the up-sampling unit is used for the macro block of described Two bors d's oeuveres sequence is carried out up-sampling; Search unit is used for obtaining macro block behind the described up-sampling in the correspondence image zone of described two-way three-dimensional video-frequency; And computing unit, be used to calculate the coding distortion between the correspondence image zone of macro block behind the up-sampling and described two-way three-dimensional video-frequency, and determine described macroblock encoding pattern according to described coding distortion.
The present invention has improved the distortion performance of two-way stereo scopic video coding by changing the distortion metrics of Macroblock Mode Selection, obtains more excellent image encoding effect.
Aspect that the present invention adds and advantage part in the following description provide, and part will become obviously from the following description, or recognize by practice of the present invention.
Description of drawings
Above-mentioned and/or additional aspect of the present invention and advantage are from obviously and easily understanding becoming the description of embodiment below in conjunction with accompanying drawing, wherein:
Fig. 1 is the flow chart of coding method of the two-way three-dimensional video-frequency of the embodiment of the invention;
Fig. 2 is the flow chart of the rate-distortion optimization algorithm of the embodiment of the invention;
Fig. 3 is the schematic diagram of code device of the two-way three-dimensional video-frequency of the embodiment of the invention;
Fig. 4 is the structural representation of the concatenation module of one embodiment of the invention; And
Fig. 5 is the structural representation of the coding module of one embodiment of the invention.
Embodiment
Describe embodiments of the invention below in detail, the example of described embodiment is shown in the drawings, and wherein identical from start to finish or similar label is represented identical or similar elements or the element with identical or similar functions.Below by the embodiment that is described with reference to the drawings is exemplary, only is used to explain the present invention, and can not be interpreted as limitation of the present invention.
Be illustrated in figure 1 as the flow chart of coding method of the two-way three-dimensional video-frequency of the embodiment of the invention, this method may further comprise the steps:
Step S101 splices to obtain the Two bors d's oeuveres sequence the two-way three-dimensional video-frequency.
Particularly, at first the two-way three-dimensional video-frequency is carried out interlacing ground respectively or every row ground down-sampling, for example, carries out the down-sampling of odd column or even number line sampling respectively; Then the two-path video sequence behind the down-sampling is spliced, form the Two bors d's oeuveres sequence.
Step S102 is reference with the two-way three-dimensional video-frequency, and the Two bors d's oeuveres sequence is carried out the rate-distortion optimization coding.
Under traditional hybrid encoding frame, if will under the condition of minimum distortion, keep code check R to be no more than maximal rate R
Max, then need to select best coding parameter to reach best picture quality.
Rate-distortion optimization algorithm in the embodiment of the invention is considered the influence of the up-sampling reconstruct (introducing after a while) of the down-sampling of pretreatment stage (being step S101) and post-processed to video coding performance in step S103, traditional encryption algorithm is improved.
Be illustrated in figure 2 as the flow chart of the rate-distortion optimization algorithm of the embodiment of the invention, may further comprise the steps particularly:
Step S201 carries out up-sampling to the macro block in the Two bors d's oeuveres sequence.
Particularly, in the up-sampling process, keep the height of macro block constant, double width or keep width constant, highly double, for example the macro block up-sampling with 8*8 is 16*8 or 8*16.
Step S202 obtains the correspondence image zone of macro block in the two-way three-dimensional video-frequency behind the up-sampling.
The coordinate of reference macroblock top left corner pixel (x, y), if the maintenance width is constant in the up-sampling process, highly become twice, the then position (x of its corresponding original image
0, y
0) be:
Wherein, w is the width of video sequence;
If in the up-sampling process, keep highly constant, width change twice, the then position (x of its corresponding original image
0, y
0) be:
Wherein, h is the height of video sequence.
Step S203, the coding distortion D (x) between the macro block behind the calculating up-sampling and the correspondence image zone of two-way three-dimensional video-frequency.
In this step, calculate behind the up-sampling macro block and the coding distortion between the correspondence image zone in original left view or the right view, concrete computational process is identical with the computational methods of prior art, herein for simplicity, repeats no more.
Step S204 according to coding distortion D (x), determines the macroblock encoding pattern.
In coding distortion D (x) substitution rate distortion function J (λ)=D (x)+λ R (x), carry out lagrangian optimization, thereby determine the pattern of macro block, wherein, λ is the Lagrangian factor.
This mode of sampling, the problem of model selection just is converted in the span of λ, and the Lagrangian optimization of sampling finds and satisfies J (λ) minimum (R, D) point.
Then, according to the coding mode of determining by above-mentioned rate-distortion optimization algorithm the Two bors d's oeuveres sequence is encoded.
Step S103 is reconstructed to obtain the coding result of each road video the Two bors d's oeuveres sequence of having encoded.
Sequence behind the coding is split into two-way down-sampling reproducing sequence, then, by with step S102 in identical top sampling method, be original resolution with the down-sampling reproducing sequence interpolation of two-way, produce final left and right sides view coding then.
For realizing the foregoing description, the present invention also proposes a kind of code device of two-way three-dimensional video-frequency.Be illustrated in figure 3 as the structural representation of code device of the two-way three-dimensional video-frequency of the embodiment of the invention, this code device comprises: concatenation module 100, coding module 200 and reconstructed module 300.
Wherein, concatenation module 100 is used for the two-way three-dimensional video-frequency is spliced to obtain the Two bors d's oeuveres sequence.It is reference that coding module 200 is used for the two-way three-dimensional video-frequency, and the Two bors d's oeuveres sequence is carried out the rate-distortion optimization coding.Reconstructed module 300 is used for the Two bors d's oeuveres sequence of having encoded is reconstructed to obtain the coding result of each road video.
Particularly, as shown in Figure 4, concatenation module 100 can comprise downsampling unit 110 and concatenation unit 120.Downsampling unit 110 is used for respectively every road of two-way video being carried out every row ground or interlacing ground down-sampling.Concatenation unit 120 is used for the two-path video behind the down-sampling is spliced to obtain the Two bors d's oeuveres sequence.
As shown in Figure 5, coding module 200 can comprise up-sampling unit 210, search unit 220 and computing unit 230.Up-sampling unit 210 is used for the macro block of Two bors d's oeuveres sequence is carried out up-sampling.Search unit 220 is used for obtaining macro block behind the up-sampling in the correspondence image zone of two-way three-dimensional video-frequency.Computing unit 230 is used to calculate the coding distortion between the correspondence image zone of macro block behind the up-sampling and two-way three-dimensional video-frequency, and determines the macroblock encoding pattern according to described coding distortion.
The concrete course of work of each module and unit can be identical with the description in the said method, herein for simplicity, repeats no more.
The present invention has improved the distortion performance of two-way stereo scopic video coding by changing the distortion metrics of Macroblock Mode Selection, obtains more excellent image encoding effect.
Although illustrated and described embodiments of the invention, for the ordinary skill in the art, be appreciated that without departing from the principles and spirit of the present invention and can carry out multiple variation, modification, replacement and modification that scope of the present invention is by claims and be equal to and limit to these embodiment.
Claims (10)
1. the coding method of a two-way three-dimensional video-frequency is characterized in that, may further comprise the steps:
A: the two-way three-dimensional video-frequency is spliced to obtain the Two bors d's oeuveres sequence;
B: with described two-way three-dimensional video-frequency is reference, and described Two bors d's oeuveres sequence is carried out the rate-distortion optimization coding; And
C: the Two bors d's oeuveres sequence of having encoded is reconstructed to obtain the coding result of each road video.
2. method according to claim 1 is characterized in that, described steps A further comprises:
Respectively described two-way video is carried out every row ground or interlacing ground down-sampling; And
Two-path video behind the described down-sampling is spliced to obtain the Two bors d's oeuveres sequence.
3. method according to claim 1 is characterized in that, described step B further comprises:
B1: the macro block in the described Two bors d's oeuveres sequence is carried out up-sampling;
B2: obtain the correspondence image zone of macro block in described two-way three-dimensional video-frequency behind the described up-sampling;
B3: the coding distortion between the macro block behind the calculating up-sampling and the correspondence image zone of described two-way three-dimensional video-frequency; And
B4:, determine described macroblock encoding pattern according to described coding distortion.
4. method according to claim 3 is characterized in that, wherein, described macro block is being carried out in the process of up-sampling, keeps the width of described macro block or highly constant.
5. method according to claim 3 is characterized in that, described step B4 further comprises:
According to described coding distortion and following formula, determine described macroblock encoding pattern,
min{J(λ)=D(x)+λR(x)}
Wherein, D (x) is described coding distortion, and R (x) is a code check, and λ is the Lagrangian factor.
6. the code device of a two-way three-dimensional video-frequency is characterized in that, comprising:
Concatenation module is used for the two-way three-dimensional video-frequency is spliced to obtain the Two bors d's oeuveres sequence;
Coding module, being used for described two-way three-dimensional video-frequency is reference, and described Two bors d's oeuveres sequence is carried out the rate-distortion optimization coding; And
Reconstructed module is used for the Two bors d's oeuveres sequence of having encoded is reconstructed to obtain the coding result of each road video.
7. device according to claim 6 is characterized in that, described concatenation module further comprises:
Downsampling unit is used for respectively described two-way video being carried out every row ground or interlacing ground down-sampling; With
Concatenation unit is used for the two-path video behind the described down-sampling is spliced to obtain the Two bors d's oeuveres sequence.
8. device according to claim 6 is characterized in that, described coding module further comprises:
The up-sampling unit is used for the macro block of described Two bors d's oeuveres sequence is carried out up-sampling;
Search unit is used for obtaining macro block behind the described up-sampling in the correspondence image zone of described two-way three-dimensional video-frequency; With
Computing unit is used to calculate the coding distortion between the correspondence image zone of macro block behind the up-sampling and described two-way three-dimensional video-frequency, and determines described macroblock encoding pattern according to described coding distortion.
9. device according to claim 8 is characterized in that, described up-sampling unit is carrying out in the process of up-sampling the macro block in the described Two bors d's oeuveres sequence, keeps the height of described macro block or width constant.
10. device according to claim 8 is characterized in that, described computing unit is determined described macroblock encoding pattern according to described coding distortion, further comprises:
According to described coding distortion and following formula, determine described macroblock encoding pattern,
min{J(λ)=D(x)+λR(x)}
Wherein, D (x) is described coding distortion, and R (x) is a code check, and λ is the Lagrangian factor.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201110091938 CN102137259B (en) | 2011-04-13 | 2011-04-13 | Method and device for coding two paths of three-dimensional videos |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201110091938 CN102137259B (en) | 2011-04-13 | 2011-04-13 | Method and device for coding two paths of three-dimensional videos |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102137259A true CN102137259A (en) | 2011-07-27 |
CN102137259B CN102137259B (en) | 2013-03-27 |
Family
ID=44296901
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 201110091938 Active CN102137259B (en) | 2011-04-13 | 2011-04-13 | Method and device for coding two paths of three-dimensional videos |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102137259B (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102271255A (en) * | 2011-08-09 | 2011-12-07 | 清华大学 | Motion estimation method and device for dual-spelling stereo video coding |
CN102271270A (en) * | 2011-08-15 | 2011-12-07 | 清华大学 | Method and device for splicing binocular stereo video |
CN103957398A (en) * | 2014-04-14 | 2014-07-30 | 北京视博云科技有限公司 | Method and device for sampling, encoding and decoding three-dimensional image |
CN104363437A (en) * | 2014-11-28 | 2015-02-18 | 广东欧珀移动通信有限公司 | Method and apparatus for recording stereo video |
CN108111833A (en) * | 2016-11-24 | 2018-06-01 | 阿里巴巴集团控股有限公司 | For the method, apparatus and system of stereo video coding-decoding |
CN112911299A (en) * | 2019-12-03 | 2021-06-04 | 浙江宇视科技有限公司 | Video code rate control method and device, electronic equipment and storage medium |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5612735A (en) * | 1995-05-26 | 1997-03-18 | Luncent Technologies Inc. | Digital 3D/stereoscopic video compression technique utilizing two disparity estimates |
US20060268130A1 (en) * | 2005-05-26 | 2006-11-30 | Williams Karen E | In-camera panorama stitching method and apparatus |
CN101415115A (en) * | 2007-10-15 | 2009-04-22 | 华为技术有限公司 | Method for encoding and decoding video based on movement dancing mode, and encoder and decoder thereof |
CN101668205A (en) * | 2009-09-25 | 2010-03-10 | 南京邮电大学 | Self-adapting down-sampling stereo video compressed coding method based on residual error macro block |
CN101959067A (en) * | 2010-09-26 | 2011-01-26 | 北京大学 | Decision method and system in rapid coding mode based on epipolar constraint |
-
2011
- 2011-04-13 CN CN 201110091938 patent/CN102137259B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5612735A (en) * | 1995-05-26 | 1997-03-18 | Luncent Technologies Inc. | Digital 3D/stereoscopic video compression technique utilizing two disparity estimates |
US20060268130A1 (en) * | 2005-05-26 | 2006-11-30 | Williams Karen E | In-camera panorama stitching method and apparatus |
CN101415115A (en) * | 2007-10-15 | 2009-04-22 | 华为技术有限公司 | Method for encoding and decoding video based on movement dancing mode, and encoder and decoder thereof |
CN101668205A (en) * | 2009-09-25 | 2010-03-10 | 南京邮电大学 | Self-adapting down-sampling stereo video compressed coding method based on residual error macro block |
CN101959067A (en) * | 2010-09-26 | 2011-01-26 | 北京大学 | Decision method and system in rapid coding mode based on epipolar constraint |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102271255A (en) * | 2011-08-09 | 2011-12-07 | 清华大学 | Motion estimation method and device for dual-spelling stereo video coding |
CN102271270A (en) * | 2011-08-15 | 2011-12-07 | 清华大学 | Method and device for splicing binocular stereo video |
CN103957398A (en) * | 2014-04-14 | 2014-07-30 | 北京视博云科技有限公司 | Method and device for sampling, encoding and decoding three-dimensional image |
CN103957398B (en) * | 2014-04-14 | 2016-01-06 | 北京视博云科技有限公司 | A kind of sampling of stereo-picture, coding and coding/decoding method and device |
CN104363437A (en) * | 2014-11-28 | 2015-02-18 | 广东欧珀移动通信有限公司 | Method and apparatus for recording stereo video |
CN108111833A (en) * | 2016-11-24 | 2018-06-01 | 阿里巴巴集团控股有限公司 | For the method, apparatus and system of stereo video coding-decoding |
CN112911299A (en) * | 2019-12-03 | 2021-06-04 | 浙江宇视科技有限公司 | Video code rate control method and device, electronic equipment and storage medium |
CN112911299B (en) * | 2019-12-03 | 2023-02-28 | 浙江宇视科技有限公司 | Video code rate control method and device, electronic equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN102137259B (en) | 2013-03-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111837397B (en) | Error-cancelling code stream indication in view-dependent video coding based on sub-picture code streams | |
CN112740689B (en) | Video encoder, video decoder and corresponding methods | |
CN102137259B (en) | Method and device for coding two paths of three-dimensional videos | |
CN102939763B (en) | Calculating disparity for three-dimensional images | |
RU2487488C2 (en) | Method and system for encoding three-dimensional video signal, encapsulated three-dimensional video signal, method and system for three-dimensional video decoder | |
CN100496121C (en) | Image signal processing method of the interactive multi-view video system | |
CN101729892B (en) | Coding method of asymmetric stereoscopic video | |
US20120249750A1 (en) | Stereo-image quality and disparity/depth indications | |
CN102685532B (en) | Coding method for free view point four-dimensional space video coding system | |
CN112868232B (en) | Method and apparatus for intra prediction using interpolation filter | |
CN103108187B (en) | The coded method of a kind of 3 D video, coding/decoding method, encoder | |
CN102450010A (en) | Directed interpolation and data post-processing | |
CN103338370B (en) | A kind of multi-view depth video fast encoding method | |
CN101888566A (en) | Estimation method of distortion performance of stereo video encoding rate | |
CN103765474A (en) | Depth coding | |
CN104010196B (en) | 3D quality scalable video coding method based on HEVC | |
CN100581265C (en) | Processing method for multi-view point video | |
CN104885450A (en) | Image encoding method, image decoding method, image encoding device, image decoding device, image encoding program, and image decoding program | |
CN105637766A (en) | Method and apparatus of derivation for a binary partition pattern | |
CN102271270A (en) | Method and device for splicing binocular stereo video | |
CN102316323B (en) | Rapid binocular stereo-video fractal compressing and uncompressing method | |
CN101959067B (en) | Decision method and system in rapid coding mode based on epipolar constraint | |
Petrazzuoli et al. | Depth-based multiview distributed video coding | |
CN102111637A (en) | Stereoscopic video depth map generation method and device | |
CN103220532A (en) | Joint prediction encoding method and joint predication encoding system for stereoscopic video |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |