US8306107B2 - Syntax elements to SVC to support color bit depth scalability - Google Patents

Syntax elements to SVC to support color bit depth scalability Download PDF

Info

Publication number
US8306107B2
US8306107B2 US12/311,922 US31192206A US8306107B2 US 8306107 B2 US8306107 B2 US 8306107B2 US 31192206 A US31192206 A US 31192206A US 8306107 B2 US8306107 B2 US 8306107B2
Authority
US
United States
Prior art keywords
bit depth
bit
prediction
indication
inter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US12/311,922
Other versions
US20090323804A1 (en
Inventor
Yong Ying Gao
Yu Wen Wu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
InterDigital Madison Patent Holdings SAS
Original Assignee
Thomson Licensing SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing SAS filed Critical Thomson Licensing SAS
Assigned to THOMSON LICENSING reassignment THOMSON LICENSING ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: GAO, YONG YING, WU, YU WEN
Publication of US20090323804A1 publication Critical patent/US20090323804A1/en
Application granted granted Critical
Publication of US8306107B2 publication Critical patent/US8306107B2/en
Assigned to THOMSON LICENSING DTV reassignment THOMSON LICENSING DTV ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: THOMSON LICENSING
Assigned to THOMSON LICENSING DTV reassignment THOMSON LICENSING DTV ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: THOMSON LICENSING
Assigned to INTERDIGITAL MADISON PATENT HOLDINGS reassignment INTERDIGITAL MADISON PATENT HOLDINGS ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: THOMSON LICENSING DTV
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/187Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/105Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/184Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Definitions

  • This invention relates to the technical field of digital video coding. It presents a technical solution for a novel type of scalability: color bit depth scalability. Two new syntax elements are presented to be added to SVC.
  • This invention presents a scalable solution to encode the whole 12-bit raw video once to generate one bitstream that contains an H.264/AVC compatible base layer and a scalable enhancement layer. If an H.264/AVC decoder is available at the client end, only the base layer sub-bitstream is decoded and the decoded 8-bit video can be viewed on a conventional 8-bit display device; if the color bit depth scalable decoder is available at the client end, both the base layer and the enhancement layer sub-bitstreams will be decoded to obtain the 12-bit video and it can be viewed on a high quality display device that supports more than eight bit.
  • FIG. 1 illustrates a framework of color bit depth scalable coding
  • FIG. 1 The framework of the presented color bit depth scalable coding is shown in FIG. 1 .
  • two videos will be used as an input to the video codec: N-bit raw video and M-bit (usually 8-bit) video (N>M).
  • the M-bit video can be either converted from the N-bit raw video or given by other ways.
  • the M-bit video is encoded as the base layer using the inside H.264/AVC encoder.
  • the N-bit video is encoded as the enhancement layer using the scalable encoder.
  • the coding efficiency of the enhancement layer can be significantly improved by utilizing the information of the base layer. We call the utilization of the base layer information in encoding the enhancement layer inter-layer prediction.
  • the coded bitstreams will be multiplexed to form a scalable bitstream.
  • base layer decoder will use only the base layer sub-bitstream which is extracted from the whole bitstream, to provide a M-bit reconstructed video. By decoding the whole bitstream, N-bit video can be reconstructed.
  • bit_depth_scalability_flag bit_depth_pred_idc
  • bit_depth_scalability_flag 1 specifies that process of color bit depth prediction shall be invoked in the inter-layer prediction. Otherwise (equal to 0) specified that no process of color bit depth prediction shall be invoked (default).
  • bit_depth_pred_idc 0 specifies that the operation of bit-shift is utilized as the color bit depth inter-layer prediction (default). Otherwise is reserved for advanced color bit depth prediction.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Color Television Systems (AREA)

Abstract

This invention presents a scalable solution to encode the whole 12-bit raw video once to generate one bitstream that contains an H.264/AVC compatible base layer and a scalable enhancement layer. If an H.264/AVC decoder is available at the client end, only the base layer sub-bitstream is decoded and the decoded 8-bit video can be viewed on a conventional 8-bit display device; if the color bit depth scalable decoder is available at the client end, both the base layer and the enhancement layer sub-bitstreams will be decoded to obtain the 12-bit video and it can be viewed on a high quality display device that supports more than eight bit.

Description

This application claims the benefit, under 35 U.S.C. §365 of International Application PCT/CN2006/002848, filed Oct. 25, 2006, which was published in accordance with PCT Article 21(2) on May 2, 2008 in English.
FIELD OF THIS INVENTION
This invention relates to the technical field of digital video coding. It presents a technical solution for a novel type of scalability: color bit depth scalability. Two new syntax elements are presented to be added to SVC.
BACKGROUND OF THE INVENTION
In recent years, higher bit color depth rather than the conventional eight bit color depth is more and more desirable in many fields, such as scientific imaging, digital cinema, high-quality-video-enabled computer games, and professional studio and home theatre related applications. Accordingly, the state-of-the-art video coding standard—H.264/AVC—has already included Fidelity Range Extensions, which support up to 14 bits per sample and up to 4:4:4 chroma sampling.
However, none of the existing high bit coding solutions supports color bit depth scalability. Assume that we have a scenario with 2 different decoders (or clients with different requests for the color bit depth, e.g. 12 bit) for the same raw video. The existing H.264/AVC solution is to encoder the 12-bit raw video to generate bitstream no. 1 and then convert the 12-bit raw video to an 8-bit raw video and encode the 8-bit counterpart to generate bitstream no. 2. If we want to deliver the video to different clients that request different bit depths, we have to deliver it twice, or put the 2 bitstreams in one disk together. It is of low efficiency regarding both the compression ratio and the operational complexity.
SUMMARY OF THE INVENTION
This invention presents a scalable solution to encode the whole 12-bit raw video once to generate one bitstream that contains an H.264/AVC compatible base layer and a scalable enhancement layer. If an H.264/AVC decoder is available at the client end, only the base layer sub-bitstream is decoded and the decoded 8-bit video can be viewed on a conventional 8-bit display device; if the color bit depth scalable decoder is available at the client end, both the base layer and the enhancement layer sub-bitstreams will be decoded to obtain the 12-bit video and it can be viewed on a high quality display device that supports more than eight bit.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 illustrates a framework of color bit depth scalable coding
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
The framework of the presented color bit depth scalable coding is shown in FIG. 1. In FIG. 1, two videos will be used as an input to the video codec: N-bit raw video and M-bit (usually 8-bit) video (N>M). The M-bit video can be either converted from the N-bit raw video or given by other ways.
The M-bit video is encoded as the base layer using the inside H.264/AVC encoder. The N-bit video is encoded as the enhancement layer using the scalable encoder. The coding efficiency of the enhancement layer can be significantly improved by utilizing the information of the base layer. We call the utilization of the base layer information in encoding the enhancement layer inter-layer prediction. Each picture—a group of Macroblocks (MBs)—will have two access units, one for the base layer and the other one for the enhancement layer. The coded bitstreams will be multiplexed to form a scalable bitstream.
During the decoding process, base layer decoder will use only the base layer sub-bitstream which is extracted from the whole bitstream, to provide a M-bit reconstructed video. By decoding the whole bitstream, N-bit video can be reconstructed.
In this part, we present the technical solution to color bit depth scalability. Two new syntax elements are added to SVC SPS in SVC extension (seq_parameter_set_svc_extension( )) to support color bit depth scalability: bit_depth_scalability_flag and bit_depth_pred_idc, as highlighted.
De-
scrip-
seq_parameter_set_svc_extension( ) { C tor
extended_spatial_scalability 0 u(2)
if ( chroma_format_idc > 0 ) {
chroma_phase_x_plus1 0 u(2)
chroma_phase_y_plus1 0 u(2)
}
if( extended_spatial_scalability = = 1 )
{
scaled_base_left_offset 0 se(v)
scaled_base_top_offset 0 se(v)
scaled_base_right_offset 0 se(v)
scaled_base_bottom_offset 0 se(v)
}
bit_depth_scalability_flag 0 u(1)
if ( bit_depth_scalability_flag ) {
bit_depth_pred_idc 0 ue(v)
}
fgs_coding_mode 2 u(1)
if( fgs_coding_mode = = 0 ) {
groupingSizeMinus1 2 ue(v)
} else {
numPosVector = 0
do {
if( numPosVector = = 0 ) {
scanIndex0 2 ue(v)
}
else {
deltaScanIndexMinus1[numPosVector] 2 ue(v)
}
numPosVector ++
}
while( scanPosVectLuma[ numPosVector −
1 ] < 15 )
}
}
bit_depth_scalability_flag equal to 1 specifies that process of color bit depth prediction shall be invoked in the inter-layer prediction. Otherwise (equal to 0) specified that no process of color bit depth prediction shall be invoked (default).
bit_depth_pred_idc equal to 0 specifies that the operation of bit-shift is utilized as the color bit depth inter-layer prediction (default). Otherwise is reserved for advanced color bit depth prediction.

Claims (5)

1. A method for encoding video data in a bit depth scalable manner, wherein an enhancement layer video is predicted from a reconstructed base layer video, and wherein at least a first and a different second indication are added to the data to define the process of bit depth scalability, wherein
if the first indication has a first value, no bit depth inter-layer prediction is utilized;
if the first indication has a second value, it specifies that bit depth inter-layer prediction is utilized, wherein the type of inter-layer prediction depends on the value of the second indication, wherein
if the second indication has a first value, inter-layer prediction based on bit-shift is used; and
if the second indication has another than the first value, an advanced bit depth prediction is utilized,
wherein two new syntax elements are added:
a first syntax element to signal the process of color bit depth scalability, which if it is equal to one specifies that the color bit depth prediction process is invoked in the inter-layer prediction, and otherwise specifies that no process of color bit depth prediction is invoked;
a second syntax element to specify the approach for color bit depth inter-layer prediction, which if it is equal to zero specifies that the operation of bit-shift is utilized as the color bit depth inter-layer prediction, and otherwise advanced color bit depth prediction is utilized as the color bit depth inter-layer prediction.
2. The method as claimed in claim 1, wherein each picture has two access units, one for the base layer and the other one for the enhancement layer.
3. The method as claimed in claim 1, wherein coded bitstreams are multiplexed to form a scalable bitstream.
4. The method as claimed in claim 1, wherein a default mode according to the first indication is that no process of color bit depth prediction is invoked.
5. The method as claimed in claim 1, wherein a default mode according to the second indication is that operation of bit-shift is utilized.
US12/311,922 2006-10-25 2006-10-25 Syntax elements to SVC to support color bit depth scalability Active 2029-01-27 US8306107B2 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2006/002848 WO2008049271A1 (en) 2006-10-25 2006-10-25 New syntax elements to svc to support color bit depth scalability

Publications (2)

Publication Number Publication Date
US20090323804A1 US20090323804A1 (en) 2009-12-31
US8306107B2 true US8306107B2 (en) 2012-11-06

Family

ID=39324094

Family Applications (1)

Application Number Title Priority Date Filing Date
US12/311,922 Active 2029-01-27 US8306107B2 (en) 2006-10-25 2006-10-25 Syntax elements to SVC to support color bit depth scalability

Country Status (7)

Country Link
US (1) US8306107B2 (en)
EP (1) EP2084909A4 (en)
JP (1) JP2010507939A (en)
KR (1) KR20090086960A (en)
CN (1) CN101601298A (en)
BR (1) BRPI0622064A2 (en)
WO (1) WO2008049271A1 (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101622878B (en) * 2007-01-10 2015-01-14 汤姆逊许可公司 Video encoding method and video decoding method for enabling bit depth scalability
WO2009051704A1 (en) * 2007-10-16 2009-04-23 Thomson Licensing Methods and apparatus for artifact removal for bit depth scalability
US8446958B2 (en) 2009-09-22 2013-05-21 Panasonic Corporation Image coding apparatus, image decoding apparatus, image coding method, and image decoding method
JP2012060261A (en) * 2010-09-06 2012-03-22 Sony Corp Image processing apparatus and method
US20140003527A1 (en) * 2011-03-10 2014-01-02 Dolby Laboratories Licensing Corporation Bitdepth and Color Scalable Video Coding
TWI575933B (en) * 2011-11-04 2017-03-21 杜比實驗室特許公司 Layer decomposition in hierarchical vdr coding
US9124899B2 (en) 2012-09-28 2015-09-01 Sharp Laboratories Of America, Inc. Motion derivation and coding for scaling video
US9414058B2 (en) 2012-12-25 2016-08-09 Mediatek Inc. Video processing apparatus capable of generating output video pictures/sequence with color depth different from color depth of encoded video bitstream
CN104604234B (en) * 2013-07-12 2019-07-16 索尼公司 Image encoding apparatus and method
GB2516424A (en) * 2013-07-15 2015-01-28 Nokia Corp A method, an apparatus and a computer program product for video coding and decoding

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003036980A1 (en) 2001-10-26 2003-05-01 Koninklijke Philips Electronics N.V. Spatial scalable compression scheme using spatial sharpness enhancement techniques
US20040228535A1 (en) * 2003-05-15 2004-11-18 Matsushita Electric Industrial Co., Ltd Moving image decoding apparatus and moving image decoding method
US20050259729A1 (en) 2004-05-21 2005-11-24 Shijun Sun Video coding with quality scalability
US20060233235A1 (en) * 2005-04-14 2006-10-19 Samsung Electronics Co., Ltd. Video encoding/decoding apparatus and method capable of minimizing random access delay

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003036980A1 (en) 2001-10-26 2003-05-01 Koninklijke Philips Electronics N.V. Spatial scalable compression scheme using spatial sharpness enhancement techniques
US20040228535A1 (en) * 2003-05-15 2004-11-18 Matsushita Electric Industrial Co., Ltd Moving image decoding apparatus and moving image decoding method
US20050259729A1 (en) 2004-05-21 2005-11-24 Shijun Sun Video coding with quality scalability
US20060233235A1 (en) * 2005-04-14 2006-10-19 Samsung Electronics Co., Ltd. Video encoding/decoding apparatus and method capable of minimizing random access delay

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Search Report Dated Jul. 19, 2007.

Also Published As

Publication number Publication date
JP2010507939A (en) 2010-03-11
US20090323804A1 (en) 2009-12-31
EP2084909A4 (en) 2012-11-21
KR20090086960A (en) 2009-08-14
BRPI0622064A2 (en) 2014-05-27
CN101601298A (en) 2009-12-09
EP2084909A1 (en) 2009-08-05
WO2008049271A1 (en) 2008-05-02

Similar Documents

Publication Publication Date Title
US8306107B2 (en) Syntax elements to SVC to support color bit depth scalability
US8737474B2 (en) Method and apparatus for encoding and/or decoding video data using enhancement layer residual prediction for bit depth scalability
US20100091840A1 (en) Video encoding method and video decoding method for enabling bit depth scalability
US8477853B2 (en) Method and apparatus for encoding and/or decoding bit depth scalable video data using adaptive enhancement layer prediction
US8798149B2 (en) Enhancement layer residual prediction for bit depth scalability using hierarchical LUTs
EP1933563A1 (en) Method and apparatus for encoding and/or decoding bit depth scalable video data using adaptive enhancement layer residual prediction
US20230276061A1 (en) Scalable video coding system with parameter signaling
US8428129B2 (en) Method and apparatus for encoding and/or decoding video data using enhancement layer residual prediction for bit depth scalability
WO2008071542A1 (en) Method and apparatus for encoding and/or decoding video data using adaptive prediction order for spatial and bit depth prediction
US20110228855A1 (en) Device for Encoding Video Data, Device for Decoding Video Data, Stream of Digital Data
EP1933565A1 (en) Method and apparatus for encoding and/or decoding bit depth scalable video data using adaptive enhancement layer prediction

Legal Events

Date Code Title Description
AS Assignment

Owner name: THOMSON LICENSING, FRANCE

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:GAO, YONG YING;WU, YU WEN;REEL/FRAME:022589/0698;SIGNING DATES FROM 20090317 TO 20090318

Owner name: THOMSON LICENSING, FRANCE

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:GAO, YONG YING;WU, YU WEN;SIGNING DATES FROM 20090317 TO 20090318;REEL/FRAME:022589/0698

STCF Information on status: patent grant

Free format text: PATENTED CASE

FPAY Fee payment

Year of fee payment: 4

AS Assignment

Owner name: THOMSON LICENSING DTV, FRANCE

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:THOMSON LICENSING;REEL/FRAME:041370/0433

Effective date: 20170113

AS Assignment

Owner name: THOMSON LICENSING DTV, FRANCE

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:THOMSON LICENSING;REEL/FRAME:041378/0630

Effective date: 20170113

AS Assignment

Owner name: INTERDIGITAL MADISON PATENT HOLDINGS, FRANCE

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:THOMSON LICENSING DTV;REEL/FRAME:046763/0001

Effective date: 20180723

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 8

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 12TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1553); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 12