WO2003075579A2 - Method and system for layered video encoding - Google Patents

Publication number
WO2003075579A2
Authority
WO
WIPO (PCT)
Prior art keywords
significance
block
level
layer
recited
Application number
PCT/IB2003/000789
Other languages
French (fr)
Other versions
WO2003075579A3 (en)
Inventor
Mihaela Van Der Schaar
Rama Kalluri
Original Assignee
Koninklijke Philips Electronics N.V.
Application filed by Koninklijke Philips Electronics N.V.
Priority to JP2003573878A (JP2005519543A)
Priority to AU2003208500A (AU2003208500A1)
Priority to KR10-2004-7013637A (KR20040091682A)
Priority to EP03706790A (EP1483918A2)
Priority to US10/506,342 (US20050213831A1)
Publication of WO2003075579A2
Publication of WO2003075579A3

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/34 Scalability techniques involving progressive bit-plane based encoding of the enhancement layer, e.g. fine granular scalability [FGS]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/119 Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46 Embedding additional information in the video signal during the compression process

Abstract

In a layered encoding system having at least one layer comprising a plurality of sub-layers (272, 274, 276), a method is disclosed herein for encoding a video image (200) composed of a plurality of pixel blocks containing at least one area determined to be significant (200, 215, 220) within a corresponding sub-layer (272, 274, 276). The method comprises the steps of: associating a level of significance with each block (250, 252) of a known size within the at least one significant area (200), associating a level of significance with successively larger blocks (222, 244) dependent upon the level of significance of at least one of the blocks (250, 252) of a known size contained within said larger block (222, 244), and mapping each of the associated levels of significance. In another embodiment of the invention, the significance map is transmitted and corresponding image layers may be reconstructed using the significance map.

Description

METHOD AND SYSTEM FOR ENCODING FRACTIONAL BITPLANES
The present invention relates to video image encoding and more specifically to fractionally encoding enhancement layers of layer encoded video images.
Layer encoding, such as Fine Granular Scalability (FGS) and wavelet encoding, is well known in the video image encoding art. FGS encoding, for example, encodes video images into a base layer and an enhancement layer. The base layer represents the minimum image that may be transmitted over a network with an acceptable quality. The enhancement layer represents additional image details that may be transmitted over the network when sufficient residual bandwidth is available.
Enhancement layers are encoded in a bit-plane format wherein the most significant bits of each enhancement layer value are stored in a first bit plane and each succeeding bit of each enhancement layer value is stored in a corresponding bit plane. During transmission of the enhancement layer, the values in each bit plane are successively transmitted until the available bandwidth is occupied.
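The bit-plane format described above can be illustrated with a minimal sketch. It assumes the enhancement layer values are non-negative integer magnitudes (sign handling is not covered here), and the function names `to_bit_planes` and `from_bit_planes` are hypothetical, chosen only for this illustration.

```python
def to_bit_planes(values, depth):
    """Split non-negative integers into `depth` bit planes, most significant first."""
    planes = []
    for bit in range(depth - 1, -1, -1):
        planes.append([(v >> bit) & 1 for v in values])
    return planes


def from_bit_planes(planes, depth):
    """Rebuild approximate values from however many leading planes were received."""
    count = len(planes[0]) if planes else 0
    values = [0] * count
    for index, plane in enumerate(planes):
        shift = depth - 1 - index  # plane 0 carries the most significant bit
        for i, bit in enumerate(plane):
            values[i] |= bit << shift
    return values


# Hypothetical enhancement layer magnitudes, 3 bits deep.
residuals = [5, 2, 7, 0]
planes = to_bit_planes(residuals, depth=3)

# If bandwidth runs out after two planes, the receiver gets a coarser version.
partial = from_bit_planes(planes[:2], depth=3)
```

Truncating the list of planes models a transmission that stops once the available bandwidth is occupied: the receiver still recovers the high-order bits of every value.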
A concept of fractional bit planes has been introduced in JPEG-2000 to differentiate the importance of the various bits within a bit plane and improve the efficiency of bit plane coding within a bit plane. This concept does not exist in other layer encoding methods, such as FGS. Hence, there is a need for an encoding method and device wherein areas of the video image that are determined to be significant are identified prior to encoding the enhancement layer.
In the drawings:
Figure 1 illustrates an FGS fractional bit plane encoder in accordance with the principles of the present invention;
Figure 2 illustrates a significance mapped enhancement layer bit plane;
Figure 3a illustrates a flow chart of an exemplary block diagram for identifying significant image areas within an image in accordance with the principles of the invention;
Figure 3b illustrates a flow chart of an exemplary process for generating a significance map in accordance with the principles of the invention; and
Figure 4 illustrates a system for determining significance mapped enhancement layer bit planes in accordance with the principles of the invention.
It is to be understood that these drawings are solely for purposes of illustrating the concepts of the invention and are not intended as a definition of the limits of the invention. The embodiments shown in Figures 1 through 4 and described in the accompanying detailed description are to be used as illustrative embodiments and should not be construed as the only manner of practicing the invention. Also, the same reference numerals, possibly supplemented with reference characters where appropriate, have been used to identify similar elements.
In a layered encoding system having at least one layer comprising a plurality of sub-layers, a method is disclosed herein for encoding a video image composed of a plurality of pixel blocks containing at least one area determined to be significant within a corresponding sub-layer. The method comprises the steps of associating a level of significance with each block of a known size within the at least one significant area, associating a level of significance with each successively larger block dependent upon the level of significance of at least one of the blocks of a known size contained within the successively larger block, and mapping each of the associated levels of significance. In another embodiment of the invention, the significance map is transmitted and corresponding image layers may be reconstructed using the significance map.
Figure 1 illustrates a block diagram of an exemplary fractional bit plane encoder 100 in accordance with the principles of the present invention. In this diagram, input signal 110 is applied to summer 115, where it is mixed with motion compensated images, as will be further discussed. The combined signal is then applied to Discrete Cosine Transform (DCT) 120 to convert pixel values into coefficients. The DCT coefficients are next applied to quantizer 125 for quantization. The quantized DCT coefficients are then applied to a Variable Length Coder 130 and combiner 175.
The quantized DCT coefficients are also applied to inverse quantizer 135 to restore the DCT coefficients. As should be understood, the restored DCT coefficients are not exactly the same as the original DCT values, as some information is lost in the quantization process. The inverse quantized coefficients are next applied to inverse DCT 140 to recover the pixel elements after DCT and quantization processing. Similarly, a known difference between the original pixel elements and the restored pixel elements exists because some information is lost in the quantization process. The recovered pixel elements are applied to motion estimator/motion compensator 145. The motion estimated/compensated signal is then applied to summing device 115 to be combined with the original image 110.
The summed image 150 is also applied to summing device 155 along with the recovered pixel elements output from inverse DCT 140. The output of summing device 155 is a residual image, i.e., the difference between the original signal 110 and the recovered base layer image. The residual image is concurrently applied to enhancement layer encoder 160 and significance map encoder 165. The results of significance map encoder 165 are further applied to enhancement encoder 170 for mapping the bit planes, as will be more fully described.
The outputs of enhancement layer encoder 170 and significance map encoder 165 are applied to combiner 180, and the combined output is applied to combiner 175. The output 190 of combiner 175 may then be transmitted over a network or stored for subsequent transmission.
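The relationship between quantization loss and the residual passed to the enhancement layer can be shown with a small numeric sketch. It is deliberately simplified: it omits the transform and motion compensation stages, uses a uniform scalar quantizer with an arbitrary step size, and all names (`quantize`, `dequantize`, `residual`) are assumptions made for illustration rather than elements of encoder 100.

```python
def quantize(values, step):
    """Uniform scalar quantization, truncating toward zero (illustrative choice)."""
    return [int(v / step) for v in values]


def dequantize(levels, step):
    """Inverse quantization: scale the levels back up by the step size."""
    return [q * step for q in levels]


def residual(original, restored):
    """Difference between the original values and the restored base layer values."""
    return [o - r for o, r in zip(original, restored)]


# Hypothetical input values and step size.
original = [37, -12, 5, 0]
step = 8

levels = quantize(original, step)        # base layer data
restored = dequantize(levels, step)      # what a base-layer-only decoder recovers
enhancement = residual(original, restored)

# Adding the residual back to the restored values recovers the original exactly.
recovered = [r + e for r, e in zip(restored, enhancement)]
```

The residual holds exactly the information discarded by quantization, which is why sending more of it, as bandwidth permits, progressively refines the base layer image.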
Figure 2a illustrates an image frame 200 containing significant information, such as changes in boundaries, color or texture. Significant image areas 210, 215, 220 may be identified using known methods. Correspondingly, areas that exhibit little or no change in texture may be identified as non-significant. Consequently, little or no information regarding these areas need be transmitted. Accordingly, in one embodiment of the invention, the determination of significant areas may be done by reviewing each pixel element. In a preferred embodiment, the determination of significant areas may be done by reviewing corresponding DCT coefficients.
Figure 2b illustrates another aspect of the present invention, wherein a significant image area, for example 210, is associated with a plurality of blocks, corresponding macroblocks, and corresponding super-macroblocks. Although a specific segmentation of the image is shown, it will be appreciated that the image may be segmented according to other criteria, as will be discussed below. In this illustrated example, image area 210 is composed of super-macroblocks 222, 224, 226, 228, 230 and 232. Each super-macroblock may be partitioned into macroblocks. For clarity, super-macroblock 222 is shown partitioned into macroblocks 240, 242, 244 and 246. Each macroblock 240, 242, 244 and 246 may be further partitioned into mini-macroblocks. For clarity, macroblock 240 is shown partitioned into mini-macroblocks 250, 252, 254, and 256. Each mini-macroblock may be further partitioned into blocks. For clarity, mini-macroblock 250 is shown partitioned into blocks 260, 262, 264 and 266. As will be appreciated, each super-macroblock may be similarly partitioned into, and associated with, macroblocks, mini-macroblocks, and blocks.
In a preferred embodiment, block 260 contains information associated with an 8x8 configuration of pixel elements. Furthermore, mini-macroblock 250 is associated with a 16x16 configuration of pixel elements, macroblock 240 is associated with a 32x32 configuration of pixel elements and super-macroblock 222 is associated with a 64x64 configuration of pixel elements. In this preferred embodiment, block 260 is analogous to the DCT encoding of a corresponding block of pixel elements.
Figure 2c illustrates the bit-plane mapping 270 of the identified significant area 210 in bit planes 272, 274, and 276 in accordance with the preferred embodiment of the invention. In this case the enhancement layer is encoded using three bit planes. However, it should be understood that the depth of the bit planes may be any number and there is no intention to limit the bit-plane depth to that shown herein. In this preferred embodiment, since the DCT information is mapped to each bit plane, area 210 and the associated super-macroblocks, macroblocks, mini-macroblocks, and blocks may be readily identified.
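The four-level partition described above (8x8, 16x16, 32x32 and 64x64) can be sketched as a simple coordinate lookup. This sketch assumes every level is a fixed grid aligned to the image origin, which is only one possible segmentation; the names `BLOCK_SIZES` and `containing_blocks` are hypothetical.

```python
# Block edge lengths in pixels for each level of the preferred embodiment.
BLOCK_SIZES = {
    "block": 8,
    "mini-macroblock": 16,
    "macroblock": 32,
    "super-macroblock": 64,
}


def containing_blocks(x, y):
    """Return the (column, row) grid index of the unit containing pixel (x, y)
    at each level, assuming grids aligned to the image origin."""
    return {name: (x // size, y // size) for name, size in BLOCK_SIZES.items()}


# Pixel (70, 20) lies in the second super-macroblock along the top row.
location = containing_blocks(70, 20)
```

Because each size is double the previous one, every unit at one level contains exactly four units of the level below, which is the quadtree-like relationship the figure depicts.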
Figure 3a illustrates a flow chart of an exemplary process 300 for significance mapping in accordance with the principles of the invention. In this process significance mapping is initiated at an arbitrarily selected bit plane associated with the image or picture. In the illustrated preferred embodiment, the bit plane associated with the most significant bits, i.e., bit plane 0, is selected at block 305. At block 310, a significance map associated with the selected bit plane is determined. At block 315, the significance map associated with the bit plane is coded. At block 320, the texture of the blocks identified as being significant is coded and a bit-wise representation of the significance map is generated. This bit-wise representation of the significance map can be decoded at the receiving device to recover the significance map. At block 325, a determination is made whether all the bit planes associated with the image have been processed. If the answer is negative, then a next/subsequent bit plane is selected at block 332 and the significance mapping process continues for the selected next/subsequent bit plane. If, however, the answer is in the affirmative, then a determination is made at block 330 whether all the images have been processed. If the answer is negative, then a next/subsequent image or picture is selected at block 334. The significance mapping process then continues for each bit plane in the selected next/subsequent image or picture.
Figure 3b illustrates a flow chart of an exemplary significance mapping process 310. In this exemplary process an initial block size and associated minimum and maximum block sizes are determined at block 340. In this case, an initial block size associated with the preferred block size is depicted. At block 345, a determination is made whether the current block size is equal to the smallest block size. If the answer is in the affirmative, a determination is made at block 350 whether the current block has any non-zero coefficients. If the answer is in the affirmative, then the associated block is marked or identified as being significant at block 355. However, if the answer is negative, then the block is marked or identified as being insignificant at block 370.
After identifying the current block as significant, at block 355, or insignificant, at block 370, a determination is made at block 360 whether the last block has been reached. If the answer is negative, then a next/subsequent block in the bit plane is selected at block 365. Processing continues on the selected next/subsequent block at block 345.
If, however, the answer at block 360 is in the affirmative, i.e., all blocks of the current size have been processed, then a determination is made at block 375 whether the current block size is greater than the maximum block size. If the answer is in the negative, then the current block size is increased, preferably doubled, at block 380. Processing continues on each block associated with the increased size at block 345.
Returning to the determination at block 345, if the answer is negative, then a determination is made at block 385 whether smaller blocks, i.e., children within the larger block, are significant. If the answer is affirmative, then the larger block is marked or identified as being significant at block 355. If, however, the answer is in the negative, then the larger block is marked or identified as being insignificant at block 370.
Processing then continues on each of the successively larger blocks until the block size exceeds a maximum block size at block 375.
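The bottom-up flow of Figure 3b can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: a bit plane is modelled as a 2D list of 0/1 values whose dimensions are multiples of the maximum block size, block sizes double from 8 to 64, and a larger block is treated as significant when at least one of its four children is significant. The function names `build_significance_maps` and `map_all_planes` are hypothetical.

```python
def build_significance_maps(plane, min_size=8, max_size=64):
    """Return {block_size: 2D list of bool} for one bit plane.

    Smallest blocks are significant if they contain any non-zero value
    (block 350); larger blocks are significant if any child block is
    significant (block 385, under the "at least one child" assumption).
    """
    height, width = len(plane), len(plane[0])
    maps = {}
    size = min_size
    while size <= max_size:  # stop once the size would exceed the maximum
        rows, cols = height // size, width // size
        level = [[False] * cols for _ in range(rows)]
        for r in range(rows):
            for c in range(cols):
                if size == min_size:
                    level[r][c] = any(
                        plane[y][x]
                        for y in range(r * size, (r + 1) * size)
                        for x in range(c * size, (c + 1) * size)
                    )
                else:
                    children = maps[size // 2]
                    level[r][c] = any(
                        children[2 * r + dy][2 * c + dx]
                        for dy in (0, 1)
                        for dx in (0, 1)
                    )
        maps[size] = level
        size *= 2  # block 380: double the block size
    return maps


def map_all_planes(planes):
    """Outer loop in the spirit of Figure 3a: one set of maps per bit plane,
    starting from the most significant plane (index 0)."""
    return [build_significance_maps(plane) for plane in planes]


# A 64x64 plane with a single non-zero bit at row 10, column 40.
plane = [[0] * 64 for _ in range(64)]
plane[10][40] = 1
maps = build_significance_maps(plane)
```

In this example only one 8x8 block is significant, and significance propagates upward to exactly one unit at each larger size, so a decoder reading the maps from the largest size down could skip every other region of the plane.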
Figure 4 illustrates an exemplary embodiment of a system 400 that may be used for implementing the principles of the present invention. System 400 may represent a TV transmitter or receiving system, a desktop, laptop or palmtop computer, a personal digital assistant (PDA), a video/image storage apparatus such as a video cassette recorder (VCR), a digital video recorder (DVR), a TiVo apparatus, etc., as well as portions or combinations of these and other devices. System 400 may contain one or more input/output devices 402, processors 403, and memories 404, which may access one or more sources 401 that contain video images. Sources 401 may be stored in permanent or semi-permanent media such as a television receiver (SDTV or HDTV), a VCR, RAM, ROM, hard disk drive, optical disk drive or other video image storage devices. Sources 401 may alternatively be accessed over one or more network connections 410 for receiving video from a server or servers over, for example, a global computer communications network such as the Internet, a wide area network, a metropolitan area network, a local area network, a terrestrial broadcast system, a cable network, a satellite network, a wireless network, or a telephone network, as well as portions or combinations of these and other types of networks.
Input/output devices 402, processors 403, and memories 404 may communicate over a communication medium 406. Communication medium 406 may represent, for example, a bus, a communication network, one or more internal connections of a circuit, circuit card or other apparatus, as well as portions and combinations of these and other communication media. Input data from the sources 401 is processed in accordance with one or more software programs that may be stored in memories 404 and executed by processors 403 in order to supply fractionally encoded video images to network 420. The fractionally encoded video images may be transmitted to a storage device, or may be transmitted to a display system for real-time viewing of the encoded video image.
Processors 403 may be any means, such as general purpose or special purpose computing system, or may be a hardware configuration, such as a laptop computer, desktop computer, handheld computer, dedicated logic circuit, integrated circuit, Programmable Array Logic (PAL), Application Specific Integrated Circuit (ASIC), etc., that provides a known output in response to known inputs.
In a preferred embodiment, the coding and decoding employing the principles of the present invention may be implemented by computer readable code executed by processor 403. The code may be stored in the memory 404 or read/downloaded from a memory medium such as a CD-ROM or floppy disk (not shown). In other embodiments, hardware circuitry may be used in place of, or in combination with, software instructions to implement the invention. For example, the elements illustrated herein may also be implemented as discrete hardware elements.
In one aspect of the invention, the term processor may represent one or more processing units or computing units in communication with one or more memory units and other devices, e.g., peripherals, connected electronically to and communicating with the at least one processing unit. Furthermore, the devices may be electronically connected to the one or more processing units via internal busses, e.g., ISA bus, microchannel bus, PCI bus, PCMCIA bus, etc., or one or more internal connections of a circuit, circuit card or other device, as well as portions and combinations of these and other communication media or an external network, e.g., the Internet and Intranet.
Fundamental novel features of the present invention have been shown, described, and pointed out as applied to preferred embodiments. It should be understood that various omissions and substitutions and changes in the apparatus described, in the form and details of the devices disclosed, and in their operation, may be made by those skilled in the art without departing from the spirit of the present invention. For example, although the present invention has been described with regard to FGS encoding, it should be understood that the present invention would also be suitable for similarly developed layer encoding systems. Similarly, while super-macroblocks are discussed with regard to 64x64 arrays or matrices, it should be within the knowledge of those skilled in the art to vary the block size. Furthermore, while the boundaries of the super-macroblocks are shown fixed, it is contemplated that the super-macroblock boundaries may be dynamically determined based on the first indication of significant data.
It is also expressly intended that all combinations of those elements which perform substantially the same function in substantially the same way to achieve the same result are within the scope of the invention. Substitutions of elements from one described embodiment to another are also fully intended and contemplated.
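As an informal illustration of the hierarchical significance idea discussed above (significance assigned to blocks of a known size, then to successively larger blocks up to a maximum such as a super-macroblock), the following Python sketch builds a pyramid of significance flags for a single bit-plane. It is a minimal sketch under stated assumptions, not the disclosed implementation: the names `tile_any` and `significance_pyramid` are hypothetical, a block is treated as significant when it contains any non-zero bit, and each level is assumed to double the block size.

```python
from typing import List

Grid = List[List[int]]  # rows of 0/1 values (a bit-plane or a grid of flags)


def tile_any(grid: Grid, size: int) -> Grid:
    """Return one flag per size x size tile: 1 if the tile holds any non-zero value.

    Tiles at the right and bottom edges may be smaller than size x size.
    """
    rows, cols = len(grid), len(grid[0])
    return [
        [
            int(any(grid[r][c]
                    for r in range(top, min(top + size, rows))
                    for c in range(left, min(left + size, cols))))
            for left in range(0, cols, size)
        ]
        for top in range(0, rows, size)
    ]


def significance_pyramid(plane: Grid, base_block: int, max_block: int) -> List[Grid]:
    """Build significance flags from the base block size up to max_block.

    Assumption (hypothetical, for illustration): each level doubles the block
    size, and a larger block is significant if any block inside it is.
    """
    levels = [tile_any(plane, base_block)]
    size = base_block
    while size < max_block and (len(levels[-1]) > 1 or len(levels[-1][0]) > 1):
        levels.append(tile_any(levels[-1], 2))  # merge 2x2 neighbouring flags
        size *= 2
    return levels


if __name__ == "__main__":
    # An 8x8 bit-plane with a single non-zero bit at row 5, column 6.
    plane = [[0] * 8 for _ in range(8)]
    plane[5][6] = 1
    for level in significance_pyramid(plane, base_block=2, max_block=8):
        print(level)
```

Repeating the call for each bit-plane would give one pyramid per sub-layer; how the flags are entropy coded or transmitted is left open here.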

Claims

CLAIMS:
1. In a layered encoding system having at least one layer comprising a plurality of sub-layers, a method for encoding a video image (200), composed of a plurality of pixel blocks, containing at least one area determined to be significant (210) within a corresponding sub-layer (272, 274, 276), said method comprising the steps of: a. associating a level of significance with each block of a known size (250, 252) within said at least one significant area (210); b. associating a level of significance with each of at least one successively larger blocks (222, 244) dependent upon said level of significance of at least one of said blocks (250, 252) of a known size contained within said successively larger block (222, 244); and c. mapping each of said associated levels of significance.
2. The method as recited in claim 1, further comprising the step of: repeating steps a-c for each of said sub-layers.
3. The method as recited in claim 1, further comprising the step of: transmitting said significance level mapping corresponding to said sub-layer.
4. The method as recited in claim 1, wherein said layer encoding system is a Fine Granular Scalable (FGS) System.
5. The method as recited in claim 4, wherein said sub-layer is a bit-plane (272, 274, 276).
6. The method as recited in claim 1, wherein said block size is selected from a predetermined set of sizes.
7. The method as recited in claim 1, wherein said successively larger block has a known maximum value.
8. A system (400) for encoding (100) a video image (200) formed as a plurality of pixel blocks into at least one layer wherein one of said layers is composed of a plurality of sub-layers (272, 274, 276), said sub-layer including at least one significant area (210), comprising: means (165) for associating a level of significance with each block of a known size (250, 252) within said at least one significant area (210); means (165) for identifying a level of significance with each of at least one successively larger block (222, 244) dependent upon said level of significance of at least one of said blocks (250, 252) of a known size contained within said successively larger block (222, 244); and means (165) for mapping said level of significance.
9. The system as recited in claim 8, wherein said mapping includes information regarding each of said blocks of known size and successive blocks having a known level.
10. The system as recited in claim 8, wherein said known level is representative of a non-zero coefficient.
11. A decoding system for decoding images transmitted as a layer encoded signal, comprising: means for receiving data corresponding to a significance mapping of at least one sub-layer of said layered encoding signal; means for decoding said significance map; and means for reconstructing a corresponding one for said sub-layers from said significance map.
12. The decoding system as recited in claim 11, further comprising: means for receiving said layer encoded signal transmitted over a network.
13. The decoding system as recited in claim 11, wherein said significance map includes information regarding blocks containing significant information.
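For readers who want a concrete picture of how a significance map of this kind might be serialized by an encoder and recovered by a decoder, the Python sketch below round-trips a grid of base-block significance flags through a top-down flag stream. It is only an illustrative sketch and makes no claim about the actual bitstream syntax: the functions `encode_flags` and `decode_flags` are hypothetical, the grid is assumed to be square with a power-of-two side, each larger block is assumed to split into four quadrants, and no entropy coding is applied.

```python
from typing import List

Grid = List[List[int]]  # square grid of 0/1 flags, one per block of known size


def encode_flags(base: Grid) -> List[int]:
    """Serialize base-block flags top-down.

    One flag is emitted per region; the four quadrants of a region are
    visited only when that region is significant (assumed quadtree layout).
    """
    out: List[int] = []

    def visit(top: int, left: int, size: int) -> None:
        flag = int(any(base[r][c]
                       for r in range(top, top + size)
                       for c in range(left, left + size)))
        out.append(flag)
        if flag and size > 1:
            half = size // 2
            for dt in (0, half):
                for dl in (0, half):
                    visit(top + dt, left + dl, half)

    visit(0, 0, len(base))
    return out


def decode_flags(bits: List[int], n: int) -> Grid:
    """Rebuild the n x n grid of base-block flags from the flag stream."""
    base = [[0] * n for _ in range(n)]
    stream = iter(bits)

    def visit(top: int, left: int, size: int) -> None:
        if not next(stream):
            return  # the whole region is insignificant; nothing more was sent
        if size == 1:
            base[top][left] = 1  # a significant block of the known size
            return
        half = size // 2
        for dt in (0, half):
            for dl in (0, half):
                visit(top + dt, left + dl, half)

    visit(0, 0, n)
    return base


if __name__ == "__main__":
    flags = [[0] * 4 for _ in range(4)]
    flags[2][3] = 1
    bits = encode_flags(flags)
    print(len(bits), bits)
    print(decode_flags(bits, 4) == flags)
```

In this toy layout a region with no significant data costs a single flag, which is the motivation for grouping blocks into larger ones; a decoder would then read coefficient data only for the blocks marked significant.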
PCT/IB2003/000789 2002-03-05 2003-03-04 Method and system for layered video encoding WO2003075579A2 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
JP2003573878A JP2005519543A (en) 2002-03-05 2003-03-04 Method and system for layer video coding
AU2003208500A AU2003208500A1 (en) 2002-03-05 2003-03-04 Method and system for layered video encoding
KR10-2004-7013637A KR20040091682A (en) 2002-03-05 2003-03-04 Method and system for layered video encoding
EP03706790A EP1483918A2 (en) 2002-03-05 2003-03-04 Method and system for layered video encoding
US10/506,342 US20050213831A1 (en) 2002-03-05 2003-03-04 Method and system for encoding fractional bitplanes

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US36259202P 2002-03-05 2002-03-05
US60/362,592 2002-03-05
US43405502P 2002-12-17 2002-12-17
US60/434,055 2002-12-17

Publications (2)

Publication Number Publication Date
WO2003075579A2 (en) 2003-09-12
WO2003075579A3 (en) 2003-12-31

Family

ID=27791716

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2003/000789 WO2003075579A2 (en) 2002-03-05 2003-03-04 Method and system for layered video encoding

Country Status (6)

Country Link
EP (1) EP1483918A2 (en)
JP (1) JP2005519543A (en)
KR (1) KR20040091682A (en)
CN (1) CN1640146A (en)
AU (1) AU2003208500A1 (en)
WO (1) WO2003075579A2 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100834757B1 (en) * 2006-03-28 2008-06-05 삼성전자주식회사 Method for enhancing entropy coding efficiency, video encoder and video decoder thereof
KR100856064B1 (en) * 2006-06-12 2008-09-02 경희대학교 산학협력단 Method and Apparatus for preferential encoding/decoding in Fine Granular Scalability
DE602007010835D1 (en) * 2007-01-18 2011-01-05 Fraunhofer Ges Forschung QUALITY SCALABLE VIDEO DATA CURRENT
KR101624649B1 (en) 2009-08-14 2016-05-26 삼성전자주식회사 Method and apparatus for video encoding considering hierarchical coded block pattern, and method and apparatus for video decoding considering hierarchical coded block pattern
ES2554237T3 (en) * 2009-10-01 2015-12-17 Sk Telecom. Co., Ltd. Method and apparatus for encoding / decoding image using a split layer
EP2773047A3 (en) * 2011-10-24 2015-01-14 BlackBerry Limited Significance map encoding and decoding using partition selection

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999049412A1 (en) * 1998-03-20 1999-09-30 University Of Maryland Method and apparatus for compressing and decompressing images
US20010016008A1 (en) * 1998-10-09 2001-08-23 Paramvir Bahl Method and apparatus for use in transmitting video information over a communication network
WO2001091454A2 (en) * 2000-05-25 2001-11-29 Koninklijke Philips Electronics N.V. Bit-plane dependent signal compression
US20020006161A1 (en) * 1999-07-06 2002-01-17 Van Der Schaar Mihaela Method and apparatus for improved efficiency in transmission of fine granular scalable selective enhanced images
US20020080878A1 (en) * 2000-10-12 2002-06-27 Webcast Technologies, Inc. Video apparatus and method for digital video enhancement

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LI W: "FINE GRANULARITY SCALABILITY IN MPEG-4 FOR STREAMING VIDEO" ISCAS 2000. PROCEEDINGS OF THE 2000 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS. GENEVA, SWITZERLAND, MAY 28-31, 2000, IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, NEW YORK, NY: IEEE, US, vol. 5 OF 5, 2000, pages 299-302, XP000965729 ISBN: 0-7803-5483-4 *
VAN DER SCHAAR M ET AL: "Content-based selective enhancement for streaming video" PROCEEDINGS 2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING. ICIP 2001. THESSALONIKI, GREECE, OCT. 7 - 10, 2001, INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, NEW YORK, NY: IEEE, US, vol. 1 OF 3. CONF. 8, 7 October 2001 (2001-10-07), pages 977-980, XP010563929 ISBN: 0-7803-6725-1 *

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1908289A1 (en) * 2005-04-13 2008-04-09 Nokia Corporation Method, device and system for effective fine granularity scalability (fgs) coding and decoding of video data
EP1908289A4 (en) * 2005-04-13 2011-01-26 Nokia Corp Method, device and system for effective fine granularity scalability (fgs) coding and decoding of video data
US7536395B2 (en) 2006-06-06 2009-05-19 International Business Machines Corporation Efficient dynamic register file design for multiple simultaneous bit encodings
NO20090019L (en) * 2006-06-30 2009-03-24 Tech Univ Delft Ship with surface for bow control
US9930365B2 (en) 2008-10-03 2018-03-27 Velos Media, Llc Video coding with large macroblocks
US8948258B2 (en) 2008-10-03 2015-02-03 Qualcomm Incorporated Video coding with large macroblocks
US11758194B2 (en) 2008-10-03 2023-09-12 Qualcomm Incorporated Device and method for video decoding video blocks
US11039171B2 (en) 2008-10-03 2021-06-15 Velos Media, Llc Device and method for video decoding video blocks
US8483285B2 (en) 2008-10-03 2013-07-09 Qualcomm Incorporated Video coding using transforms bigger than 4×4 and 8×8
TWI419567B (en) * 2008-10-03 2013-12-11 Qualcomm Inc Video coding with large macroblocks
US8619856B2 (en) 2008-10-03 2013-12-31 Qualcomm Incorporated Video coding with large macroblocks
US8634456B2 (en) 2008-10-03 2014-01-21 Qualcomm Incorporated Video coding with large macroblocks
US10225581B2 (en) 2008-10-03 2019-03-05 Velos Media, Llc Video coding with large macroblocks
US9788015B2 (en) 2008-10-03 2017-10-10 Velos Media, Llc Video coding with large macroblocks
CN101527786B (en) * 2009-03-31 2011-06-01 西安交通大学 Method for strengthening definition of sight important zone in network video
US11082697B2 (en) 2009-07-01 2021-08-03 Interdigital Vc Holdings, Inc. Methods and apparatus for signaling intra prediction for large blocks for video encoders and decoders
US11936876B2 (en) 2009-07-01 2024-03-19 Interdigital Vc Holdings, Inc. Methods and apparatus for signaling intra prediction for large blocks for video encoders and decoders
US8699580B2 (en) 2009-10-09 2014-04-15 Cisco Technology, Inc. Method, apparatus, and computer readable medium for video compression
NO20093155A1 (en) * 2009-10-16 2011-04-18 Tandberg Telecom As Methods, computer programs and devices for encoding and decoding video
EP2489194A4 (en) * 2009-10-16 2013-06-12 Cisco Systems Int Sarl Methods for video coding and decoding
EP2489194A1 (en) * 2009-10-16 2012-08-22 Cisco Systems International Sarl Methods for video coding and decoding

Also Published As

Publication number Publication date
JP2005519543A (en) 2005-06-30
AU2003208500A8 (en) 2003-09-16
WO2003075579A3 (en) 2003-12-31
CN1640146A (en) 2005-07-13
AU2003208500A1 (en) 2003-09-16
KR20040091682A (en) 2004-10-28
EP1483918A2 (en) 2004-12-08

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2003706790

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 1020047013637

Country of ref document: KR

Ref document number: 10506342

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2003573878

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 20038052415

Country of ref document: CN

WWP Wipo information: published in national office

Ref document number: 1020047013637

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2003706790

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 2003706790

Country of ref document: EP