WO2012141551A2 - 멀티 트랙 비디오를 사용한 고속 스케일러블 비디오 코딩 장치 및 방법 - Google Patents
멀티 트랙 비디오를 사용한 고속 스케일러블 비디오 코딩 장치 및 방법 Download PDFInfo
- Publication number
- WO2012141551A2 WO2012141551A2 PCT/KR2012/002861 KR2012002861W WO2012141551A2 WO 2012141551 A2 WO2012141551 A2 WO 2012141551A2 KR 2012002861 W KR2012002861 W KR 2012002861W WO 2012141551 A2 WO2012141551 A2 WO 2012141551A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- encoding
- layer
- macroblock
- video
- mode
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/137—Motion inside a coding unit, e.g. average field, frame or block difference
- H04N19/139—Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/107—Selection of coding mode or of prediction mode between spatial and temporal predictive coding, e.g. picture refresh
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/184—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/187—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
- H04N19/31—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the temporal domain
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/40—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video transcoding, i.e. partial or full decoding of a coded input stream followed by re-encoding of the decoded output stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
Definitions
- the present invention relates to an apparatus and method for fast scalable video coding using multi-track video. More particularly, the present invention relates to intra prediction or motion of a macroblock using encoding information of multi-track video.
- adaptive video streaming technology which adjusts and transmits the amount of video data according to the user's network environment.
- Technology to prevent users from watching videos that are cut off or broken in the middle is becoming common.
- Current adaptive video streaming technology mainly consists of creating compressed video with various data sizes for one original video and selecting compressed video according to user's network environment. You must ingest multiple compressed video, Multi-track video, for.
- Scalable Video Coding (SVC) method is based on H.264 by the join video technology (JVT) group of ITU and MPEG for the purpose of providing video services to various devices and network environments with one compressed video.
- JVT join video technology
- SVC has been standardized recently and is in the early stage of commercialization.
- SVC prepares several files with existing coding methods such as H.264 to provide files suitable for each device and network environment. It uses a multi-track video method.
- this re-encoding has a problem that takes a lot of time.
- the present invention has been made to solve the above-described problem, and utilizes encoding information including an encoding mode and prediction information of a multi-track video that is compressed video compressed in various formats.
- Fast scalable video coding using multitrack video for fast encoding macroblocks in inter-layer intra prediction mode or inter-layer / intra-layer motion prediction mode There is a technical problem in providing an apparatus and method.
- the scalable video encoding apparatus includes a video aligner, a layer of a video aligner that arranges a multi-track video into a plurality of layers according to a set criterion.
- a bitstream analyzer for extracting encoding information including an encoding mode and prediction information by analyzing the bitstream, a layer for performing scalable video encoding, and a layer for performing scalable video encoding
- a scalable video coding (SVC) encoding unit is configured to determine an encoding mode by using encoding information of an upper or lower layer of and to encode the determined encoding mode.
- the set criterion may be defined in the image quality order determined by the objective quality evaluation method, and the video alignment unit may arrange the multi-track video into a plurality of layers so that the high quality video is arranged in the upper layer according to the image quality order.
- the video aligner aligns the multi-track video so that the high resolution video is placed at the top, and when two or more videos having the same resolution exist, the video aligner arranges the video having the high frame rate at the top, And when two or more videos having the same frame rate exist, the video having a high bit rate may be arranged in a plurality of layers so that the video having a high bit rate is disposed above.
- the SVC encoding unit encodes each layer of the multi-track video in units of macroblocks, and sequentially performs scalable video encoding from the lowest layer to the highest layer, or the highest layer to the lowest layer among a plurality of layers.
- the SVC encoder may up-scale the decoded texture information of each of the macroblock of the layer performing scalable video encoding and the upper layer macroblock corresponding to the macroblock to inter-layer intra prediction. calculates the encoding cost of the inter-layer intra mode and calculates the encoding cost of the encoding mode of the layer performing scalable video encoding. Compare and, if the encoding cost of the inter-layer intra mode is less than the encoding cost of the encoding mode of the layer performing scalable video encoding, encode in the inter-layer intra mode, and if greater than or equal to, the encoding cost of the layer performing scalable video encoding You can encode in encoding mode.
- the bitstream analyzer may extract prediction information including macroblock segmentation information and motion vector information by decoding a video bitstream of an adjacent layer among a plurality of layers.
- the SVC encoding unit encodes each layer of the multi-track video in units of macroblocks, and uses the macroblock segmentation information and motion vector information to convert the macroblocks into an inter-layer motion prediction mode and an intra mode. Intra mode) or the encoding mode of the layer performing scalable video encoding.
- the SVC encoding unit uses a motion vector information to determine whether a macroblock of a layer that performs scalable video encoding is a motion prediction mode, and a second process of determining whether a lower layer macroblock of a macroblock is a motion prediction mode.
- the scalable video encoding method includes (a) arranging multi-track video into a plurality of layers according to a set criterion in the scalable video encoding apparatus. (B) extracting encoding information including an encoding mode and prediction information by analyzing the bitstreams of the aligned layers at, and a layer and a higher layer that perform scalable video encoding at the encoding apparatus. Or (c) determining an encoding mode using the encoding information of the lower layer and encoding the multitrack video with the determined encoding mode.
- the scalable video encoding method may further include generating a multi-track video by compressing an image in various formats before the step (a), or receiving a multi-track video at the encoding device.
- step (a) the multi-track video is arranged such that the video having a higher resolution is placed on a higher layer.
- step (a-1) two or more videos having the same resolution are present among the multi-track videos arranged according to the resolution.
- (A-2) reordering the video having a high frame rate to be placed on a higher layer, and when two or more videos having the same frame rate exist among the rearranged multi-track videos according to the frame rate.
- (a-3) rearranging the video having a high bit rate so as to be arranged in the upper layer.
- inter-layer intra prediction may be performed by up-scaling encoding information of a macroblock of a layer performing scalable video encoding and an upper layer macroblock corresponding to the macroblock.
- (c-1) performing prediction, calculating an encoding cost of an inter-layer intra mode and an encoding cost of an encoding mode of a layer performing scalable video encoding, respectively
- (c-2) comparing the calculated encoding cost to determine an encoding mode among interlayer intra mode or encoding mode of a layer performing scalable video encoding, and (c-3) scalable video to the determined encoding mode.
- step (c-3) if the calculated encoding cost of the interlayer intra mode is smaller than the encoding cost of the encoding mode of the layer performing scalable video encoding, the encoding mode is determined as the interlayer intra mode, and In this case, the encoding mode may be determined as an encoding mode of a layer that performs scalable video encoding.
- the set criterion is an image quality order determined by the objective image quality evaluation method
- the multi-track video may be arranged in a plurality of layers so that the high quality video is arranged in the upper layer according to the image quality order.
- step (c) includes determining whether a macroblock of the layer on which the scalable video encoding is to be performed is motion prediction mode using motion vector information, and the macroblock is motion prediction. In the case of the mode, perform the step (c-3), and if the macroblock is not the motion prediction mode, encoding the macroblock in the intra mode (c-2), and the lower layer macroblock of the macroblock is moved. In step (c-3) of determining whether the prediction mode is the motion prediction mode, and if the lower layer macroblock is the motion prediction mode, performing the step (c-5) and encoding the encoding mode of the macroblock if the lower layer macroblock is not the motion prediction mode.
- step (C-4) comparing the macroblock segmentation information of the up-scaled lower layer macroblock with the macroblock segmentation information of the macroblock (c-5) If the low block partitioning information is the same, perform step (c-7), and if the macroblock partitioning information is not the same, encoding in the encoding mode of the macroblock (c-6), the motion vector of the upscaled lower layer macroblock (C-7) comparing the motion vector of the macroblock to the macroblock. If the motion vectors are the same, the macroblock is encoded using the inter-layer motion prediction mode. Encoding (c-8) the encoding of the macroblock in steps (c-2), (c-4), (c-6) and (c-8). After this is completed, the next macroblock may be repeatedly performed from step (c-1), and may be sequentially encoded from the highest layer to the lowest macroblock.
- the fast scalable video coding apparatus and method using the multi-track video of the present invention utilizes encoding information of the multi-track video, thereby reducing the time to encode the scalable video.
- compression efficiency may be increased by comparing an encoding cost of an encoding mode of an existing multi-track video and an encoding cost of an inter-layer intra prediction mode to determine an encoding mode.
- the encoding mode is determined, thereby reducing the computation time required for motion prediction and reducing the scalable video encoding time. have.
- FIG. 1 is a block diagram showing an embodiment of a scalable video providing system according to an aspect of the present invention.
- FIG. 2 is a block diagram illustrating a first embodiment of a scalable video encoding apparatus according to another aspect of the present invention.
- FIG. 3 is a block diagram illustrating a second embodiment of a scalable video encoding apparatus according to another aspect of the present invention.
- FIG. 4 is a diagram for describing a function of a multi-track video generation unit that is a component of a scalable video encoding apparatus according to a second embodiment of the present invention.
- FIG. 5 is a diagram for describing a function of an SVC encoding unit that is a component of a scalable video encoding apparatus according to a first embodiment or a second embodiment of the present invention.
- FIG. 6 is a flowchart illustrating a scalable video encoding method according to another aspect of the present invention.
- FIG. 7 is a flowchart illustrating a first embodiment of a scalable video encoding method according to another aspect of the present invention.
- FIG. 8 is a flowchart illustrating a second embodiment of a scalable video encoding method according to another aspect of the present invention.
- FIG. 9 is a flowchart illustrating a third embodiment of a scalable video encoding method according to another aspect of the present invention.
- FIG. 10 is a flowchart illustrating a fourth embodiment of a scalable video encoding method according to another aspect of the present invention.
- FIG. 1 is a block diagram showing an embodiment of a scalable video providing system according to an aspect of the present invention.
- the scalable video providing system 1000 of the present invention includes a user terminal 100, a scalable video encoding apparatus 200, and a communication network 300.
- the user terminal 100 may receive the received scalable video, decode the display according to a specification, a network environment, or a service type of the user terminal.
- the user terminal 100 refers to a video display device including an IPTV, a set-top box, a smart phone, and a mobile communication terminal that supports the DMB function.
- the specification of the user terminal is information on the resolution and size of the display screen.
- the network environment refers to a frame rate, and the service type refers to a form in which a video such as a high definition television (HDTV), a digital standard television (SDTV), or a digital multimedia broadcasting (DMB) is displayed.
- HDMI high definition television
- SDTV digital standard television
- DMB digital multimedia broadcasting
- the scalable video encoding apparatus 200 performs a function of scalable video encoding a multi-track video, and the communication network 300 uses a scalable video transmitted from the scalable video encoding apparatus 200. It provides a path for transmitting to the terminal 100, a mobile communication network such as WCDMA, HDPA, 3G, 4G, local area network such as Bluetooth, Zigbee, Wi-Fi, wired communication network such as PSTN or wired / wireless internet And the like.
- a mobile communication network such as WCDMA, HDPA, 3G, 4G, local area network such as Bluetooth, Zigbee, Wi-Fi, wired communication network such as PSTN or wired / wireless internet And the like.
- the scalable video encoding apparatus 200 will be described in detail with reference to FIGS. 2 to 5.
- FIG. 2 is a configuration diagram illustrating a first embodiment of a scalable video encoding apparatus according to another aspect of the present invention
- FIG. 3 is a configuration diagram illustrating a second embodiment of a scalable video encoding apparatus according to another aspect of the present invention
- 4 is a diagram for describing a function of a multi-track video generation unit that is a component of a scalable video encoding apparatus according to a second embodiment of the present invention
- FIG. 5 is a first or second embodiment of the present invention.
- FIG. 7 illustrates a function of an SVC encoding unit that is a component of a scalable video encoding apparatus according to an example.
- the scalable video encoding apparatus 200 arranges multi-track video into a plurality of layers according to a set criterion, extracts encoding information of the aligned plurality of layers, and extracts the extracted encoding information. It encodes multi-track video into scalable video.
- Multi-track video refers to a plurality of compressed video having different formats of compressing the same video into various formats, and various encoding methods (eg, H. 264, etc.).
- the scalable video encoding apparatus 200 is an apparatus for encoding multi-track video, which is an existing encoded compressed video, as scalable video.
- the scalable video encoding apparatus 200 is scalable by utilizing encoding information of multi-track video when encoding multi-track video to scalable video. Speed up video encoding.
- the scalable video encoding apparatus 200 determines the encoding mode according to the 'set criteria' for arranging the multitrack video and the 'encoding information' used to determine the encoding mode, thereby encoding the encoding time by encoding the multitrack video. It can be shortened.
- the scalable video encoding apparatus 200 includes a video alignment unit 210, a bitstream analyzer 220, and an SVC encoder 230 as illustrated in FIG. 2. .
- the scalable video encoding apparatus arranges the multi-track video into a plurality of layers according to a set criterion, so that compressed video having different formats is arranged into the plurality of layers.
- the term 'layer' used hereinafter means compressed video.
- the video aligning unit 210 aligns the multi-track video with any one of a first criterion composed of a resolution, a frame rate, and a bit rate of the compressed video, or a second criterion, which is an objectively evaluated image quality order. Can be selected based on the set criteria.
- the bitstream analyzer 220 extracts encoding information including an encoding mode and prediction information by analyzing the bitstream of each layer.
- the encoding mode refers to intra mode or inter mode, which is a method of compressing a macroblock
- the prediction information includes a motion vector and macroblock partition information. Means information required for motion prediction.
- bitstream analyzer 220 extracts encoding information used when encoding multi-track video.
- the SVC encoding unit 230 uses encoding information of a layer that performs current scalable video encoding and an upper or lower layer of a layer that performs current scalable video encoding (hereinafter, referred to as a layer to be currently encoded). Determine the encoding mode and perform encoding in the determined encoding mode.
- the SVC encoding unit 230 determines encoding modes of a layer or a macroblock to be currently encoded using encoding information of two layers or corresponding macroblocks of two layers.
- the SVC encoding unit 230 determines the encoding mode of any one of the first encoding information consisting of the encoding cost (encoding cost) of the encoding mode, or the second encoding information including the motion vector and macroblock segmentation information. You can choose.
- the scalable video encoding apparatus 200 may determine as follows. Encode multitrack video in the same order as
- the video aligner 210 aligns the multi-track video into a plurality of layers according to a first criterion, that is, resolution, frame rate, and bit rate.
- the video aligning unit 210 arranges the multi-track video so that the high resolution video is arranged at the top, and when two or more videos having the same resolution exist, the video having the high frame rate is arranged at the top. If there is more than one video having the same resolution and frame rate, the video may be arranged in a plurality of layers so that a video having a high bit rate is placed on top.
- the bitstream analyzer 220 extracts encoding information by analyzing a bitstream of each layer, and sequentially provides encoding information of the lowest or highest layer to the SVC encoder 230.
- the SVC encoding unit 230 encodes each layer of the multi-track video in units of macroblocks, and sequentially performs scalable video encoding from the lowest layer to the highest layer, or the highest layer to the lowest layer among a plurality of layers.
- scalable video encoding When scalable video encoding is sequentially performed from the lowest layer, scalable video encoding of a bitstream of the lowest layer into a base layer is performed, followed by scalable video encoding of adjacent upper layers.
- the SVC encoding unit 230 up-scales decoded texture information of each of the macroblock of the layer to be currently encoded and the upper layer macroblock corresponding to the macroblock to inter-layer intra prediction. Intra prediction) is performed.
- the SVC encoding unit 230 calculates an encoding cost of an encoding mode of a layer to be currently encoded, calculates an encoding cost of an inter-layer intra mode through inter-layer intra prediction, and then calculates an encoding cost of the first layer. Used as encoding information.
- the SVC encoding unit 230 compares the calculated installation cost, and if the encoding cost of the inter-layer intra mode is less than the encoding cost of the encoding mode of the layer to be currently encoded, determines the encoding mode in the inter-layer intra mode, and If it is equal to or equal to, the encoding mode is determined by the encoding mode of the layer to be currently encoded.
- the SVC encoding unit 230 performs scalable video encoding of the multi-trap video in the determined encoding mode.
- the scalable video encoding apparatus 200 may determine as follows. Encode multitrack video in the same order as
- the video arranging unit 210 selects the second criterion as a set criterion, and arranges the multi-track video into a plurality of layers so that the high-definition video is placed in the upper layer according to the image quality order determined by the objective quality evaluation method.
- the objective image quality evaluation method can be divided into a full reference method for comparing the original image and the compressed image, a reduced reference method for extracting and evaluating only the main information from the original image, and a non-standard method for evaluating the image quality without the original image.
- the evaluation of the quality of compressed video may use the PSNR (Peak Signal to Noise Ratio) as an evaluation scale comparing the amount of loss from the original image or the amount of loss with the same loss at the same capacity.
- PSNR Peak Signal to Noise Ratio
- the bitstream analyzer 220 extracts prediction information including macroblock segmentation information and motion vector information as second encoding information through video bitstream decoding of an adjacent layer among a plurality of layers. And the prediction information of the uppermost layer and the layer adjacent to the SVC encoding unit 230.
- the SVC encoding unit 230 performs an inter-layer motion prediction mode, an intra mode, or scalable video encoding on a macroblock using macroblock segmentation information and motion vector information.
- the encoding mode of the encoding mode is determined, and each layer of the multi-track video is encoded in macroblock units.
- the SVC encoder 230 may determine whether a macroblock of a layer to be currently encoded is a motion prediction mode by using motion vector information, and may determine whether a lower layer macroblock of the macroblock is a motion prediction mode.
- a third process of determining whether the macroblock segmentation information of the up-scaled lower layer macroblock and the macroblock segmentation information of the macroblock are the same, and whether the motion vector of the macroblock and the lower layer macroblock is the same Determining a fourth process of determining sequentially to determine an encoding mode of a macroblock among an encoding mode of a layer performing inter-layer motion prediction mode, intra mode, or scalable video encoding Can be encoded in the determined encoding mode.
- the SVC encoder 230 encodes the layer macroblock to be encoded in the intra mode if the layer macroblock to be currently encoded is not the motion prediction mode in the first process, and performs the second process if the motion is the motion prediction mode.
- the SVC encoding unit 230 encodes the macroblock to be encoded in its own encoding mode, and performs the third process if it is the motion prediction mode.
- the SVC encoding unit 230 encodes the macroblock to be encoded in its own encoding mode and performs the fourth process if it is the same.
- the SVC encoding unit 230 encodes the macroblock to be encoded in its encoding mode, and if the motion vector of the macroblock is the same, to the inter-layer motion prediction mode. Encode the macroblock to be encoded currently.
- the SVC encoding unit 230 performs intra prediction using texture information of the input layer and the lower layer.
- the encoding mode is determined through the encoding mode or the motion prediction of the input layer and the lower layer.
- the SVC encoding unit 260 processes Discrete Cosine Transform (DCT) / Quantization and Context-adaptive variable-length coding (CAVLC) / Context-adaptive binary arithmetic coding (CABAC) for SVC video according to the determined operation mode. Do this.
- DCT Discrete Cosine Transform
- CAVLC Context-adaptive variable-length coding
- CABAC Context-adaptive binary arithmetic coding
- the scalable video encoding apparatus 200 encodes the intra prediction mode on the basis of the lower layer texture information through the SVC encoding unit 230 and moves to the motion prediction mode based on the motion information of the lower layer.
- the macroblock partitioning information of the macroblocks to be encoded are compared and the motion vectors are compared, and if the motion vectors are the same, the encoding is performed in the inter-layer motion prediction mode and the macroblock is encoded. If the partitioning information is not the same or the motion vector is not the same, encoding is performed in the encoding mode of the macroblock that performs the current encoding.
- the scalable video encoding apparatus includes a video alignment unit 210, a bitstream analyzer 220, an SVC encoder 230, a communication unit 240, and an original video. And a storage unit 250 and a multi-track video generator 260.
- the video alignment unit 210, the bitstream analyzer 220, and the SVC encoder 230 are the same as the scalable video encoding apparatus 200 according to the first embodiment, the communication unit 240 may be different from each other. Only the original video storage 250 and the multi-track video generator 260 will be described.
- the communication unit 240 is a component for accessing the user terminal through a communication network.
- the communication unit 240 forms a transmission channel of the control signal and the scalable video, and the original video storage unit 250 stores the unencoded original video.
- the multi-track video generator 260 receives an original video stored in the original video storage 250 and generates a multi-track video.
- the plurality of video encoders 1 to N generate original video as compressed video of different formats, and the generated compressed video of different formats means multi-track video.
- the scalable video encoding apparatus 200 according to the second embodiment further performs the function of generating the multi-track video by encoding the original video into various formats, and the scalable video encoding apparatus according to the first embodiment. There is a difference.
- FIGS. 7 to 10 illustrate first to fourth embodiments of the scalable video encoding method according to another aspect of the present invention. It is a flow chart.
- multi-track video is arranged into a plurality of layers according to a set criterion.
- encoding information including an encoding mode and prediction information is extracted by analyzing the bitstreams of the aligned layers (S620).
- an encoding mode is determined using encoding information of a layer performing scalable video encoding and upper or lower layers (S630), and the multi-track video is encoded in the determined encoding mode (S640). ).
- the scalable video encoding method may encode the scalable video at high speed by determining an encoding mode for encoding the multitrack video using encoding information of the existing multitrack video and the multitrack video.
- the scalable video encoding method according to the first embodiment selects and encodes the first reference and the first encoding information. As shown in FIG. 7, the scalable video encoding method of the multitrack video is S710 to S760. It is carried out according to the process.
- multi-track video having different formats is generated by receiving the original video (S710), and the multi-track video is sorted according to resolution, frame rate, and bit rate (S720).
- the scalable video encoding apparatus may generate a multi-track video by compressing an image into various formats.
- Multi-track video alignment (S720) is arranged so that high-resolution video is placed on a higher layer, and when two or more videos of the same resolution are present among the multi-track videos arranged according to the resolution, the frame rate is high. Reorder the video to be placed on the upper layer, and if there is more than one video having the same frame rate among the multi-track video rearranged according to the frame rate, it is performed by rearranging the video having the higher bit rate to be placed on the upper layer. Can be.
- the encoding information including the encoding mode and the prediction information of each aligned layer is extracted (S730), and the encoding information of the (N-1) th layer to be currently encoded and the encoding information of the Nth layer, which is the upper layer, is SVC. Pass it to the encoding section.
- the SVC encoding unit encodes the bitstream of the Nth layer in macroblock units by using the encoding information of the (N-1) th layer and the Nth layer (S740), and then determines whether the Nth layer is the highest layer (S750). When encoding is performed up to the highest layer, encoding is terminated, and when it is not the highest layer, scalable video encoding is performed from the lowest layer to the highest layer by repeating steps S740 to S750 by substituting N ⁇ -(N + 1).
- the scalable video encoding method according to the second embodiment embodies the processes S740 to S760 of FIG. 7 when the first reference and the first encoding information are selected and encoded, as shown in FIG. 8.
- multi-track video is aligned according to a first criterion (resolution, frame rate, bit rate), and the encoding information of each layer is extracted.
- up-scaling encoding information of a macroblock of a layer performing video encoding and an upper layer macroblock corresponding to the macroblock is performed to perform inter-layer intra prediction (S810).
- the encoding cost A of the inter-layer intra mode and the encoding cost B of the encoding mode of the layer performing scalable video encoding are respectively calculated (S820).
- the calculated encoding cost is compared (S830).
- the encoding mode is determined as an inter-layer intra mode, and the layer to be encoded is encoded (S840). If A is not less than B, the encoding mode is encoded into the encoding mode of the layer to be encoded. (S850).
- Processes S810 to S850 may be repeated until video of the lowest layer to the highest layer of the multi-track video is sequentially encoded.
- the scalable video encoding method according to the third embodiment selects and encodes the second reference and the second encoding information. As shown in FIG. 9, the scalable video encoding method of the multitrack video is S910 to S970. The process is carried out.
- the original video is input to generate a multitrack video having a different format (S910), and the multitrack video is arranged in image quality order (S920).
- the multi-track video alignment is determined in the image quality order determined by the objective quality evaluation method, and the multi-track video may be arranged in a plurality of layers so that the high quality video is arranged in the upper layer according to the image quality order.
- the scalable video encoding method extracts prediction information including encoding modes, macroblock segmentation information, and motion vector information of the aligned layers as encoding information (S930).
- the encoding mode of the macroblock of the Nth layer is determined using the macroblock segmentation information and the motion information of the Nth layer, which is the layer to be encoded, and the (N-1) th layer, which is the lower layer (S940), and the determined encoding mode is determined. Encoding is performed (S950).
- the scalable video encoding method according to the fourth embodiment embodies the processes S940 to S970 of FIG. 9 when the second reference and the second encoding information are selected and encoded, as shown in FIG. 10.
- the scalable video encoding method determines whether a macroblock of an N th layer, which is a layer to be currently encoded, using motion vector information is a motion prediction mode (S1010), and determines the N th layer of the N th layer. If the macroblock is not the motion prediction mode, the macroblock is encoded in the intra mode (S1020), and if the N-th layer macroblock is the motion prediction mode, the macroblock of the (N-1) th layer is the motion prediction mode. It is determined whether or not (S1030).
- the (N-1) th layer macroblock is not the motion prediction mode, it is encoded in the encoding mode of the Nth layer macroblock (S1040), and when the (N-1) th layer macroblock is the motion prediction mode, it is up.
- the macroblock segmentation information of the scaled (N-1) th layer macroblock and the macroblock segmentation information of the Nth macroblock are compared (S1050).
- the macroblock segmentation information is not the same, it is encoded in the encoding mode of the Nth layer macroblock (S1040). If the macroblock segmentation information is the same, the motion vector and N of the upscaled (N-1) th layer macroblock are N. The motion vector of the first macroblock is compared (S1060).
- the macroblock is encoded in the inter-layer motion prediction mode (S1070). If the motion vectors are not the same, the macroblock is encoded in the encoding mode of the N-th layer macroblock (S1040). ).
- an encoding mode is rapidly performed by utilizing encoding information of the existing multi-track video. By determining this, the encoding speed can be improved.
- the encoding mode of the multi-track video can be extracted at high speed from the encoding information of the existing multi-track video, and the motion prediction time having the highest computational amount can be increased at the time of encoding, thereby improving the conversion speed to scalable video. Can be.
- the present invention can be applied to an encoder for encoding scalable video, an apparatus having an encoder, and a system for transmitting content using the same.
- the present invention may be applied to a content media communication system including a content server serving a content including video and audio or a user terminal for receiving and displaying the content.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
Claims (15)
- 멀티 트랙 비디오(multi-track video)를 설정된 기준에 따라 다수의 레이어(layer)로 정렬하는 비디오 정렬부;각 레이어의 비트스트림을 분석하여 인코딩 모드(encoding mode)와 예측 정보(prediction information)를 포함하는 인코딩 정보를 추출하는 비트스트림 분석부; 및스케일러블 비디오 인코딩을 수행하는 레이어와, 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 상위 또는 하위 레이어의 인코딩 정보를 사용하여 인코딩 모드를 결정하고 결정된 인코딩 모드로 인코딩하는 SVC(Scalable Video Coding) 인코딩부;를 포함하는 스케일러블 비디오 인코딩 장치.
- 제1항에 있어서,상기 비디오 정렬부는, 상기 멀티 트랙 비디오를 해상도가 높은 비디오가 상위에 배치되도록 정렬하고, 해상도가 동일한 비디오가 두 개 이상 존재하는 경우 프레임율(frame rate)이 높은 비디오가 상위에 배치되도록 정렬하고, 상기 해상도 및 프레임율이 동일한 비디오가 두 개 이상 존재하는 경우 비트율(bit rate)이 높은 비디오가 상위에 배치되도록 다수의 레이어로 정렬하는 것을 특징으로 하는 스케일러블 비디오 인코딩 장치.
- 제1항에 있어서,상기 SVC 인코딩부는, 상기 멀티 트랙 비디오의 각 레어이를 매크로블록 단위로 인코딩하고, 상기 다수의 레이어 중 최하위 레이어부터 최상위 레이어까지, 또는 최상위 레이어부터 최하위 레이어까지 순차적으로 스케일러블 비디오 인코딩을 수행하는 것을 특징으로 하는 스케일러블 비디오 인코딩 장치.
- 제1항에 있어서,상기 SVC 인코딩부는, 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 매크로블록과 상기 매크로블록에 대응되는 상위 레이어 매크로블록 각각의 디코딩된 텍스처 정보를 업스케일링(up-scaling)하여 인터 레이어 인트라 예측(Inter-layer Intra prediction)을 수행하고, 인터 레이어 인트라 모드(Inter-layer Intra mode)의 인코딩 코스트(encoding cost)를 계산하고 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 인코딩 모드의 인코딩 코스트를 계산하여 계산된 인스팅 코스트를 비교하고,상기 인터 레이어 인트라 모드의 인코딩 코스트가 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 인코딩 모드의 인코딩 코스트보다 작으면, 상기 인터 레이어 인트라 모드로 인코딩하고, 크거나 같으면, 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 인코딩 모드로 인코딩하는 것을 특징으로 하는 스케일러블 비디오 인코딩 장치.
- 제1항에 있어서,상기 설정된 기준은 객관적 화질 평가 방법에 의해 결정된 화질 순서이며, 상기 비디오 정렬부는 상기 화질 순서에 따라 고화질 비디오가 상위 레이어에 배치되도록 상기 멀티 트랙 비디오를 다수의 레이어로 정렬하는 것을 특징으로 하는 스케일러블 비디오 인코딩 장치.
- 제1항에 있어서,상기 비트스트림 분석부는, 상기 다수의 레이어 중 인접하는 레이어의 비디오 비트스트림 디코딩을 통해 매크로블록 분할 정보와 움직임 벡터 정보를 포함하는 예측 정보(prediction information)를 추출하는 것을 특징으로 하는 스케일러블 비디오 인코딩 장치.
- 제1항에 있어서,상기 SVC 인코딩부는, 상기 멀티 트랙 비디오의 각 레어이를 매크로블록 단위로 인코딩하고, 매크로블록 분할 정보와 움직임 벡터 정보를 이용하여 상기 매크로블록을 인터 레이어 움직임 예측 모드(Intre-layer motion prediction mode), 인트라 모드(Intra mode) 또는 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 인코딩 모드로 인코딩하는 것을 특징으로하는 스케일러블 비디오 코딩 장치.
- 제7항에 있어서,상기 SVC 인코딩부는, 상기 움직임 벡터 정보를 이용하여 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 매크로블록이 움직임 예측 모드인지 판단하는 제1 과정, 상기 매크로블록의 하위 레이어 매크로블록이 움직임 예측 모드인지 판단하는 제2 과정, 업스케일링(up-scaling)된 상기 하위 레이어 매크로블록의 매크로블록 분할 정보와 상기 매크로블록의 매크로블록 분할 정보가 동일한지 판단하는 제3 과정, 및 상기 매크로블록과 상기 하위 레이어 매크로블록의 움직임 벡터가 동일한지 판단하는 제4 과정을 순차적으로 판단하여 상기 인터 레이어 움직임 예측 모드(Intre-layer motion prediction mode), 상기 인트라 모드(Intra mode) 또는 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 인코딩 모드 중 상기 매크로블록의 인코딩 모드를 결정하는 것을 특징으로하는 스케일러블 비디오 코딩 장치.
- (a) 스케일러블 비디오 인코딩 장치에서 멀티 트랙 비디오(multi-track video)를 설정된 기준에 따라 다수의 레이어(layer)로 정렬하는 단계;(b) 상기 인코딩 장치에서 정렬된 각 레이어의 비트스트림을 분석하여 인코딩 모드(encoding mode)와 예측 정보(prediction information)를 포함하는 인코딩 정보를 추출하는 단계; 및(c) 상기 인코딩 장치에서 스케일러블 비디오 인코딩을 수행하는 레이어와 상위 또는 하위 레이어의 인코딩 정보를 사용하여 인코딩 모드를 결정하고 결정된 인코딩 모드로 상기 멀티 트랙 비디오를 인코딩하는 단계;를 포함하는 스케일러블 비디오 인코딩 방법.
- 제9항에 있어서,상기 (a) 단계 이전에, 상기 인코딩 장치에서 영상을 다양한 포맷으로 압축하여 멀티 트랙 비디오를 생성하거나, 또는 상기 인코딩 장치에서 상기 멀티 트랙 비디오를 입력받는 단계;를 더 포함하는 것을 특징으로 하는 스케일러블 비디오 인코딩 방법.
- 제9항에 있어서,상기 (a) 단계는,(a-1) 상기 멀티 트랙 비디오를 해상도가 높은 비디오가 상위 레이어에 배치되도록 정렬하는 단계;(a-2) 해상도에 따라 정렬된 상기 멀티 트랙 비디오 중 해상도가 동일한 비디오가 두 개 이상 존재하는 경우 프레임율(frame rate)이 높은 비디오가 상위 레이어에 배치되도록 재정렬하는 단계; 및(a-3) 프레임율에 따라 재정렬된 상기 멀티 트랙 비디오 중 프레임율이 동일한 비디오가 두 개 이상 존재하는 경우 비트율(bit rate)이 높은 비디오가 상위 레이어에 배치되도록 재정렬하는 단계;를 포함하여 구성되는 스케일러블 비디오 인코딩 방법.
- 제9항에 있어서,상기 (c) 단계는,(c-1) 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 매크로블록과 상기 매크로블록에 대응되는 상위 레이어 매크로블록의 인코딩 정보를 업스케일링(up-scaling)하여 인터 레이어 인트라 예측(Inter-layer Intra prediction)을 수행하는 단계;(c-2) 인터 레이어 인트라 모드(Inter-layer Intra mode)의 인코딩 코스트(encoding cost)와 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 인코딩 모드의 인코딩 코스트를 각각 계산하는 단계;(c-3) 상기 계산된 인코딩 코스트를 비교하여 인코딩 모드를 상기 인터 레이어 인트라 모드 또는 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 인코딩 모드 중 결정하는 단계; 및(c-4) 상기 결정된 인코딩 모드로 상기 스케일러블 비디오 인코딩을 수행하는 레이어 매크로블록을 인코딩하는 단계;를 포함하여 구성되며,상기 (c-1) 단계 내지 (c-4) 단계는 상기 멀티 트랙 비디오의 최하위 레이어부터 최상위 레이어의 비디오가 순차적으로 인코딩 완료될 때까지 반복되는 것을 특징으로 하는 스케일러블 비디오 인코딩 방법.
- 제12항에 있어서,상기 (c-3) 단계에서,상기 계산된 인터 레이어 인트라 모드의 인코딩 코스트가 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 인코딩 모드의 인코딩 코스트보다 작으면, 상기 인코딩 모드를 인터 레이어 인트라 모드로 결정하고, 크거나 같으면 상기 인코딩 모드를 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 인코딩 모드로 결정하는 것을 특징으로 하는 스케일러블 비디오 인코딩 방법.
- 제9항에 있어서,상기 (a) 단계에서,상기 설정된 기준은 객관적 화질 평가 방법에 의해 결정된 화질 순서이며, 상기 화질 순서에 따라 고화질 비디오가 상위 레이어에 배치되도록 상기 멀티 트랙 비디오를 다수의 레이어로 정렬하는 것을 특징으로 하는 스케일러블 비디오 인코딩 방법.
- 제9항에 있어서,상기 (c) 단계는,(c-1) 움직임 벡터 정보를 이용하여 상기 스케일러블 비디오 인코딩을 수행하는 레이어의 매크로블록이 움직임 예측 모드(motion prediction mode)인지 판단하는 단계;(c-2) 상기 매크로블록이 움직임 예측 모드인 경우 (c-3) 단계를 수행하고, 상기 매크로블록이 움직임 예측 모드가 아닌 경우 인트라 모드(Intra mode)로 상기 매크로블록을 인코딩하는 단계;(c-3) 상기 매크로블록의 하위 레이어 매크로블록이 움직임 예측 모드인지 판단하는 단계;(c-4) 상기 하위 레이어 매크로블록이 움직임 예측 모드인 경우 (c-5) 단계를 수행하고, 상기 하위 레이어 매크로블록이 움직임 예측 모드가 아닌 경우 상기 매크로블록의 인코딩 모드로 인코딩하는 단계;(c-5) 업스케일링(up-scaling)된 상기 하위 레이어 매크로블록의 매크로블록 분할 정보와 상기 매크로블록의 매크로블록 분할 정보를 비교하는 단계;(c-6) 상기 매크로블록 분할 정보가 동일하면 (c-7) 단계를 수행하고, 상기 매크로블록 분할 정보가 동일하지 않으면 상기 매크로블록의 인코딩 모드로 인코딩하는 단계;(c-7) 업스케일링된 상기 하위 레이어 매크로블록의 움직임 벡터와 상기 매크로블록의 움직임 벡터를 비교하는 단계;(c-8) 상기 움직임 벡터가 동일하면 인터 레이어 움직임 예측 모드(Inter-layer motion prediction mode)로 상기 매크로블록을 인코딩하고, 상기 움직임 벡터가 동일하지 않으면 상기 매크로블록의 인코딩 모드로 인코딩하는 단계;를 포함하여 구성되며,상기 (c-2) 단계, (c-4) 단계, (c-6) 단계 및 (c-8) 단계에서 상기 매크로블록의 인코딩이 완료되면, 다음 매크로블록에 대하여 (c-1) 단계부터 반복하여 수행하며, 최상위 레이어부터 최하위 레이어의 매크로블록까지 순차적으로 인코딩하는 것을 특징으로 하는 스케일러블 비디오 인코딩 방법.
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201280024116.8A CN103548353B (zh) | 2011-04-15 | 2012-04-16 | 使用多轨视频的高速可伸缩视频编码装置和方法 |
JP2014505084A JP5557265B1 (ja) | 2011-04-15 | 2012-04-16 | マルチトラックビデオを用いた高速スケーラブル・ビデオ・コーディング装置及び方法 |
US14/111,834 US9083949B2 (en) | 2011-04-15 | 2012-04-16 | High speed scalable video coding device and method using multi-track video |
EP12771035.8A EP2698995A4 (en) | 2011-04-15 | 2012-04-16 | DEVICE AND METHOD FOR HIGH SPEED EVOLVING VIDEO ENCODING USING MULTIPISTE VIDEO |
US14/797,471 US20150319443A1 (en) | 2011-04-15 | 2015-07-13 | High speed scalable video coding device and method using multi-track video |
US14/884,967 US20160037169A1 (en) | 2011-04-15 | 2015-10-16 | High speed scalable video coding device and method using multi-track video |
US14/884,943 US10750185B2 (en) | 2011-04-15 | 2015-10-16 | High speed scalable video coding device and method using multi-track video |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR10-2011-0035110 | 2011-04-15 | ||
KR1020110035110A KR101853744B1 (ko) | 2011-04-15 | 2011-04-15 | 멀티트랙 비디오를 스케일러블 비디오로 인코딩하는 방법 및 장치 |
KR10-2011-0047638 | 2011-05-20 | ||
KR1020110047638A KR101594411B1 (ko) | 2011-05-20 | 2011-05-20 | 고속 움직임 예측을 이용하여 멀티트랙 비디오를 스케일러블 비디오로 인코딩하는 방법 및 장치 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/111,834 A-371-Of-International US9083949B2 (en) | 2011-04-15 | 2012-04-16 | High speed scalable video coding device and method using multi-track video |
US14/797,471 Continuation US20150319443A1 (en) | 2011-04-15 | 2015-07-13 | High speed scalable video coding device and method using multi-track video |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2012141551A2 true WO2012141551A2 (ko) | 2012-10-18 |
WO2012141551A3 WO2012141551A3 (ko) | 2013-03-07 |
Family
ID=47009877
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2012/002861 WO2012141551A2 (ko) | 2011-04-15 | 2012-04-16 | 멀티 트랙 비디오를 사용한 고속 스케일러블 비디오 코딩 장치 및 방법 |
Country Status (5)
Country | Link |
---|---|
US (4) | US9083949B2 (ko) |
EP (4) | EP3021587A3 (ko) |
JP (1) | JP5557265B1 (ko) |
CN (1) | CN103548353B (ko) |
WO (1) | WO2012141551A2 (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103248622A (zh) * | 2013-04-09 | 2013-08-14 | 中国科学院计算技术研究所 | 一种自动伸缩的在线视频服务质量保障方法及*** |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP6360154B2 (ja) * | 2013-04-05 | 2018-07-18 | ヴィド スケール インコーポレイテッド | 多重レイヤビデオコーディングに対するインターレイヤ基準画像エンハンスメント |
US10390087B2 (en) * | 2014-05-01 | 2019-08-20 | Qualcomm Incorporated | Hypothetical reference decoder parameters for partitioning schemes in video coding |
Family Cites Families (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6907070B2 (en) * | 2000-12-15 | 2005-06-14 | Microsoft Corporation | Drifting reduction and macroblock-based control in progressive fine granularity scalable video coding |
US20020126759A1 (en) * | 2001-01-10 | 2002-09-12 | Wen-Hsiao Peng | Method and apparatus for providing prediction mode fine granularity scalability |
CN1774930A (zh) | 2003-04-17 | 2006-05-17 | 皇家飞利浦电子股份有限公司 | 视频转码 |
KR100586883B1 (ko) * | 2004-03-04 | 2006-06-08 | 삼성전자주식회사 | 비디오 스트리밍 서비스를 위한 비디오 코딩방법, 프리디코딩방법, 비디오 디코딩방법, 및 이를 위한 장치와, 이미지 필터링방법 |
US20050195896A1 (en) * | 2004-03-08 | 2005-09-08 | National Chiao Tung University | Architecture for stack robust fine granularity scalability |
US20060023748A1 (en) * | 2004-07-09 | 2006-02-02 | Chandhok Ravinder P | System for layering content for scheduled delivery in a data network |
JP5017825B2 (ja) * | 2005-09-15 | 2012-09-05 | ソニー株式会社 | 復号装置および復号方法 |
EP2372922A1 (en) * | 2006-03-29 | 2011-10-05 | Vidyo, Inc. | System and method for transcoding between scalable and non-scalable video codecs |
US8320450B2 (en) * | 2006-03-29 | 2012-11-27 | Vidyo, Inc. | System and method for transcoding between scalable and non-scalable video codecs |
CA2647723A1 (en) * | 2006-03-29 | 2007-10-11 | Vidyo, Inc. | System and method for transcoding between scalable and non-scalable video codecs |
US7535383B2 (en) * | 2006-07-10 | 2009-05-19 | Sharp Laboratories Of America Inc. | Methods and systems for signaling multi-layer bitstream data |
US7991236B2 (en) * | 2006-10-16 | 2011-08-02 | Nokia Corporation | Discardable lower layer adaptations in scalable video coding |
US7907789B2 (en) * | 2007-01-05 | 2011-03-15 | Freescale Semiconductor, Inc. | Reduction of block effects in spatially re-sampled image information for block-based image coding |
CA2674438C (en) | 2007-01-08 | 2013-07-09 | Nokia Corporation | Improved inter-layer prediction for extended spatial scalability in video coding |
KR100809354B1 (ko) | 2007-02-02 | 2008-03-05 | 삼성전자주식회사 | 복원된 프레임의 프레임율을 업컨버팅하는 장치 및 방법 |
US8121191B1 (en) | 2007-11-13 | 2012-02-21 | Harmonic Inc. | AVC to SVC transcoder |
KR101144498B1 (ko) * | 2008-04-28 | 2012-05-15 | 에스케이플래닛 주식회사 | 스케일러블 비디오 제공 시스템 및 스케일러블 비디오 제공방법 |
KR101012204B1 (ko) * | 2008-04-24 | 2011-02-08 | 에스케이 텔레콤주식회사 | 스케일러블 비디오의 스트리밍 재생 시스템 및 스케일러블비디오의 스트리밍 재생 방법 |
US20100232521A1 (en) * | 2008-07-10 | 2010-09-16 | Pierre Hagendorf | Systems, Methods, and Media for Providing Interactive Video Using Scalable Video Coding |
IT1394245B1 (it) * | 2008-09-15 | 2012-06-01 | St Microelectronics Pvt Ltd | Convertitore per video da tipo non-scalabile a tipo scalabile |
KR20100072722A (ko) * | 2008-12-22 | 2010-07-01 | 삼성전자주식회사 | 스케일러블 변조 및 복조를 위한 영상데이터를 송수신하는 장치 및 방법 |
US9036705B2 (en) | 2009-03-13 | 2015-05-19 | Telefonaktiebolaget L M Ericsson (Publ) | Technique for bringing encoded data items into conformity with a scalable coding protocol |
US9549190B2 (en) | 2009-10-01 | 2017-01-17 | Sk Telecom Co., Ltd. | Method and apparatus for encoding/decoding image using variable-size macroblocks |
WO2011081643A2 (en) * | 2009-12-14 | 2011-07-07 | Thomson Licensing | Merging encoded bitstreams |
-
2012
- 2012-04-16 CN CN201280024116.8A patent/CN103548353B/zh active Active
- 2012-04-16 EP EP15194417.0A patent/EP3021587A3/en not_active Ceased
- 2012-04-16 EP EP15193360.3A patent/EP3007445A1/en not_active Withdrawn
- 2012-04-16 JP JP2014505084A patent/JP5557265B1/ja active Active
- 2012-04-16 EP EP12771035.8A patent/EP2698995A4/en not_active Withdrawn
- 2012-04-16 US US14/111,834 patent/US9083949B2/en active Active
- 2012-04-16 EP EP15193340.5A patent/EP3001683A3/en not_active Withdrawn
- 2012-04-16 WO PCT/KR2012/002861 patent/WO2012141551A2/ko active Application Filing
-
2015
- 2015-07-13 US US14/797,471 patent/US20150319443A1/en not_active Abandoned
- 2015-10-16 US US14/884,967 patent/US20160037169A1/en not_active Abandoned
- 2015-10-16 US US14/884,943 patent/US10750185B2/en active Active
Non-Patent Citations (2)
Title |
---|
None |
See also references of EP2698995A4 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103248622A (zh) * | 2013-04-09 | 2013-08-14 | 中国科学院计算技术研究所 | 一种自动伸缩的在线视频服务质量保障方法及*** |
CN103248622B (zh) * | 2013-04-09 | 2016-02-24 | 中国科学院计算技术研究所 | 一种自动伸缩的在线视频服务质量保障方法及*** |
Also Published As
Publication number | Publication date |
---|---|
EP2698995A4 (en) | 2014-09-03 |
US20160037169A1 (en) | 2016-02-04 |
US10750185B2 (en) | 2020-08-18 |
US20160037168A1 (en) | 2016-02-04 |
JP5557265B1 (ja) | 2014-07-23 |
WO2012141551A3 (ko) | 2013-03-07 |
US9083949B2 (en) | 2015-07-14 |
US20140226723A1 (en) | 2014-08-14 |
JP2014520413A (ja) | 2014-08-21 |
CN103548353B (zh) | 2015-08-19 |
EP3001683A3 (en) | 2016-04-06 |
EP2698995A2 (en) | 2014-02-19 |
EP3007445A1 (en) | 2016-04-13 |
EP3001683A2 (en) | 2016-03-30 |
US20150319443A1 (en) | 2015-11-05 |
EP3021587A2 (en) | 2016-05-18 |
EP3021587A3 (en) | 2016-12-28 |
CN103548353A (zh) | 2014-01-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101354833B1 (ko) | 디지털 비디오의 가변 해상도 인코딩 및 디코딩 기법 | |
WO2013141671A1 (ko) | 인터 레이어 인트라 예측 방법 및 장치 | |
CN102918836A (zh) | 用于非对称立体视频的帧封装 | |
WO2012141551A2 (ko) | 멀티 트랙 비디오를 사용한 고속 스케일러블 비디오 코딩 장치 및 방법 | |
WO2011068332A2 (ko) | 공간적 예측장치 및 그 예측방법, 그것을 이용한 영상 부호화 장치 및 방법, 및 영상 복호화 장치 및 방법 | |
CN113316932B (zh) | 编解码方法、装置和设备 | |
CN101888545B (zh) | 一种低码率信源压缩编码方法 | |
KR101096802B1 (ko) | 비디오 스트림 혼합장치 및 그 방법 | |
KR101853744B1 (ko) | 멀티트랙 비디오를 스케일러블 비디오로 인코딩하는 방법 및 장치 | |
WO2010027142A2 (ko) | 다시점 비디오의 송수신 시스템 및 방법 | |
KR101594411B1 (ko) | 고속 움직임 예측을 이용하여 멀티트랙 비디오를 스케일러블 비디오로 인코딩하는 방법 및 장치 | |
CN101202907A (zh) | 频道切换装置及方法 | |
KR101990098B1 (ko) | 멀티트랙 비디오를 스케일러블 비디오로 인코딩하는 방법 및 장치 | |
WO2018016879A1 (ko) | 360도 가상현실 방송 서비스 제공 방법 및 장치 | |
KR101875853B1 (ko) | 고속 움직임 예측을 이용하여 멀티트랙 비디오를 스케일러블 비디오로 인코딩하는 방법 및 장치 | |
WO2015102462A1 (ko) | 다중 영상의 단일 비트 스트림 생성방법 및 생성장치 | |
KR101834531B1 (ko) | 멀티트랙 비디오를 스케일러블 비디오로 인코딩하는 방법 및 장치 | |
KR101749613B1 (ko) | 고속 움직임 예측을 이용하여 멀티트랙 비디오를 스케일러블 비디오로 인코딩하는 방법 및 장치 | |
KR101626757B1 (ko) | 고속 움직임 예측을 이용하여 멀티트랙 비디오를 스케일러블 비디오로 인코딩하는 방법 및 장치 | |
KR101625441B1 (ko) | 고속 움직임 예측을 이용하여 멀티트랙 비디오를 스케일러블 비디오로 인코딩하는 방법 및 장치 | |
KR20180080167A (ko) | 고속 움직임 예측을 이용하여 멀티트랙 비디오를 스케일러블 비디오로 인코딩하는 방법 및 장치 | |
WO2020122308A1 (ko) | 다시점 영상 정보의 복호화 방법 및 장치 | |
Yuan et al. | Color space compatible coding framework for YUV422 video coding | |
WO2016108317A1 (ko) | 오류에 강인한 압축 영역에서의 영상 합성 방법, 장치 및 시스템 | |
WO2010143858A2 (ko) | 대용량의 영상을 효율적으로 전송하는 영상 부호화 장치 및 영상 복호화 장치 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 12771035 Country of ref document: EP Kind code of ref document: A2 |
|
ENP | Entry into the national phase |
Ref document number: 2014505084 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14111834 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2012771035 Country of ref document: EP |