WO2011004598A1 - 動画像符号化方法、装置、プログラム、および集積回路 - Google Patents
動画像符号化方法、装置、プログラム、および集積回路 Download PDFInfo
- Publication number
- WO2011004598A1 WO2011004598A1 PCT/JP2010/004431 JP2010004431W WO2011004598A1 WO 2011004598 A1 WO2011004598 A1 WO 2011004598A1 JP 2010004431 W JP2010004431 W JP 2010004431W WO 2011004598 A1 WO2011004598 A1 WO 2011004598A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- slice
- picture
- encoding
- region
- inter
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/174—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/107—Selection of coding mode or of prediction mode between spatial and temporal predictive coding, e.g. picture refresh
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
Definitions
- the present invention relates to a moving picture coding method and a moving picture coding apparatus.
- an MPEG (Moving Picture Experts Group) -4 AVC method also known as ITU-T H.264 method
- the present invention relates to a moving picture coding method and a moving picture coding apparatus.
- multimedia In recent years, a multimedia era has been reached in which voice, images, and other pixel values are handled in an integrated manner.
- Conventional information media such as newspapers, magazines, televisions, radios, and telephones, have a means for transmitting information to people. , Has been picked up as a multimedia subject.
- multimedia refers to not only characters but also figures or sounds, particularly images, etc., being simultaneously associated with each other.
- it is an essential condition to represent the information in a digital format.
- the information amount of each information medium is estimated as a digital information amount
- the information amount per character is 1 to 2 bytes.
- an amount of information of 64 Kbits (telephone quality) per second is required for audio, and 100 Mbits (current television reception quality) per second is required for moving images. Therefore, it is not realistic to handle the enormous amount of information in digital form as it is with the information media.
- videophones have been put into practical use by an integrated services digital network (ISDN) having a transmission rate of 64 Kbit / s to 1.5 Mbit / s.
- ISDN integrated services digital network
- MPEG Motion Picture Experts Group
- ISO / IEC International Electrotechnical Commission, International Electrotechnical Commission
- MPEG-1 is a standard for compressing a moving picture signal to 1.5 Mbit / s, that is, information of a television signal to about 1/100.
- the target quality is a medium quality, that is, a quality that can be realized at a transmission speed of mainly about 1.5 Mbit / s, so that the demand for higher image quality is satisfied. Therefore, MPEG-2 was standardized. In MPEG-2, moving picture signals are compressed at 2 to 15 Mbit / s to realize TV broadcast quality.
- MPEG-4 has been standardized by a working group (ISO / IEC JTC1 / SC29 / WG11) that has been standardizing with MPEG-1 and MPEG-2.
- This MPEG-4 achieves a compression ratio higher than that of MPEG-1 and MPEG-2, and enables encoding / decoding / operation in units of objects, thereby realizing new functions necessary in the multimedia era.
- MPEG-4 achieves higher compression ratios than MPEG-1 and MPEG-2, and enables encoding, decoding and manipulation on an object basis.
- the image signal can be considered as a series of pictures (also referred to as frames or fields) that are sets of pixels at the same time.
- the pixels have a strong correlation with neighboring pixels in the picture
- compression using the correlation of the pixels in the picture is performed.
- the correlation between pixels is strong between two (a plurality of) consecutive pictures
- compression using the correlation between pixels is also performed between these pictures.
- inter coding compression using the correlation of pixels between a plurality of pictures and the correlation of pixels within a picture
- Compression using the correlation of pixels is called intra coding. Since this inter coding uses correlation between pictures, a compression rate higher than the compression rate in intra coding can be realized.
- a block (or a higher-level conceptual block in which a plurality of blocks are combined) is a set of pixels in a two-dimensional rectangular area. Macro block), and intra coding and inter coding can be switched in units of blocks.
- a slice which is a coding unit in which a plurality of blocks are grouped, is defined.
- a slice is the smallest unit that can be independently encoded and decoded, and can be decoded in units of slices even if a part of the stream is lost.
- FIG. 22 is a diagram for explaining the relationship between a slice S and a block MB (macroblock) when the MPEG standard slice division method is used.
- a picture P (one frame) shown in FIG. 22 is composed of a plurality of blocks MB (macroblocks).
- the blocks MB in the same row constitute one slice S. That is, the slice S is composed of a plurality of blocks MB included in the row of the slice S.
- the picture P has a plurality of rows each composed of one slice S.
- the hatched slice S is an I slice IS
- each other slice is a P slice PSm.
- the I slice IS is a slice composed of only intra-coded blocks.
- the P slice PSm is a slice composed of inter-coded blocks.
- the slice S must be composed of only blocks in the same row (only blocks in one row).
- the slice S can be configured with a plurality of rows.
- one picture can include two types of slices of an I slice and a P slice at the same time.
- an I slice refers to a slice that is encoded using only the correlation of pixels in the slice.
- the P slice means a slice that is encoded using the pixel correlation in the slice and the pixel correlation between the slices.
- “between slices” means between the slice and another slice other than the slice.
- the slice other than the slice may be a slice included in another picture different from the picture including the slice.
- an I slice contains only slices that do not use predictive coding (based on the image signal) from the surrounding (outside of the slice) image, that is, intra macroblocks that are intra-coded. It is a collected slice.
- the P slice is a slice in which compression efficiency is improved by predictive coding, that is, a slice configured by mixing inter macroblocks that are inter-coded and intra macroblocks.
- the I slice in this specification includes the following slices. That is, in this specification, a special P slice intentionally coded using only the correlation of pixels in the slice is also referred to as an I slice for convenience.
- FIG. 23 is a diagram for explaining the coding order of a plurality of blocks in the picture P.
- the blocks MB in the picture P shown in FIG. 22 are encoded in the order shown in FIG. 23, that is, in the picture P from left to right in the slice unit and from top to bottom in the slice unit. A stream is generated.
- the decoded pixel of the picture may not be correctly decoded. For example, even if an erasure occurs in the stream, when the next picture of the picture that has been lost and the image quality has deteriorated is decoded, if the next picture is intra-encoded, it is intra-encoded. Only a stream of slices that are present (based only on) can correctly decode the pixels. However, if the next picture is inter-coded when decoding the picture following the picture whose image quality has deteriorated due to loss, the next picture is the picture that was decoded immediately before, that is, the loss of the stream.
- the decoding is performed using the correlation with the picture with degraded image quality (refer to the picture decoded immediately before), the decoding process is correctly performed on all slices in the next picture of the lost stream. Even so, the original pixel value cannot be correctly decoded.
- the transmission delay time of the transmission bit rate smoothing apparatus is as long as several pictures to ten or more pictures, and it is appropriate to use the transmission bit rate smoothing apparatus for the purpose of transmitting an image signal with a low delay time. Absent. Therefore, by performing the encoding that makes the number of bits in units of pictures substantially constant by the method described below, low delay is realized and image quality degradation is prevented from recursively propagating.
- FIG. 24 is a diagram showing an example of slice division of pictures ((a) to (l)) that are continuous in time order.
- the hatched slice is the I slice IS as in FIG. 22, and the other slices are the P slice PSm.
- slices are in units of rows, as in the previous example.
- (a) to (l) in FIG. 24 are a plurality of pictures that are continuous in time order. That is, in FIG. 24, (a) is the first picture in time order, and (l) is the last picture in time order. In FIG. 24, the position of the I slice IS moves down one row in the next picture in time order, and returns to the highest row after moving to the lowest row (from (j) in FIG. 24 ( k)).
- a picture P is composed of an I slice IS that is strong against stream loss and a P slice PSm that is weak against stream loss but includes inter coding with a good compression rate, and the position of I slice IS (set position). )
- the picture P in time order.
- the slice at the position of the P slice PSm where the stream disappeared becomes the I slice IS in the subsequent picture in time order.
- the picture P is correctly decoded. That is, a stream with image degradation can be recovered. Therefore, it is possible to prevent indefinite propagation of image quality degradation.
- FIG. 25 is a diagram for explaining the conventional image quality degradation that occurs when the motion search range is not restricted.
- the I slice IS circulates to stop the propagation of the image quality deterioration (the picture is refreshed). Since the I slice IS is moving from top to bottom, the pictures are refreshed in order from the top slice.
- the picture can be correctly decoded at the position of the I slice IS and the pixel above the I slice IS.
- image quality degradation in the pixels below the I slice IS. That is, it is assumed that there is image quality deterioration in a pixel below the I slice IS in the picture N in which the I slice IS has not been decoded yet after the image quality deterioration caused by the transmission error.
- This area where the propagation of image quality degradation due to the I slice stops is called a refresh completion area RR (see FIG. 25), and an area that has not been encoded (decoded) in the I slice yet has image quality degradation is an unrefreshed area.
- the refresh completion area RR is an area composed of an I slice IS and each slice above the I slice IS.
- “above the I slice IS” is a position in the direction opposite to the traveling direction of the position encoded with the I slice IS (position where the I slice IS is set) with respect to the I slice IS.
- the unrefreshed area NR is an area composed of each slice below the I slice IS.
- below the I slice IS is a position in the advancing direction of the position encoded by the I slice IS with respect to the I slice IS.
- an encoding target block C (picture N + 1 in FIG. 25) and a comparison target picture (picture N in FIG. 25) are encoded.
- the pixel block is compared, and the difference from the pixel value at the position where the correlation between the pixels is the largest is encoded in block units. Searching for a position where the correlation between the pixels is large is called motion search.
- motion search in the reference destination picture (picture N), the range of the position of the block searched for is called a motion search range.
- the decoding device refers to the pixel value without image quality degradation due to a transmission error, and therefore decodes the inter-coded pixel. There is no degradation in image quality even when decoding.
- the encoding target block of the picture N + 1 is a block (encoding target block C1) in the refresh completion area RR
- encoding is performed with reference to the unrefreshed area NR of the picture N. That is, in this case, since the decoding apparatus cannot decode the block in the subsequent picture (see picture N + 2) by I slice (intra coding), a transmission error occurs in the decoding of the block and the block. This does not solve the image quality degradation caused by the problem. That is, when a reference picture (picture N) is referred to a block in an unrefreshed area NR from a block in the refresh completion area RR of the picture to be encoded (picture N + 1), the image quality deterioration is propagated. Occurs.
- FIG. 26 is a diagram showing processing when the search range is restricted.
- Patent Document 1 As such a conventional technique, for example, one described in Patent Document 1 is known.
- the motion search range is dynamically limited so that the motion search range does not include the unrefreshed region NR in the motion search in the coding of the block of the refresh completion region RR.
- the conventional encoding method has a problem that the control becomes complicated. For example, the size of the motion search range changes depending on the position, and the time for the motion search process changes. This complicates the control of the pipeline processing for motion search and requires a complicated circuit. As a result, the processing speed decreases and high-resolution data cannot be processed at a necessary speed. For example, high definition data cannot be processed properly.
- the present invention solves the above-described conventional problems, and provides a moving image encoding apparatus, method, and the like that prevent error propagation without dynamically limiting a motion search range and without referring to an unrefreshed area.
- the purpose is to provide.
- prevention of error propagation from the unrefreshed area to the refreshed area can be realized by a simple process, and thus can be realized by an apparatus having a simple configuration.
- an object of the present invention is to provide a device that can appropriately process even high-resolution data such as high-definition data.
- the encoding method of the present invention includes an I slice and a P slice in one picture, and the position of the included I slice in the picture is in the vertical direction of the picture for each picture.
- a moving image coding method for moving wherein the first P slice is included in a first region adjacent to the I slice, the first region being adjacent to the vertical direction of movement.
- a first encoding step for inter-encoding the image without using a motion vector and a second P slice included in a second region other than the first region are encoded using a motion vector.
- a second encoding step are encoded using a motion vector.
- including an I slice and a P slice in one picture means that the same picture includes an I slice and a P slice, and the picture including the I slice is the same as the picture including the P slice. Means that.
- the motion search function is prohibited in the P slice at the position above the I slice as shown in FIG. 5 without performing complicated processing of dynamically limiting the motion search range.
- the first P slice area is the lowermost part of the refresh completion area adjacent to the I slice in the direction opposite to the movement direction.
- a motion vector is not used, and an image at the same position that does not take motion into consideration is used. This prevents a reference to an unrefreshed region (region adjacent to the I slice from the direction of movement) in the reference destination picture.
- FIG. 1 is a block diagram of a moving picture coding apparatus according to the first embodiment.
- FIG. 2 is an explanatory diagram of the relationship between the slice division method and the reference in the first embodiment.
- FIG. 3 is an operation explanatory diagram of slice division and slice type determination in the first embodiment.
- FIG. 4 is a flowchart for explaining the operation in the first embodiment.
- FIG. 5 is a diagram for explaining restrictions on the motion search range of the present invention.
- FIG. 6 is a flowchart of processing according to the type of slice.
- FIG. 7 is a block diagram of the moving picture coding apparatus according to the second embodiment.
- FIG. 8 is a diagram illustrating an example of a difference in the number of I slice insertions according to the second embodiment.
- FIG. 9 is a flowchart of the video encoding apparatus 1A.
- FIG. 10 is a flowchart of the slice insertion number setting unit in the second embodiment.
- FIG. 11 is an explanatory diagram of the relationship between the slice division method and the reference according to the third embodiment.
- FIG. 12 is an explanatory diagram of operations for slice division and slice type determination in the third exemplary embodiment.
- FIG. 13 is a diagram for explaining the limitation of the motion search range in the third embodiment.
- FIG. 14 is an explanatory diagram of a recording medium for storing a program for realizing the moving picture coding apparatus according to each embodiment by a computer system.
- FIG. 15 is an explanatory diagram of a recording medium for storing a program for realizing the moving picture coding apparatus according to each embodiment by a computer system.
- FIG. 16 is an explanatory diagram of a recording medium for storing a program for realizing the moving picture coding apparatus according to each embodiment by a computer system.
- FIG. 17 is a diagram illustrating a plurality of pictures.
- FIG. 18 is a diagram illustrating a plurality of pictures.
- FIG. 19 is a diagram illustrating a plurality of NoMC-P slices.
- FIG. 20 is a diagram illustrating a plurality of pictures.
- FIG. 21 is a diagram illustrating a plurality of pictures.
- FIG. 22 is an explanatory diagram of the relationship between MPEG slices and blocks.
- FIG. 23 is an explanatory diagram of the coding order of a plurality of blocks in a picture.
- FIG. 24 is a diagram illustrating an example of slice division of pictures that are temporally consecutive.
- FIG. 25 is a diagram for explaining a case where the conventional motion search range is not restricted.
- FIG. 26 is a diagram for explaining the limitation of the conventional motion search range.
- FIG. 27 is a block diagram of a video encoding apparatus.
- FIG. 28 is a block diagram of the video encoding apparatus.
- FIG. 29 is a flowchart of the video encoding apparatus.
- FIG. 30 is a flowchart of the video encoding apparatus.
- the moving image encoding method includes an I slice (I slice PR2, I slice of FIG. 2) in one picture (picture PS to be encoded in FIG. 5, reference picture PR, subsequent picture PT, and the like). 41) and a P slice (a slice of the encoding target area PSA, the NoMC-P slice 42 and the MC-P slice 43 in FIG. 2), and the position (I slice is set in the picture) of the included I slice.
- the first P slice (NoMC-P slice 42) included in the first region R1 that overlaps the unrefreshed region PR3 is used without using a motion vector.
- First encoding step (inter-screen encoding) (step S3001: NoMC-P) and second region (second region) other than the first region (first P slice region) R2, second P slice area, MC-P slice
- step S3001: NoMC-P inter-screen encoding
- step S3001: NoMC-P second region
- step S3001: NoMC-P second region
- step S3001: NoMC-P second region
- step S3001: NoMC-P slice second region
- second region other than the first region (first P slice region) R2
- second P slice area MC-P slice
- the second P slice (MC-P slice 44, MC-P slice 43) included in the SC 44 and the MC-P slice 43 (MC-P slice 43x)) using the motion vector.
- a second encoding step (S3001: MC ⁇ ) (inter-screen encoding is performed with reference to the second predicted image searched from the search range (at the position specified by the motion vector obtained by the search)) (P1) is a moving picture encoding method (A1).
- first region R1 is also referred to as a first P slice region as appropriate
- second region R2 is also referred to as a second P slice region as appropriate.
- the first region R1 is a region formed of a predetermined range above the I slice 41 of the picture PS to be encoded (for example, (e) in FIG. 2). . This range will be described in more detail later.
- a reference destination picture (FIG. 2D, picture) is compared with a block included in a picture PS to be encoded (for example, FIG. 2E). Encoding is performed with reference to an image (second predicted image) searched from the search range of (PR).
- the encoding is performed only when the block is not a block of the NoMC-P slice 42 in the first region R1, and the NoMC-P slice 42 If it is a block, it is not performed (first inter-frame encoding unit 191 in FIG. 27, step Sa1 in FIG. 29).
- an image (first predicted image) at the same position as that block position in the reference picture PR ((d) in FIG. 2). ) Is referred to.
- the encoding is not performed and the NoMC Only when the block is the block of the -P slice 42, the encoding is performed (first inter-frame encoding unit 191, step Sa1).
- the first predicted image at the same position is simply used for a plurality of different blocks (blocks B1, B2, etc. in FIG. 5) in the first region.
- the search is avoided.
- a search in a plurality of different search ranges can be avoided, and a complicated circuit is not required, so that the configuration can be simplified and the processing can be performed at high speed.
- the first P slice area is a specific area described later.
- the specific area starts from an unrefreshed area (unrefreshed area PR3) in a reference picture (referenced picture PR) among refresh completed areas (refresh completed area PS1) in the target picture (target picture PS).
- the vertical width of the first P slice region has a predetermined size greater than zero.
- the direction of movement in the vertical direction and the opposite direction are directions from the I slice (I slice PR2) to the refresh completion region. Adjacent in the reverse direction is adjacent to the I slice from the side in that direction.
- the slice image of that area is displayed between screens without using motion vectors. Encoded.
- the image encoding method specifically includes, for example, a third code that inter-codes a P slice included in the first area (first P slice area) using a motion vector. Further when the I slice is repeatedly inserted (S41: YES, S4000: YES, S2005A, S2005C, a predetermined number of times (predetermined number) or more) ), When the P slice included in the first region (first P slice region) is inter-coded using a motion vector and an I slice is inserted only a predetermined number of times (a predetermined number of times) (S41).
- P slices included in the first area are motion vectors It is not inter-picture encoding or the moving picture coding method of performing.
- repeatedly inserting means to insert a number greater than or equal to the threshold value, and inserting only (only) a predetermined number of times means inserting only a number less than the threshold value.
- a video encoding apparatus is an apparatus that executes the above-described video encoding method.
- One picture includes an I slice and a P slice, and the position of the included I slice in the picture is:
- a moving picture coding apparatus (moving picture coding apparatus 1) that moves in the vertical direction of a picture for each picture, the first area adjacent to the I slice, and the direction of movement in the vertical direction
- the first P slice included in the first region adjacent in the reverse direction is inter-coded without using a motion vector, and the second P slice included in the second region other than the first region is encoded.
- a slice type determination unit (slice type setting unit 103, setting unit 103a, Sa0b) that determines a slice type so as to perform inter-frame coding using a motion vector, and a first P slice of the first region
- a first inter-frame coding unit (reference image duplicating unit 2003) that performs inter-frame coding without using a motion vector, and a second P slice in the second region by using a motion vector.
- This is a moving picture encoding apparatus including a second inter-screen encoding unit (search unit 2002a).
- the block may be determined whether the block is a block of the NoMC-P slice 42 in the first region R1.
- the second screen encoding unit is controlled to encode the block, and when the block is determined to be the block of the NoMC-P slice 42 May be controlled to be encoded by the first screen encoding unit (setting unit 103a, step Sa0b in FIG. 30).
- the moving picture encoding apparatus includes a slice insertion number setting unit (slice insertion number setting unit 105, Sa0a) that determines whether or not the number of insertions of an I slice is a predetermined value or more, and the slice type determination unit includes: When it is determined that the number of insertions is less than a predetermined value based on the setting of the number of slice insertions (S41: NO, S4000: NO, if less than the predetermined number in S2005C), the first area (first P slice area) , And the second area (second P slice area) are used, and when it is determined to be equal to or greater than a predetermined value (S41: YES, S4000: YES, S2005C if S2005C is equal to or greater than the predetermined number) Only the second area (second P slice area) may be used.
- a slice insertion number setting unit slice insertion number setting unit 105, Sa0a
- the slice type determination unit includes: When it is determined that the number of insertions
- the above-described A1 moving picture encoding method being executed until the number of insertions of the I slice is greater than or equal to a predetermined value.
- the number of insertions is large, even if improper propagation of image quality degradation occurs, the insertion after the occurrence usually suppresses the influence of propagation, and the image quality degradation due to propagation occurs within a short time. Disappear. For this reason, even if the method A1 is not executed, the image quality is hardly deteriorated.
- the method A1 is not executed, inter-frame encoding using a motion vector can be performed, and the amount of data after encoding can be reduced. That is, the amount of data after encoding can be further reduced while maintaining high image quality.
- FIG. 1 is a block diagram showing a configuration of a moving image encoding apparatus 1 according to the first embodiment of the present invention.
- the picture number counter unit 100 measures the number of pictures to be encoded. In addition, the picture number counter unit 100 notifies the slice type setting unit 103 of the number of pictures.
- the block number counter unit 102 measures the number of blocks in the picture to be encoded. Further, the block number counter unit 102 notifies the slice type setting unit 103 of the number of blocks.
- the motion search determination unit 104 receives a slice type notification from the slice type setting unit 103.
- the motion search determination unit 104 determines whether the encoding target slice is an MC-P slice (first P slice) that is a P slice for performing motion prediction, or a motion search. It is determined whether it is a NoMC-P slice (second P slice) that is a P slice that is not subjected to the above.
- the motion search determination unit 104 notifies the slice type setting unit 103 of the identification of the I slice, MC-P slice, and NoMC-P slice.
- the slice type setting unit 103 determines whether the encoding target slice to be encoded by the encoding unit 200 is an I slice or a P slice from the number of blocks notified from the block number counter unit 102. The slice type setting unit 103 notifies the motion search determination unit 104 of the determined slice type.
- the slice type setting unit 103 receives an identification from the motion search determination unit 104 as an MC-P slice or a NoMC-P slice.
- the slice type setting unit 103 determines the position of the I slice, the position of the NoMC-P slice, the position of the picture within the picture from the height of the image, the height of the I slice, the height of the P slice, and the height of the search range for motion search.
- the P slice division position and height are determined respectively.
- the slice type setting unit 103 determines the slice division position where the position of the set I slice has moved down by the height of the I slice. To do.
- the slice type determined by the slice type setting unit 103 is determined by the slice type setting unit 103 using a motion search unit 2001, a motion compensation unit 2002, a reference image duplication unit 2003, an intra-screen prediction unit 2004, and a selector unit in the encoding unit 200. Each is notified to 2005. Note that the entire motion search unit 2001 and motion compensation unit 2002 are called a search unit 2002a.
- the intra-screen prediction unit 2004 predicts an input image signal (pixel value) from an already encoded pixel (not shown) in the same picture, and uses the predicted pixel value as a predicted image (third predicted image).
- the data is output to the selector unit 2005.
- the intra-screen prediction unit 2004 may perform prediction from only the pixel of the slice at the position of the predicted image among the pixels in the same picture, for example.
- the intra-screen prediction unit 2004 identifies an image closest to the position of the predicted image among images at a plurality of positions suitable as the predicted image included in the slice, for example, The third predicted image may be specified.
- the motion search unit 2001 searches for a pixel position having the highest correlation with the input image signal, and notifies the motion compensation unit 2002 of the position (motion vector).
- the motion compensation unit 2002 reads out the pixel value at the position of the motion vector notified from the motion search unit 2001 from the reference image held by the reference image holding unit 2011, and selects it as a predicted image (second predicted image) as a selector unit 2005. Output to.
- the reference image duplication unit 2003 outputs the image at the block position held by the reference image holding unit 2011 to the selector unit 2005 as a predicted image (first predicted image).
- the reference image duplication unit 2003 outputs the first predicted image
- the motion compensation unit 2002 outputs the second predicted image
- the in-screen prediction unit 2004 outputs the third predicted image. Also good.
- the third predicted image is a predicted image for the moving image coding apparatus 1 to perform only spatial compression among spatial compression and temporal compression.
- the second predicted image is a predicted image for performing both compressions.
- the first predicted image is a predicted image for performing temporal compression only.
- the third predicted image is, for example, a predicted image for intra-coding the image.
- the second predicted image is a predicted image for inter-coding the image, for example.
- the selector unit 2005 is notified of the slice type (I slice, MC-P slice, NoMC-P slice) from the slice type setting unit 103. If the notified slice type is an I slice, the selector unit 2005 selects the predicted image (third predicted image) generated by the intra-screen prediction unit 2004.
- the selector unit 2005 encodes the predicted images (third predicted image and second predicted image) generated by the intra-screen prediction unit 2004 and the motion compensation unit 2002. Select one with a small number of bits.
- the selector unit 2005 uses the code among the prediction images (third prediction image and first prediction image) generated by the intra-screen prediction unit 2004 and the reference image replication unit 2003.
- the prediction image with the smaller number of quantization bits is selected.
- selection may be made from three of a first predicted image, a second predicted image, and a third predicted image.
- the subtractor 2006 performs subtraction between the input image and the predicted image (selected predicted image) selected by the selector unit 2005, and outputs a prediction error (subtracted image).
- the DCT / quantization unit 2007 performs transformation (orthogonal transformation) and quantization from the time domain to the frequency domain with respect to the prediction error (subtracted image), and the quantized value is compared with the entropy coding unit 2012 and the inverse quantum. To the conversion / inverse DCT unit 2008, respectively.
- the inverse quantization / inverse DCT unit 2008 performs inverse quantization and inverse transformation from the frequency domain to the time domain (inverse orthogonal transformation) on the quantization value output from the DCT / quantization unit 2007, Output the difference image.
- the adder 2009 adds the predicted image (selected predicted image) output from the selector unit 2005 and the difference image output from the inverse quantization / inverse DCT unit 2008 to generate a reconstructed image.
- the filter unit 2010 applies a deblocking filter for removing block distortion to the reconstructed image output from the adder 2009.
- the reference image holding unit 2011 holds the image output from the filter unit 2010 in a memory such as a memory that is at least a part of the reference image holding unit 2011, for example. Then, the held image to be held is referred to as a reference image from the motion search unit 2001, the motion compensation unit 2002, and the reference image duplication unit 2003, respectively.
- the filter unit 2010 is H.264. It is necessary for H.264, but is not necessary for image encoding such as MPEG-1, MPEG-2, and MPEG-4.
- the entropy encoding unit 2012 converts the quantized value, which is the output of the DCT / quantization unit 2007, into a bit string by variable-length encoding or arithmetic encoding, and converts the converted bit string to the packetizing unit 300 Output.
- the packetization unit 300 configures the bit string that is the output of the entropy encoding unit 2012 into packets that are divided into predetermined bit number units.
- the configured packet is transmitted to the image decoding apparatus via the network.
- FIG. 2 is a diagram illustrating data in the slice division method performed by the video encoding device 1.
- the picture (one frame) shown in FIG. 2 is composed of a plurality of blocks.
- a shaded block area (I slice 41) is an I slice.
- An area with vertical lines (NoMC-P slice 42) and a white area (area without hatching, MC-P slice 44) are refreshed P slices, and areas with horizontal lines (MC-P).
- the P slice 43) is a P slice including image quality deterioration due to a transmission error.
- the I slice 41, the NoMC-P slice 42, and the MC-P slice 44 constitute a refresh completion region PR4 (FIG. 5). Further, the MC-P slice 43 forms an unrefreshed region PR3 (FIG. 5).
- the other P slices are MC-P slices.
- the slice division determination unit may be, for example, at least a part of the slice type setting unit 103 (setting unit 103a) in FIG.
- (A) to (p) in FIG. 2 are a plurality of pictures that are consecutive in this order in time order.
- the slice type setting unit 103 sets the position of the I slice 41 in the picture by the height of the I slice 41 (this In the embodiment, the slice division is performed so that the line moves down L).
- the slice type setting unit 103 determines a P slice, which is a region with a vertical line directly above the I slice 41, as a NoMC-P slice (NoMC-P slice 42).
- the slice type setting unit 103 displays the screen until the NoMC-P slice 42 can secure the height W block line (while it cannot be secured).
- the area from the upper end to the I slice 41 (all areas) is determined as the NoMC-P slice 42.
- the slice type setting unit 103 divides the remaining area into P slices and cannot secure the height M block lines of the P slice at the uppermost end and the lowermost end of the screen.
- the height of the P slice at the screen edge is made smaller than the M block line.
- slices smaller than the M block line are exemplified by the uppermost MC-P slice 44 in (e) and the lowermost MC-P slice 43 in (d), for example.
- the search range of the block (block 44x) of slice #slc_n in (n) in FIG. 2 is the P slice (MC-P slice 43, (Unrefreshed area) is not included.
- error propagation can be prevented.
- the block of #slc_n (block 44x) is decoded by the decoder, the decoded image of the block of #slc_n is an area refreshed in the past (refresh completed area PR4 in FIG. 5: (( This is because the image is generated by the decoder by referring only to the m) I slice 41, NoMC-P slice 42, and MC-P slice 44 areas.
- FIG. 3 is a diagram illustrating operations of slice division and slice type determination.
- FIG. 4 is a flowchart of the moving picture encoding apparatus 1.
- FIG. 3 illustrates slice division and slice type determination operations of the slice type setting unit 103 and the motion search determination unit 104
- FIG. 4 illustrates a flowchart of the video encoding device 1.
- the height L 1 of the I slice 41
- the height M 4 of the MC-P slice (MC-P slice 43, MC-P slice 44)
- the slice division determination unit obtains the division size of a slice of one picture from the size of the I slice line, the MC-P slice line, the NoMC-P slice line, and the height of the screen. Keep it in memory.
- the slice division determination unit updates the slice division position and size. Specifically, as illustrated in FIG. 3, the slice division determination unit stores an array and the size of each slice. Then, the number of each slice between the start pointer and the end pointer is the number of macroblock lines of that slice constituting the picture. Each slice is associated with the slice type of the slice. Each time the number of pictures increases by 1, the content of data stored in the slice division determination unit 101 changes in the order of (a) to (j) in FIG. 3A to 3J correspond to FIGS. 2A to 2J, respectively.
- the slice division determination unit increments the value stored in the array at the position pointed to by the first pointer and decrements the value stored in the array at the position pointed to by the end pointer.
- the slice division determining unit determines that the height of the slice (the slice pointed to by the head pointer) is the maximum slice type of the slice (the slice pointed to by the head pointer) with respect to the head pointer (M-No slice is MC-P slice)
- M-No slice is MC-P slice
- the slice division determination unit moves one pointer when the value of the end pointer becomes 0 (that is, the height of the slice pointed to by the end pointer becomes 0). That is, the pointed slice is changed to a slice that has been moved by one.
- the slice division determination unit determines the slice height and the slice type while shifting the positions of the head pointer and the terminal pointer, respectively (S1001). Note that the data shown in FIG. 3 is stored, for example, by the slice division determination unit.
- the block number counter unit 102 sets the block number counter (value measured by the block number counter unit 102) to 0 (S1002).
- the slice type setting unit 103 reads out the slice type of the slice to be encoded and the size of the slice (number of macroblock lines) from the array in FIG. 3 (S1003).
- the product of the number of macroblock lines and the number of macroblocks in one line (one row) is the maximum number of blocks in the slice.
- the selector unit 2005 switches the predicted image creation method according to the slice type read from the array (S1004). That is, which predicted image is selected as the selected predicted image is changed.
- the selector unit 2005 sets the output (third predicted image) of the intra prediction unit 2004 of the encoding unit 200 as a candidate for a selected predicted image (S1005). .
- the selector unit 2005 sets the output (first predicted image) of the reference image duplicating unit 2003 as a candidate for a selected predicted image (S1006).
- the selector unit 2005 sets the second predicted image created by the motion search unit 2001 and the motion compensation unit 2002 of the encoding unit 200 as a candidate for a selected predicted image (S1007). ).
- the selector unit 2005 selects one of the predicted images in S1005, S1006, and S1007. In other words, among them, the number of bits that encoded the error with the encoding target block is (the smallest), or the number of bits that encoded the error is predicted to be small, or the size of the error One (smallest) is selected as the selected predicted image.
- the selector unit 2005 encodes the error (subtracted image) by the DCT / quantization unit 2007 and the entropy encoding unit 2012 (the subsequent stage unit 200a) (S1008).
- the block number counter unit 102 increments the block number by 1 when the encoding unit 200 completes the encoding in block units (S1009). Further, when the setting unit 103a or the like determines that the number of blocks after being incremented by 1 is not the maximum number of blocks of the slice, that is, the encoded block is not the last block of the slice (“No” in S1010) ) In steps S1004 to S1010, the moving image encoding apparatus 1 encodes the next block. If the number of blocks is the maximum number of blocks in the slice, the moving image encoding apparatus 1 encodes the next slice (“Yes” in S1010).
- the setting unit 103a and the like determine whether or not encoding of all slices of the picture has been completed (S1011).
- the slice division determination unit updates the reading position of the array in FIG. 3 (S1013).
- the slice type setting unit 103 or the like reads the array of the next slice (S1003).
- the picture number counter unit 100 increases the number of pictures by 1 (S1012).
- the setting unit 103a or the like determines whether or not encoding of all the pictures has been completed (S1014). If there is a picture that has not been coded, the moving picture coding apparatus 1 codes the next picture in S1001 to S1011.
- all or part of the P slices (MC-P slices, NoMC-P slices) in the description of the embodiments may not refer to only past images. That is, all or a part may be a slice (B slice) that refers to a past image and also refers to a future image.
- the image quality deterioration is infinite when one I-slice received later is received ( Propagation can be prevented for a long time). This can be prevented without dynamically changing the motion search range.
- FIG. 5 is a diagram showing a relationship between a picture PS to be encoded, a reference picture PR, and a subsequent picture PT after the target picture PS.
- the reference destination picture PR is a picture encoded by the encoding unit 200 before the encoding target picture PS is encoded. That is, the reference picture PR is a picture that is encoded with reference to the picture for the target picture PS.
- the reference picture PR has a refresh completion area PR4 and an unrefreshed area PR3.
- the refresh completion region PR4 has an I slice PR2 at the last part in the traveling direction (downward) of the I slice.
- the unrefreshed region PR3 has a region PR31 that may cause error propagation by being referenced at the forefront of the I slice in the traveling direction.
- the target picture PS has a refresh completion area PS1 and an unrefreshed area PS3.
- the I slice PR2 is included in the refresh completion area PS1.
- the subsequent picture PT in FIG. 5 is, for example, a picture next to the target picture PS.
- the motion compensation unit 2002 performs encoding using the first predicted image.
- a predicted image to be used is selected with sufficient freedom while avoiding that encoding with reference to the image in the unrefreshed region PR3 is performed on the image in the refresh completed region PS1.
- the data is sufficiently compressed while preventing the image quality degradation from propagating from the unrefreshed region PR3 to the refresh completed region PS1.
- the block to be encoded is a block of NoMC-P slice (block of the encoding target area PSA2)
- encoding by the second predicted image by the motion compensation unit 2002 is not performed, and the reference image Only the encoding by the 1st prediction image by the duplication part 2003 is performed.
- propagation of image quality degradation from the unrefreshed region PR3 to the refresh completed region PS1 is prevented while encoding is performed by a simple process using the first predicted image.
- the first predicted image is appropriately used. Encoding or encoding by the second predicted image may be performed. Further, when the block to be encoded is a block of a NoMC-P slice, encoding with the second predicted image may be performed as appropriate. As a result, the data can be more fully compressed.
- a region where the distance from the unrefreshed region PR3 in the I slice PR2 is equal to or smaller than a predetermined distance may be a margin region that is not referred to in the encoding by the first predicted image. preferable.
- a deblocking filter process and a decimal precision motion compensation process are performed.
- a predetermined first distance for example, 2 pixels
- the processing of decimal precision motion compensation of pixels is performed in the moving image coding apparatus 1, the distance from one pixel is determined in advance.
- the other pixel that is less than or equal to a predetermined second distance affects that one pixel.
- such an area of a predetermined distance (5 pixels) or less is a margin area that is not referred to in encoding by the first predicted image. That is, the width of the NoMC-P slice 42 (FIG. 2) is sufficiently large so that the region (paste margin region) that is equal to or smaller than the predetermined distance is not referred to in the encoding by the first predicted image. It is preferable that the width is very large.
- the moving image encoding method is a moving image encoding method for solving the following problem.
- the moving image coding method for this problem stops the motion search of the slice (NoMC-P slice 42) located above the I slice to be refreshed without dynamically limiting the motion search range. This is a method of preventing error propagation without referring to an unrefreshed area.
- FIG. 6 is a flowchart of processing according to the type of slice (slice type). In the process of FIG. 4, more specifically, for example, the operation shown in FIG. 6 may be performed.
- the selector unit 2005 selects the second predicted image (S3004C) by the motion compensation unit 2002. It selects as a prediction image (S3005C). Note that even if the MC-P slice is specified, the selector unit 2005 selects the first predicted image (S3003C) by the reference image duplicating unit 2003 or performs intra-screen prediction in a certain exception. The third predicted image (S3002C) by the unit 2004 may be selected. In this exception, only the third predicted image may be selected.
- the selector unit 2005 selects the first predicted image (S3003B) when the slice type setting unit 103 identifies the NoMC-P slice as the slice type (S3001: NoMC-P) (S3005B). Note that the selector unit 2005 may select the third predicted image (S3002B) in the case of a certain exception even if the NoMC-P slice is specified.
- the selector unit 2005 selects the third predicted image (S3002A) when the slice type setting unit 103 identifies the I slice as the slice type (S3001: I) (S3005A).
- the three processes of the process by the in-screen prediction unit 2004, the process by the reference image duplicating unit 2003, and the process by the motion search unit 2001 and the motion compensation unit 2002 are specifically performed in parallel with each other, for example. May be.
- the moving image encoding apparatus includes a slice type setting unit (slice type setting unit 103 and setting unit 103a), a selector unit (selector unit 2005), and a difference processing unit (following stage). Part 200a).
- the rear stage unit 200a includes a subtracter 2006, a DCT / quantization unit 2007, an entropy encoding unit 2012, and the like.
- each of the slice type setting units 103 and the like may specifically be function blocks of functions realized by a circuit, for example.
- the slice type setting unit determines the position of the I slice (I slice PR2, I slice PS2, I slice PT1) in the picture (reference destination picture PR, target picture PS, and subsequent picture PT).
- the slice type setting unit determines different positions as the positions of the I slices in a plurality of pictures.
- the difference processing unit encodes the moving image by encoding the difference between the block and the selected predicted image of the block for each block of the moving image picture.
- the selector unit selects the selected predicted image and causes the difference processing unit to use the selected selected predicted image.
- the selector unit selects the predicted image as the selected predicted image, and the reference destination encoded before the encoding target picture (target picture PS) is encoded.
- a past image that is an image of the current picture (reference destination picture PR) is selected as follows.
- the selector unit performs selection as follows according to a specific area (area of the NoMC-P slice 42 (first area R1)) described in detail later in the picture to be encoded.
- the specific area is an area (NoMC-P slice 42) that may refer to the unrefreshed area PR3 of the reference destination picture (reference destination picture PR) in the refresh completion area PS1 of the target picture PS. Area).
- the refresh completion area PS1 is an area where the positions of the I slices 41 (I slice PR2) in each picture before the picture (target picture PS) are gathered.
- the unrefreshed area PR3 is an area where the positions of the I slices 41 (I slice PT1) in each picture after the picture (reference destination picture PR) are gathered.
- the selector unit uses a block (second predicted image) at a position other than the position of the block in the reference destination picture as a selected predicted image. select.
- the selector unit does not select the block at the other position (second predicted image) among the blocks in the reference destination picture for the block in the specific area (the block of the NoMC-P slice 42).
- a selector part selects the block (1st prediction image) of the position same as the position of the block about a block of a specific area as a selection prediction image.
- the use of the first predicted image prevents the image quality degradation from propagating from the non-refresh area PR3 to the refresh completion area PS1.
- the propagation of image quality deterioration can be prevented simply by using the second predicted image at the same position, and both propagation prevention and processing simplicity can be achieved.
- processing can be performed at high speed, and both prevention of propagation and high-resolution data such as high-definition data can be processed.
- an exception to the second predicted image may be exceptionally selected for blocks in other regions.
- an exception to the first predicted image may be exceptionally selected for the block in the specific region.
- the slice type setting unit holds, for example, data (FIG. 3) for specifying the type of each slice included in the picture. Then, for example, the slice type setting unit changes the content of the data to be held to the content that specifies the type of the slice in the specific region as the NoMC-P slice 42. Then, the selector unit may perform the above-described processing based on, for example, the content of data held.
- the content of the held data includes, for example, information for specifying the position, range, and type of each slice of the picture, and information for specifying the first picture and the last picture of the picture. May be.
- the moving image encoding apparatus 1 may be provided in a video conference system that transmits a video image of a video conference between, for example, a first site and a second site. . Then, the moving image encoding apparatus 1 may encode the transmitted video image of the video conference. That is, for example, the moving image may be a full high definition (full high definition) moving image in a video conference, for example.
- the transmission delay is avoided and the display of the transmitted moving image is delayed.
- the display may be interrupted. Thereby, the realistic feeling of the video conference by the displayed moving image can be improved.
- processing according to the first region R1 (FIG. 5) and the second region R2 may be performed.
- the following processing is merely an example. The following processing may be performed only in a certain aspect.
- the picture ((c)) before the picture (for example, (d)) is set.
- the I slice 41 of the picture ((d)) may be set by the setting unit 103a at a position next to the position of the I slice 41.
- the next position is a position adjacent to the position of the I-slice 41 of the previous picture on the side in the traveling direction of the I-slice 41 (lower side in FIG. 2) than the position of the previous picture. is there.
- the position where the I slice 41 is set may be moved by the setting unit 103a in the direction of the traveling direction for each picture.
- a block (block of MC-P slices 43 and 44) of a picture PS to be encoded (blocks of MC-P slices 43 and 44) in a reference picture PR within a search range (search range SA) of that block ( It may be encoded by the second inter-screen encoding unit 192 using the second predicted image).
- the predicted image may be, for example, the second predicted image at a position searched from the search range in the reference picture PR.
- the second inter-frame encoding unit 192 may be, for example, part or all of the functions of the rear-stage unit 200a.
- the search range in the block of the second region R2 (FIG. 5) (see the search range Sx2 in FIG. 25) does not have to overlap with the unrefreshed region PR3.
- the search range (see search range Sx1 in FIG. 25) in the block of the first region R1 may have an overlap with the unrefreshed region PR3.
- the first predicted image (by the reference image copying unit 2003) for the block (blocks B1 and B2) in the first region R1 is an image at the same position as the position of the block in the reference picture PR. It is.
- the position of the first region R1 is within the refresh completion region PS1 in the target picture PS. For this reason, the same position as the position of the first region R1 in the reference picture PR is in the refresh completion region PR4.
- the position of the first predicted image for the block at the position of the region R1 is within the refresh completion region PS1 in the target picture PS.
- the second predicted image is used only in the coding of the block in the second region R2 (second inter-screen coding unit 192), and in the coding of the block in the first region R1, the first A predicted image may be used (first inter-screen encoding unit 191).
- the second inter-frame encoding unit 192 performs encoding using the second predicted image only on the blocks in the second region R2, and the blocks in the first region R1 It does not have to be done. Then, the first inter-frame encoding unit 191 does not perform the encoding using the first predicted image on the block in the second region R2, but only on the block in the first region R1. It may be done.
- the prediction image (first prediction image) in the refresh completion region PR4 in the reference destination picture PR is used so that the propagation of deterioration does not occur. it can.
- the NoMC-P slice 42 may be set in the first region R1 by the setting unit 103a. And other P slices (MC-P slice 43x) other than the set NoMC-P slice 42 are encoded with the second predicted image, and the set NoMC-P slice 42 is the first predicted image. May be encoded.
- FIG. 7 is a block diagram showing a configuration of a moving picture coding apparatus 1A according to Embodiment 2 of the present invention.
- the description of the same configuration as that of the moving image encoding apparatus 1 of Embodiment 1 will be omitted as appropriate.
- the slice insertion count setting unit 105 (for example, a part of the selection unit 105x (FIG. 28)) performs screen refresh for preventing image quality degradation propagation in the moving image coding apparatus 1A when a transmission error occurs.
- the number of insertions of the I slice by the moving picture coding apparatus 1A is determined.
- the slice insertion number setting unit 105 notifies the determined insertion number to the slice type setting unit 103 and the motion search determination unit 104, respectively.
- the determination of the number of insertions is based on the encoding result transmission method (S2001 in FIG. 10), the bit rate of the network to be transmitted (S2002), the presence or absence of a notification that a transmission error has occurred on the receiving side (S2003), and the like.
- the slice insertion number setting unit 105 This is performed by the slice insertion number setting unit 105. Specifically, in this determination process, it is selected whether the I slice is repeatedly inserted an infinite number of times (S2005A in FIG. 10) or a predetermined number of times (S2005B). In addition, specify the number of insertions. As will be described in detail later, in this process, in a fixed case (S2004: NO), the number of insertions may be designated as 0 and it may be selected not to be inserted.
- FIG. 8 is a diagram showing an example of the difference in the number of I slice insertions.
- FIG. 8A shows a case where an infinite number of times are inserted
- FIG. 8B shows a case where an I slice is inserted only once.
- FIG. 9 is a flowchart showing processing according to the number of insertions by the moving image encoding apparatus 1A.
- the motion search determination unit 104 determines whether the insertion method is an infinite number of insertions, or even if the insertion method is a predetermined number of insertions. If the insertion is an insertion with the number of insertions greater than or equal to the predetermined number, or any of them (S41 in FIG. 8: YES, S41a), the following processing is performed. That is, the process to be performed is a process in which the slice immediately above the I slice is the MC-P slice (S4000: YES, S4001 in FIG. 9).
- the motion search determination unit 104 directly above the I slice. Are determined as NoMC-P slices (S4000 in FIG. 9: NO, S4002).
- the circulation method notified from the slice insertion number setting unit 105 is a finite number of insertions (S2005C in FIG. 10)
- the motion search determination unit 104 every time the slice at the lowest position of a picture becomes an I slice.
- the retained number of circulations is decreased by 1 and the retained number of circulations becomes 0, all slices are set as MC-P slices.
- predetermined value (predetermined number) for determining the number of insertions of the I slice may be a fixed value that depends on the size of the picture (number of vertical lines), for example.
- the frequency of refresh by inserting an I slice is equal to or higher than a predetermined value (including infinite times) (S4000 in FIG. 9: YES)
- the P slice Are all MC-P slices (S4001), and if it is less than a predetermined value (S4000: NO)
- the NoMC-P slice and the MC-P slice are used together as in the first embodiment (S4002).
- the insertion method for inserting the I slice may be selected from the first insertion method for repeatedly inserting the I slice and the second insertion method for inserting only a predetermined number of times (a predetermined number of times). Good.
- the main video encoding method includes a setting step of setting the insertion method selected in this way, and in the third encoding step, when the set insertion method is the first insertion method, Inter-screen coding may be performed. In the fourth encoding step, the inter-frame encoding may be performed when the set insertion method is the second insertion method.
- the predetermined insertion method when the set insertion method is the second insertion method, the predetermined insertion method (the number of insertions is a predetermined number or more). In the case of a large number), processing may be performed.
- the fourth encoding step only when the set insertion method is the second insertion method and not the above-described predetermined case (the number of insertions is less than the predetermined number). Case).
- FIG. 10 is a flowchart of the slice insertion number setting unit 105.
- the slice insertion count setting unit 105 determines to insert an I slice an infinite number of times in the following cases (S2005A, S4000: YES in FIG. 9, S41: YES in FIG. 8).
- the slice insertion number setting unit 105 notifies the determination to the slice type setting unit 103 and the motion search determination unit 104.
- the slice insertion number setting unit 105 performs finite insertion of the I slice. The number of times (described as 1 in the present embodiment) is determined (S2005C). The slice insertion count setting unit 105 notifies the motion search determination unit 104 of the determination. When there is no packet loss in the network, the slice insertion count setting unit 105 does not notify the motion search determination unit 104 of the insertion of the I slice (“No” in S2004). When the network is NGN (Next Generation Network), it is guaranteed by the network provider that there is no packet loss. The slice insertion count setting unit 105 may perform the process of S2005B when the network is NGN.
- NGN Next Generation Network
- the moving picture coding apparatus 1A determines the insertion frequency of the I slice. That is, the number of image decoding devices to be distributed (S2001), the bit rate (S2002), the availability of notification of whether or not stream packets have been lost by the connected image decoding device (S2003), and the status of packet loss on the network (S2004) The decision is made accordingly. Thereby, the method of inserting the I slice is changed, and the moving picture coding apparatus 1A that performs coding in consideration of deterioration of the compression rate is configured.
- Each of the P slices 42Aa and 42Ab) (NoMC-P slice 42A) is inter-coded without using a motion vector, and among the plurality of first P slices included (NoMC-P slices 42Aa and 42Ab).
- the maximum value of the size of the first P slice (for example, the size of the NoMC-P slice 42Aa) is the maximum size of the second P slice (MC-P slices 43 and 44 in FIG. 19). This is a moving image encoding method smaller than the value (the size of the MC-P slice 43 in FIG. 19).
- the maximum value of the size of the first P slice (for example, the size of the NoMC-P slice 42Aa) is the first P slice having the maximum value. It may be larger than the size of the I slice (I slice 41 in FIG. 19) in the picture (picture PS in FIG. 19) including (NoMC-P slices 42Aa, 42Ab).
- 11 to 13 are diagrams for explaining the third embodiment.
- the moving picture encoding apparatus may have, for example, the same configuration as in FIG. Then, for example, processing similar to the processing of the flowchart of FIG. 4 may be performed, or processing similar to the processing of FIG. 6 may be performed.
- the difference value between the input image signal and the pixel position having the highest correlation may be encoded using motion search.
- the difference value is larger when encoding is performed without motion search. Since the size of the difference value is increased, the number of bits required for encoding increases. This means that the number of encoded bits of a slice in the no motion search range (NoMC-P slice) increases.
- the size (number of blocks) of the slice without motion search (NoMC-P slice), which is a slice in which the number of encoded bits increases, is the same as that of the slice using motion search (MC-P slice). Make it smaller than the size (number of blocks). In this way, the number of coded bits of the slice should be made constant. If the range in which motion search should not be performed (first region R1) is larger than the size of the slice without motion search (NoMC-P slice), the number of slices without motion search is set to a plurality. Thus, a range (first region R1) of a necessary size that should not be subjected to motion search is realized.
- the following operation may be performed.
- FIG. 13 shows the encoding target area PSA2.
- FIG. 11 shows a plurality of NoMC-P slices 42A.
- the encoding target area PSA2 includes a first encoding target area PSAa, a second encoding target area PSA2b, and (two or more portions).
- the first encoding target area PSAa is an area in which the first NoMC-P slice 42Aa (FIGS. 11 and 19) is set.
- the second encoding target area PSAb is an area in which the second NoMC-P slice 42Ab (FIGS. 11 and 19) is set.
- the NoMC-P slice (NoMC-P slice 42, 42A) is encoded without using the second predicted image. For this reason, the data amount of the encoded data in which the NoMC-P slice 42 is encoded is relatively large. That is, for example, such a large amount of data is obtained by encoding a slice other than the NoMC-P slice 42 (for example, the MC-P slice) using the second predicted image. It is conceivable that the amount of data is 10 times the amount of data.
- one slice is one transmission unit.
- the amount of data in the transmission unit of the NoMC-P slice 42 becomes a large amount of data such as 10 times the amount of data, and there is a risk that the fluctuation range of the data amount in each transmission unit will increase. is there.
- Such a relatively small size may be, for example, about 1 ⁇ 2 of the size of the NoMC-P slice 42 of FIG.
- the data structure of FIG. 12 may be used. That is, for example, the height of the first NoMC-P slice 42Aa (first row data in each of (a) to (j)) and the height of the second NoMC-P slice 42Ab (second row). Data) may be stored.
- the type of the NoMC-P slice 42A is NoMC-P. It may be determined (S3001: NoMC-P, S1004: NoMC-P).
- S3002B to S3005B may be performed for each NoMC-P slice 42A.
- the moving picture coding method according to the fourth embodiment is the first picture including the I slice and the P slice at the first time (for example, the time (i) in FIG. 17) ((i in FIG. 17). ) And a second picture (of (k) in FIG. 17) including an I slice and a P slice at a second time (time (k)) later than the first time. And a third picture (not including an I slice) at an intermediate time (time (j)) between the first time and the second time.
- This is a moving image encoding method for encoding (j) in FIG.
- the third picture ((j) in FIG. 17) is the first area (FIG. 17) in the first picture ((i) in FIG. 17).
- the first encoding step first inter-frame encoding unit 191, Sa1
- the first P slice NoMC-P slice in the region R3 of the third picture ((j) of FIG. 17).
- the third picture ((j) in FIG. 18) is the first area (the area of the I slice 41) in the second picture ((k) in FIG. 18). ) In the same region R3 of the third picture ((j) in FIG. 18) in the first encoding step (the region R3 in FIG. 18). (P slice) 42N is inter-coded without using a motion vector, and in the second encoding step, in the third picture ((j) of FIG. 18) of other areas other than the same area R3.
- the second P slice PNx may be inter-coded using a motion vector.
- the maximum value of the size of the first P slice 42B may be smaller than the maximum value of the size of the second P slice PMx included in the third picture ((j) in FIG. 20).
- Each of the P slices) 42C is inter-coded without using a motion vector, and among the plurality of first P slices 42C included in the third picture ((j) of FIG. 21), The maximum value of the size of the first P slice 42C may be smaller than the maximum value of the size of the second P slice PNx included in the third picture ((j) in FIG. 21).
- FIG. 17 is a diagram showing the NoMC-P slice 42M and the like.
- FIG. 17 there may be a picture PM in which no I slice is set.
- the picture PM is, for example, a picture at an intermediate time.
- the intermediate time is the earlier time of the picture PR (I) of FIG. 17 in which the I slice PR2 is set and the later time of the picture PS (K of FIG. 17) in which the I slice PS2 is set. It is an intermediate time between Specifically, the picture PM at the intermediate time is a picture immediately after the picture PR and a picture immediately before the picture PS. That is, the picture PR may be a picture before the picture PM, and the picture PS may be a next picture.
- the early-time picture PR may be, for example, the latest and latest-time picture among the pictures for which I slices have been set in the past when the intermediate picture PM is processed.
- the NoMC-P slice 42M may be set by the setting unit 103a in the picture PM at the intermediate time (step Sa0b).
- the NoMC-P slice 42M to be set is, for example, as shown in FIG. 17, a slice of a region R3 composed of both the region of the NoMC-P slice 42 and the region of the I slice PR2 in the picture PR at an early time. It is.
- the refresh completion area (NoMC-P slice 42M and the intermediate MCMC) from the unrefresh area (the I slice PR2, the NoMC-P slice 42, and the MC-P slice 44 area) of the picture PR at the earlier time.
- the propagation of deterioration to the MC-P slice 44 region may be eliminated.
- the deterioration in the refresh completion area of the picture PM at the intermediate time is eliminated, so that the refresh completion area of the picture PS at the later time is changed from the refresh completion area in the picture PM at the intermediate time (see FIG. 5 and the like). Propagation of degradation to) is reliably avoided. As a result, the deterioration that propagates to the refresh completion region of the late-time picture PS can be reliably eliminated.
- a picture PR at an early time includes one or more MC-P slices PRx (MC ⁇ ) in addition to the NoMC-P slice 42 and the I slice PR2.
- P slices 44 and 43) may be set by the setting unit 103a.
- the MC-P slice PMx having the same width as that of the MC-P slice PRx is located at the same position as the position of each MC-P slice PRx. It may be set. That is, the MC-P slice PMx in the same area as the area of each MC-P slice PRx may be set in the picture PM at the intermediate time.
- the slice division which is the same as the slice division in the picture PR at the early time, is performed by the setting unit 103a in the area other than the area of the NoMC-P slice 42M. Good.
- the division of the slice in the intermediate time picture PM may be a division corresponding to (similar to) the division of the slice in the early time picture PR.
- FIG. 17 there may be a plurality of intermediate time picture PMs (FIG. 17). That is, there may be an intermediate picture PM at each of two or more times between the early time of the picture PR and the late time of the picture PS. Then, the same processing as described above may be performed for each intermediate picture PM.
- the picture PR in which the I slice (I slice PS2) is set immediately before the intermediate time picture PM the picture PR in which the I slice (I slice PS2) is set immediately before the intermediate time picture PM.
- FIG. 18 is a diagram showing the NoMC-P slice 42N and the like.
- the NoMC-P slice 42N may be set in the picture PN at the intermediate time.
- the NoMC-P slice 42N to be set is a slice in the same area as the area of the NoMC-P slice 42 in the picture PS at a later time.
- the MC-P slice PNy having the same area as the area of the I slice 41 in the late-time picture PS may be set in the intermediate time picture PN.
- a normal MC-P slice (MC-P slice PNy) may be set in the same area.
- the MC-P slice PNx having the same area as that of each MC-P slice PSx in the late-time picture PS may be set in the intermediate-time picture PN.
- the division of the slice in the intermediate time picture PN is performed later in the area other than the area of the MC-P slice PNy (the area of the I slice 41 in the late-time picture PS). It may be the same as the division by the picture PS in FIG.
- the MC-P slice PNy area may be different only in the type of slice (MC-P slice, I slice).
- the division of the slice in the picture PN at the intermediate time may be a division corresponding to (similar to) the division in the picture PS at the later time.
- the NoMC-P slice 42N in FIG. 18 is smaller than the NoMC-P slice 42M in FIG. That is, for example, the NoMC-P slice 42N in FIG. 18 is the width of the movement of the I slice from the position of the I slice 41 in the early time picture PR to the position in the I slice PS2 in the late time picture PS. Therefore, it may be smaller than the NoMC-P slice 42M in FIG.
- the first predicted image is not used, and the data amount of the encoded data becomes relatively large.
- FIG. 20 is a diagram showing a picture PMB or the like at an intermediate time.
- the division corresponding to the division at the picture PR at the early time is performed in the same manner as the example in FIG. 17 described above.
- the number of NoMC-P slices 42B included in the plurality of NoMC-P slices 42B may be two, three, or other numbers, for example.
- FIG. 21 is a diagram showing a picture PMC and the like at an intermediate time.
- the division corresponding to the division at the late-time picture PS is performed in the same manner as the example in FIG. 18 described above.
- the number of NoMC-P slices 42B included in the plurality of NoMC-P slices 42B may be two, three, or any other number.
- the number of blocks in the NoMC-P slice may be smaller than the number of blocks in the normal P slice (MC-P slice 43).
- the number of bits after encoding of the NoMC-P slice can be avoided and reduced.
- packet loss in the transmission path through which the encoded data is transmitted can be prevented.
- one slice is one transmission unit (may be one packet). If the size of one transmission unit exceeds a certain size, packet loss is likely to occur.
- FIG. 14 to FIG. 16 are explanatory diagrams when the moving picture coding apparatus according to each of the above embodiments is implemented by a computer system using a program recorded on a recording medium such as a flexible disk.
- FIG. 14 is a diagram showing an example of a physical format of a disk FD of a flexible disk (see FIG. 15) which is a recording medium body.
- FIG. 15 is an external view (left figure) of the flexible disk, a sectional structure of the flexible disk (middle figure), and a diagram (right figure) showing the disk FD.
- the flexible disk includes a case F and a disk FD built in the case F.
- a plurality of tracks Tr are formed concentrically from the outer periphery toward the inner periphery.
- Each track Tr is divided into 16 sectors Se in the angular direction. Therefore, the program is recorded in an area allocated on the disk FD.
- FIG. 16 is a diagram showing a configuration of a computer system Cs that records the program on the flexible disk, and reads and reproduces the program from the flexible disk. For example, when recording the program for realizing the moving image encoding apparatus on a flexible disk, the computer system Cs writes the program on the flexible disk (the disk FD thereof) via the flexible disk drive FDD.
- the computer system Cs may execute the program in the flexible disk.
- the program is read from the flexible disk by the flexible disk drive FDD, and the read program is read from the flexible disk drive FDD to the computer. Transfer to system Cs.
- the computer system Cs implements the functions of the above-described moving picture coding apparatus by executing the transferred program.
- a disk (flexible disk) FD has been described as an example of a recording medium, but the same can be done using an optical disk.
- the recording medium is not limited to this, and can be similarly implemented as long as it can record a program such as an IC card, a ROM cassette, a USB (Universal Serial Bus) memory, a memory card (Memory Card), and the like.
- the program recorded in the HDD (hard disk drive), nonvolatile memory, RAM and ROM, SDD (Solid State Drive), etc. included in the computer system Cs is not limited to a recording medium removable from the computer system Cs.
- the computer system Cs may execute.
- the computer system Cs may execute a program acquired from the outside of the computer system Cs via a wired or wireless communication network.
- the moving picture encoding apparatuses shown in the first to fourth embodiments can be realized by the computer system Cs.
- each functional block included in the moving picture coding apparatus may be realized as an LSI which is an integrated circuit. These may be individually made into one chip, or may be made into one chip so as to include a part or all of them.
- the functional blocks other than the memory may be integrated into one chip.
- the name used here is LSI, but it may also be called IC, system LSI, super LSI, or ultra LSI depending on the degree of integration.
- the method of circuit integration is not limited to LSI, and implementation with a dedicated circuit or a general-purpose processor is also possible.
- An FPGA Field Programmable Gate Array
- a reconfigurable processor that can reconfigure the connection and setting of circuit cells inside the LSI may be used.
- only the means for storing the data to be encoded may be configured separately instead of being integrated into one chip.
- the vertical width of the first P slice region (the vertical width of the NoMC-P slice 42) is “a search range for motion detection in the second encoding step”. It may be equal to or greater than “ ⁇ I slice width”.
- the insertion method for inserting the I slice is the first insertion method for repeatedly inserting the I slice (S2005A), and a predetermined number of times.
- a moving image coding method including a second insertion method of inserting only (a predetermined number of times) and a selection step of selecting from (S2005C) may be constructed.
- an I slice is inserted based on whether or not the receiving side that receives the transmitted data notifies the encoding device that performs encoding that a decoding error has occurred on the receiving side (S2003). Even if a moving image coding method including a selection step of selecting an insertion method from a first insertion method of repeatedly inserting an I slice and a second insertion method of inserting only a predetermined number of times (only a predetermined number of times) is constructed. Good.
- the insertion method for inserting I slices is the same as the first insertion method for repeatedly inserting I slices and only a predetermined number of times (only a predetermined number of times).
- a moving picture coding method including a selection step of selecting from the second insertion method to be inserted may be constructed.
- the image processing method further includes a filtering step for performing deblocking filter processing (filter unit 2010, Sa4), and the vertical width of the first P slice region is determined so that one pixel is the other pixel in the deblocking filter processing.
- a moving picture encoding method may be constructed that is larger than the maximum distance (for example, a distance of two pixels) of the distance between two pixels that affects the two.
- the motion vector is detected in a unit smaller than a pixel (decimal motion compensation processing is performed), and the vertical width of the first P slice region is the motion compensation processing by the motion vector.
- one pixel may be larger than the maximum distance (for example, a distance of three pixels) between two pixels that affects the other pixel.
- the vertical width is, for example, refresh completion from the unrefreshed area (unrefreshed area PR3) to the blank area from the sum of the two maximum values of the filtering step and the decimal precision process. You may have the magnitude
- the width in the vertical direction is not limited to the width considering only the influence in the filter process or the like, but the width in consideration of the influence of the filter process or the like is further added. It may be more than the specified width.
- the vertical width of the first P slice region may be greater than or equal to the motion detection search range in the second encoding step.
- the lower end of the search range is above the upper end of the I slice of the picture to be encoded (target picture PS). That is, when the position of the upper end of the I slice of the target picture is the same as the position of the lower end of the I slice of the reference destination picture (reference destination picture PR), the lower end of the I slice of the reference destination picture.
- the lower end of the search range is the upper side. That is, the lower end of the search range is above the upper end of the non-refresh area of the reference picture.
- the search range for motion detection in the second encoding step the width of the I slice.
- the width of the NoMC-P slice 42 in (n) of FIG. 2 may be the same as the search range size W shown in (m), for example. That is, this width may be the same as W, or may be equal to or greater than W, or may be smaller than W.
- W search range size
- W the search range size
- the width of the NoMC-P slice 42 is relatively reduced, and the encoded data after the NoMC-P slice 42 is encoded is reduced. Thereby, avoidance of propagation and the smallness of the data after encoding can be compatible.
- the width of the search range is, for example, the downward direction in FIG. 2, that is, the width (W) in the traveling direction of the I slice, and may be the maximum value of the distance searched in the traveling direction.
- the width of the first P slice region may be, for example, equal to or greater than the search range (maximum value of the search distance). As a result, it is possible to avoid a sufficiently inappropriate propagation of image quality degradation. Furthermore, more specifically, the width of the first P slice region may be equal to or greater than the sum of the maximum value of the search distance and the above distance of the maximum value of the deblocking filter. Specifically, the width of the first P slice region may be equal to or greater than the sum of the maximum value of the search distance and the above distance of the maximum value of the motion compensation processing with decimal precision. Specifically, the width of the first P slice region may be equal to or greater than the sum of the three lengths.
- the first slice ((0) to [6] in FIG. 3) is selected from the plurality of slices ([0] to [6] in FIG. 3). And the ending slice) may be selected.
- the position of each slice eg, [0] NoMC-P slice
- the slice [0] NoMC-P slice
- the position at which the I slice ([1]) is set at each time (for example, time (d)) is set to the position next to the position at the immediately preceding time (time (c)).
- time (d) the position at which the I slice is set may be moved.
- the present invention can be used for a moving image encoding device, and in particular, can be used for a communication device or a set device that encodes a moving image, such as moving image bidirectional communication using a network, moving image distribution, or a monitoring camera. it can.
- DESCRIPTION OF SYMBOLS 100 Picture number counter part 102 Block number counter part 103 Slice type setting part 104 Motion search determination part 105 Slice insertion frequency setting part 200 Coding part 300 Packetization part 2001 Motion search part 2002 Motion compensation part 2003 Reference image duplication part 2004 In screen Prediction unit 2005 Selector unit 2006 Subtractor 2007 DCT / quantization unit 2008 Inverse quantization / inverse DCT unit 2009 Adder 2010 Filter unit 2011 Reference image holding unit 2012 Entropy encoding unit
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
(構成)
図1は、本発明の実施形態1の動画像符号化装置1の構成を示すブロック図である。
図2は、動画像符号化装置1が行うスライス分割方法におけるデータを示す図である。
図3は、スライス分割、および、スライスタイプ判定の動作を示す図である。
(構成)
図7は、本発明の実施の形態2の動画像符号化装置1Aの構成を示すブロック図である。以下の説明では、実施の形態1の動画像符号化装置1の構成と同じ構成については、説明を適宜省略する。
動き探索を行わずにインター符号化を行うと、動き探索を行ってインター符号化を行う場合よりも、符号化したデータのビット数が増加する。なぜならば、動き探索とは、符号化対象画像と、予測画像との間の差分値の大きさが小さくなるように探索することであるからである。つまり、動き探索を行わないということは、動き探索を行う場合のインター符号化より、大きさの大きな差分値を符号化することに相当するためである。
(1)多くの(予め定められた個数より多い)画像復号装置に、一斉に配信を行い(S2001の配信)、個々の画像復号装置から、パケット消失情報を受信したリフレッシュ動作の実現が困難な場合(S2001の「はい」)。
(2)送信するネットワークのビットレートが低く、圧縮率が低い(ビットレートが高い)Iスライスを頻繁に挿入すると(挿入し、かつ、誤り制御を行うと)、ネットワークで送信されるデータのデータ量が大きくて、画質劣化が著しい場合(S2002の「いいえ」)。
(3)通信路でパケット消失が起こったことを、動画像符号化装置1Aに通知することができない画像復号装置と、動画像符号化装置1Aが接続する場合(S2003の「いいえ」)。
実施の形態3の動画像符号化方法は、前記第1の符号化工程では、前記第1の領域(図19の第1のR1)に含まれる、複数の前記第1のPスライス(NoMC-Pスライス42Aa、42Ab)のそれぞれ(NoMC-Pスライス42A)を、動きベクトルを用いずに画面間符号化し、含まれる複数の前記第1のPスライス(NoMC-Pスライス42Aa、42Ab)のうちでの、前記第1のPスライスの大きさの最大値(例えばNoMC-Pスライス42Aaの大きさ)は、前記第2のPスライス(図19のMC-Pスライス43、44)の大きさの最大値(図19のMC-Pスライス43の大きさ)よりも小さい動画像符号化方法である。
実施の形態4の動画像符号化方法は、第1の時刻(例えば、図17の(i)の時刻)における、IスライスとPスライスとが含まれる第1の前記ピクチャ(図17の(i)のピクチャ)と、前記第1の時刻よりも遅い第2の時刻((k)の時刻)での、IスライスとPスライスとが含まれる第2の前記ピクチャ(図17の(k)のピクチャ)とをそれぞれ符号化し、さらに、前記第1の時刻と、前記第2の時刻との間の中間の時刻((j)の時刻)での、Iスライスが含まれない第3のピクチャ(図17の(j))を符号化する動画像符号化方法である。
本発明の実施の形態5では、上記実施の形態1~4で示した動画像符号化装置(動画像符号化装置1、動画像符号化装置1A)を実現するためのプログラムを、フレキシブルディスク等の記録媒体に記録する。そして、これにより、上記実施の形態1~4で示した処理を、独立したコンピュータシステムにおいて実施する。このような実施を行う例を説明する。
102 ブロック数カウンタ部
103 スライスタイプ設定部
104 動き探索判定部
105 スライス挿入回数設定部
200 符号化部
300 パケット化部
2001 動き探索部
2002 動き補償部
2003 参照画像複製部
2004 画面内予測部
2005 セレクタ部
2006 減算器
2007 DCT/量子化部
2008 逆量子化/逆DCT部
2009 加算器
2010 フィルタ部
2011 参照画像保持部
2012 エントロピー符号化部
Claims (19)
- 1つのピクチャにIスライスとPスライスとを含み、含まれる前記Iスライスの前記ピクチャ内の位置が、ピクチャ毎に、ピクチャの垂直方向に移動する動画像符号化方法であって、
前記Iスライスに隣接する第1の領域であって、前記垂直方向の移動の向きと逆向きに隣接する第1の領域に含まれる第1のPスライスを、動きベクトルを用いずに画面間符号化する第1の符号化工程と、
前記第1の領域以外の第2の領域に含まれる第2のPスライスを、動きベクトルを用いて画面間符号化する第2の符号化工程とを含む動画像符号化方法。 - 前記第1の領域の前記垂直方向の幅は、「前記第2の符号化工程における動き検出の探索範囲-Iスライスの幅」以上である、請求項1記載の動画像符号化方法。
- 前記第1の領域に含まれる前記第1のPスライスを、動きベクトルを用いて画面間符号化する第3の符号化工程をさらに含み、
前記第3の符号化工程では、Iスライスを繰り返し挿入する場合に、前記第1の領域に含まれる前記第1のPスライスを、動きベクトルを用いて画面間符号化し、
前記第1の符号化工程では、Iスライスを所定の回数だけ挿入する場合に、前記第1の領域に含まれる前記第1のPスライスを、動きベクトルを用いないで画面間符号化する請求項1記載の動画像符号化方法。 - ピクチャが符号化されたデータが送信されるネットワークの帯域に基づいて、Iスライスを挿入する挿入方法を、Iスライスを繰り返し挿入する第1の挿入方法と、所定の回数だけ挿入する第2の挿入方法とから選択する選択工程を含む請求項3記載の動画像符号化方法。
- 送信されたデータが受信される受信側から、符号化を行う符号化装置へ、前記受信側で復号エラーが発生したことを通知するか否かに基づいて、Iスライスを挿入する挿入方法を、Iスライスを繰り返し挿入する第1の挿入方法と、所定の回数だけ挿入する第2の挿入方法とから選択する選択工程を含む請求項3記載の動画像符号化方法。
- 他の受信機に一度に配信を行う送信方法に基づいて、Iスライスを挿入する挿入方法を、Iスライスを繰り返し挿入する第1の挿入方法と、所定の回数だけ挿入する第2の挿入方法とから選択する選択工程を含む請求項3記載の動画像符号化方法。
- デブロックフィルタ処理を行うフィルタ工程をさらに含み、
前記第1の領域の前記垂直方向の幅は、前記デブロックフィルタ処理において、一方の画素が他方の画素に影響を与える2つの画素の距離の最大値の距離より大きい請求項1記載の動画像符号化方法。 - 前記動きベクトルは、画素よりも小さい単位で検出され、
前記第1の領域の前記垂直方向の幅は、前記動きベクトルによる動き補償の処理において、一方の画素が、他方の画素に影響を与える2つの画素の距離の最大値の距離より大きい請求項1記載の動画像符号化方法。 - 前記第1の符号化工程では、前記第1の領域に含まれる、複数の前記第1のPスライスのそれぞれを、動きベクトルを用いずに画面間符号化し、
含まれる複数の前記第1のPスライスのうちでの、前記第1のPスライスの大きさの最大値は、前記第2のPスライスの大きさの最大値よりも小さい請求項1記載の動画像符号化方法。 - 前記第1のPスライスの大きさの前記最大値は、当該最大値の大きさを有する前記第1のPスライスが含まれる前記ピクチャにおける前記Iスライスの大きさ以上である請求項9記載の動画像符号化方法。
- 当該動画像符号化方法では、
第1の時刻における、IスライスとPスライスとが含まれる第1の前記ピクチャと、前記第1の時刻よりも遅い第2の時刻での、IスライスとPスライスとが含まれる第2の前記ピクチャとをそれぞれ符号化し、
さらに、前記第1の時刻と、前記第2の時刻との間の中間の時刻での、Iスライスが含まれない第3のピクチャを符号化する請求項1記載の動画像符号化方法。 - 前記第3のピクチャは、前記第1のピクチャにおける、前記第1の領域と、前記Iスライスの領域との両方で構成される領域を含み、
前記第1の符号化工程では、前記第3のピクチャの当該領域における第1のPスライスを、動きベクトルを用いずに画面間符号化し、
前記2の符号化工程では、前記第3のピクチャにおける、当該領域以外の他の領域の第2のPスライスを、動きベクトルを用いて画面間符号化する請求項11記載の動画像符号化方法。 - 前記第3のピクチャは、前記第2のピクチャにおける前記第1の領域と同じ領域を含み、
前記第1の符号化工程では、前記第3のピクチャの当該同じ領域における第1のPスライスを、動きベクトルを用いずに画面間符号化し、
前記2の符号化工程では、前記第3のピクチャの、当該同じ領域以外の他の領域の第2のPスライスを、動きベクトルを用いて画面間符号化する請求項11記載の動画像符号化方法。 - 前記第1の符号化工程では、前記第3のピクチャに含まれる前記領域の複数の前記第1のPスライスのそれぞれを、動きベクトルを用いずに画面間符号化し、
前記第3のピクチャに含まれる複数の前記第1のPスライスのうちでの、前記第1のPスライスの大きさの最大値は、当該第3のピクチャに含まれる前記第2のPスライスの大きさの最大値よりも小さい請求項12記載の動画像符号化方法。 - 前記第1の符号化工程では、前記第3のピクチャに含まれる前記領域の複数の前記第1のPスライスのそれぞれを、動きベクトルを用いずに画面間符号化し、
前記第3のピクチャに含まれる複数の前記第1のPスライスのうちでの、前記第1のPスライスの大きさの最大値は、当該第3のピクチャに含まれる前記第2のPスライスの大きさの最大値よりも小さい請求項13記載の動画像符号化方法。 - 1つのピクチャにIスライスとPスライスとを含み、含まれる前記Iスライスの前記ピクチャ内の位置が、ピクチャ毎に、ピクチャの垂直方向に移動する動画像符号化装置であって、
前記Iスライスに隣接する第1の領域であって、前記垂直方向の移動の向きと逆向きに隣接する第1の領域に含まれる第1のPスライスを、動きベクトルを用いずに画面間符号化し、前記第1の領域以外の第2の領域に含まれる第2のPスライスを、動きベクトルを用いて画面間符号化するように、スライスタイプを決定するスライスタイプ決定部と、
前記第1の領域の前記第1のPスライスを、動きベクトルを用いずに画面間符号化する第1の画面間符号化部と、
前記第2の領域の前記第2のPスライスを、動きベクトルを用いて画面間符号化する第2の画面間符号化部とを備えた動画像符号化装置。 - Iスライスの挿入回数が所定値以上かどうかを判定するスライス挿入回数設定部を備え、
前記スライスタイプ決定部は、前記挿入回数が所定値未満であると、前記スライス挿入回数設定部により判定された場合には、前記第1の領域、および前記第2の領域の両方を使用し、所定値以上であると判定された場合には、前記第2の領域のみを使用する請求項16記載の動画像符号化装置。 - 1つのピクチャにIスライスとPスライスとを含み、含まれる前記Iスライスの前記ピクチャ内の位置が、ピクチャ毎に、ピクチャの垂直方向に移動する際において、コンピュータが、複数のピクチャを符号化するためのコンピュータプログラムであって、
前記Iスライスに隣接する第1の領域であって、前記垂直方向の移動の向きと逆向きに隣接する第1の領域に含まれる第1のPスライスを、動きベクトルを用いずに画面間符号化する第1の符号化工程と、
前記第1の領域以外の第2の領域に含まれる第2のPスライスを、動きベクトルを用いて画面間符号化する第2の符号化工程とを前記コンピュータに実行させるためのコンピュータプログラム。 - 1つのピクチャにIスライスとPスライスとを含み、含まれる前記Iスライスの前記ピクチャ内の位置が、ピクチャ毎に、ピクチャの垂直方向に移動して、複数のピクチャを符号化する集積回路であって、
前記Iスライスに隣接する第1の領域であって、前記垂直方向の移動の向きと逆向きに隣接する第1の領域に含まれる第1のPスライスを、動きベクトルを用いずに画面間符号化する第1の符号化部と、
前記第1の領域以外の第2の領域に含まれる第2のPスライスを、動きベクトルを用いて画面間符号化する第2の符号化部とを備える集積回路。
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2010548973A JPWO2011004598A1 (ja) | 2009-07-10 | 2010-07-07 | 動画像符号化方法、装置、プログラム、および集積回路 |
CN2010800019898A CN102077597A (zh) | 2009-07-10 | 2010-07-07 | 运动图像编码方法、装置、程序、以及集成电路 |
US13/001,072 US20120275518A1 (en) | 2009-07-10 | 2010-07-07 | Moving picture coding method, apparatus, program, and integrated circuit |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2009-163843 | 2009-07-10 | ||
JP2009163843 | 2009-07-10 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2011004598A1 true WO2011004598A1 (ja) | 2011-01-13 |
Family
ID=43429026
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2010/004431 WO2011004598A1 (ja) | 2009-07-10 | 2010-07-07 | 動画像符号化方法、装置、プログラム、および集積回路 |
Country Status (4)
Country | Link |
---|---|
US (1) | US20120275518A1 (ja) |
JP (1) | JPWO2011004598A1 (ja) |
CN (1) | CN102077597A (ja) |
WO (1) | WO2011004598A1 (ja) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10390076B2 (en) | 2016-03-29 | 2019-08-20 | Mitsubishi Electric Corporation | Image receiving/reproducing device, image generating/transmitting device, display system, image receiving/reproducing method, image generating/transmitting method, and computer readable medium |
US10674172B2 (en) | 2016-04-19 | 2020-06-02 | Mitsubishi Electric Corporation | Image processing apparatus, image processing method, and computer-readable recording medium |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106851306B (zh) | 2011-01-12 | 2020-08-04 | 太阳专利托管公司 | 动态图像解码方法和动态图像解码装置 |
US10404998B2 (en) * | 2011-02-22 | 2019-09-03 | Sun Patent Trust | Moving picture coding method, moving picture coding apparatus, moving picture decoding method, and moving picture decoding apparatus |
MX2013009864A (es) | 2011-03-03 | 2013-10-25 | Panasonic Corp | Metodo de codificacion de imagenes en movimiento, metodo de decodificacion de imagenes en movimiento, aparato de codificacion de imagenes en movimiento, aparato de decodificacion de imagenes en movimiento y aparato de codificacion y decodificacion de imagenes en movimiento. |
JP2013247651A (ja) * | 2012-05-29 | 2013-12-09 | Canon Inc | 符号化装置、符号化方法およびプログラム |
JP6019797B2 (ja) * | 2012-06-22 | 2016-11-02 | 富士通株式会社 | 動画像符号化装置、動画像符号化方法、及びプログラム |
US11109036B2 (en) | 2013-10-14 | 2021-08-31 | Microsoft Technology Licensing, Llc | Encoder-side options for intra block copy prediction mode for video and image coding |
US20150103909A1 (en) * | 2013-10-14 | 2015-04-16 | Qualcomm Incorporated | Multi-threaded video encoder |
AU2013403224B2 (en) | 2013-10-14 | 2018-10-18 | Microsoft Technology Licensing, Llc | Features of intra block copy prediction mode for video and image coding and decoding |
US10390034B2 (en) | 2014-01-03 | 2019-08-20 | Microsoft Technology Licensing, Llc | Innovations in block vector prediction and estimation of reconstructed sample values within an overlap area |
KR102353787B1 (ko) | 2014-01-03 | 2022-01-19 | 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 | 비디오 및 이미지 코딩/디코딩에서의 블록 벡터 예측 |
US11284103B2 (en) | 2014-01-17 | 2022-03-22 | Microsoft Technology Licensing, Llc | Intra block copy prediction with asymmetric partitions and encoder-side search patterns, search ranges and approaches to partitioning |
KR102401946B1 (ko) | 2014-03-04 | 2022-05-24 | 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 | 인트라 블록 카피 예측에서의 스킵 모드 및 블록 플립핑 |
KR102311815B1 (ko) | 2014-06-19 | 2021-10-13 | 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 | 통합된 인트라 블록 카피 및 인터 예측 모드 |
JP2017535145A (ja) | 2014-09-30 | 2017-11-24 | マイクロソフト テクノロジー ライセンシング,エルエルシー | 波面並列処理が可能にされた場合のピクチャ内予測モードに関する規則 |
JP2017069866A (ja) * | 2015-10-01 | 2017-04-06 | 富士通株式会社 | 動画像符号化装置、動画像符号化方法及び動画像符号化用コンピュータプログラム |
US10621731B1 (en) * | 2016-05-31 | 2020-04-14 | NGCodec Inc. | Apparatus and method for efficient motion estimation for different block sizes |
US10986349B2 (en) | 2017-12-29 | 2021-04-20 | Microsoft Technology Licensing, Llc | Constraints on locations of reference blocks for intra block copy prediction |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS60146588A (ja) * | 1984-01-11 | 1985-08-02 | Nec Corp | 動画像信号の符号化方式およびその装置 |
JPH04286489A (ja) * | 1990-11-16 | 1992-10-12 | General Instr Corp | 動作補償を伴った連続ビデオ影像をリフレッシュする方法とその装置 |
JP2008258953A (ja) * | 2007-04-05 | 2008-10-23 | Ibex Technology Co Ltd | 符号化装置、符号化プログラムおよび符号化方法 |
JP2009021907A (ja) * | 2007-07-13 | 2009-01-29 | Fujitsu Ltd | 動画像符号化装置及び動画像符号化方法 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002043385A2 (en) * | 2000-10-10 | 2002-05-30 | Broadcom Corporation | System and method for special reproduction modes of a digitally encoded video data stream |
DE102005029127A1 (de) * | 2005-06-23 | 2007-04-19 | On Demand Microelectronics Ag | Verfahren und Vorrichtung zur optimierten prädiktiven Videocodierung |
CN101309405B (zh) * | 2007-05-14 | 2011-04-20 | 华为技术有限公司 | 参考数据载入的方法及装置 |
JP4821723B2 (ja) * | 2007-07-13 | 2011-11-24 | 富士通株式会社 | 動画像符号化装置及びプログラム |
JP5050964B2 (ja) * | 2008-03-31 | 2012-10-17 | 富士通株式会社 | 復号装置,復号方法および復号プログラム |
-
2010
- 2010-07-07 WO PCT/JP2010/004431 patent/WO2011004598A1/ja active Application Filing
- 2010-07-07 JP JP2010548973A patent/JPWO2011004598A1/ja not_active Withdrawn
- 2010-07-07 US US13/001,072 patent/US20120275518A1/en not_active Abandoned
- 2010-07-07 CN CN2010800019898A patent/CN102077597A/zh active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS60146588A (ja) * | 1984-01-11 | 1985-08-02 | Nec Corp | 動画像信号の符号化方式およびその装置 |
JPH04286489A (ja) * | 1990-11-16 | 1992-10-12 | General Instr Corp | 動作補償を伴った連続ビデオ影像をリフレッシュする方法とその装置 |
JP2008258953A (ja) * | 2007-04-05 | 2008-10-23 | Ibex Technology Co Ltd | 符号化装置、符号化プログラムおよび符号化方法 |
JP2009021907A (ja) * | 2007-07-13 | 2009-01-29 | Fujitsu Ltd | 動画像符号化装置及び動画像符号化方法 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10390076B2 (en) | 2016-03-29 | 2019-08-20 | Mitsubishi Electric Corporation | Image receiving/reproducing device, image generating/transmitting device, display system, image receiving/reproducing method, image generating/transmitting method, and computer readable medium |
US10674172B2 (en) | 2016-04-19 | 2020-06-02 | Mitsubishi Electric Corporation | Image processing apparatus, image processing method, and computer-readable recording medium |
Also Published As
Publication number | Publication date |
---|---|
US20120275518A1 (en) | 2012-11-01 |
JPWO2011004598A1 (ja) | 2012-12-20 |
CN102077597A (zh) | 2011-05-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2011004598A1 (ja) | 動画像符号化方法、装置、プログラム、および集積回路 | |
JP5417402B2 (ja) | 画像符号化方法および画像符号化装置 | |
JP5232114B2 (ja) | 画像符号化装置、及び画像符号化方法 | |
JP4570532B2 (ja) | 動き検出装置、動き検出方法、集積回路およびプログラム | |
JP4769717B2 (ja) | 画像復号化方法 | |
KR100628494B1 (ko) | 화상복호화방법 및 장치와 기록매체 | |
JP4884290B2 (ja) | 動画像復号化集積回路、動画像復号化方法、動画像復号化装置及び動画像復号化プログラム | |
JP4838253B2 (ja) | 画像符号化装置、画像符号化方法、画像復号化装置、画像復号化方法、および記録方法 | |
JP4755093B2 (ja) | 画像符号化方法および画像符号化装置 | |
EP2141930A1 (en) | Motion detection apparatus, motion detection method, and motion detection program | |
US8948255B2 (en) | Moving picture coding method for quantizing a plurality of pictures using a quantization step and a small quantization step having a size smaller than a size of the quantization step | |
US20120307896A1 (en) | Moving picture coding method, moving picture decoding method, and apparatuses of the same | |
JP4534935B2 (ja) | トランスコーダ、記録装置及びトランスコード方法 | |
US5706053A (en) | Compressed motion video code processor | |
US20070171977A1 (en) | Moving picture coding method and moving picture coding device | |
JP4440863B2 (ja) | 符号化復号化装置、符号化復号化方法、符号化復号化集積回路、および符号化復号化プログラム | |
JP2008011117A (ja) | 画像符号化におけるインターレース符号化時の参照ピクチャ決定方法 | |
JP2007228560A (ja) | 動画像符号化方法および動画像符号化装置 | |
TW201440500A (zh) | 影像處理裝置及方法 | |
JP2008072182A (ja) | 動画像復号化装置、動画像復号化方法、動画像復号化プログラム、動画像符号化装置、動画像符号化方法、動画像符号化プログラム、及び動画像符号化復号化装置 | |
JP4892450B2 (ja) | 画像符号化装置および画像符号化方法 | |
JP5240230B2 (ja) | トランスコーダ、記録装置及びトランスコード方法 | |
JP2007020123A (ja) | 動き検出装置、および動き検出方法 | |
JP2007151163A (ja) | 画像復号化方法及び装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 201080001989.8 Country of ref document: CN |
|
ENP | Entry into the national phase |
Ref document number: 2010548973 Country of ref document: JP Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13001072 Country of ref document: US |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 10796909 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 10796909 Country of ref document: EP Kind code of ref document: A1 |