JP2008124707A

JP2008124707A - Moving image encoding apparatus

Info

Publication number: JP2008124707A
Application number: JP2006305274A
Authority: JP
Inventors: Eiji Obara; 英司小原
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2006-11-10
Filing date: 2006-11-10
Publication date: 2008-05-29
Anticipated expiration: 2026-11-10
Also published as: JP4762860B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a moving image encoding apparatus capable of improving the quality of an encoded image of the whole image sequence to be encoded in the case of encoding a moving image (especially, in the case of encoding a still image or an image of slow movement) by field structure. <P>SOLUTION: The moving image encoding apparatus provided with an encoding part 100 for compression-encoding an input image signal which is interlace images continued in a time base by the field structure of a GPO unit is provided with: a feature value extraction circuit 113 for extracting an image feature value from the input image signal; a still image judgment part 200 for generating still image judgment information indicating whether the input image signal is a still image or an image of slow movement on the basis of the image feature value; and an encoding control part 300 for calculating the target information generation quantity of a picture to be encoded in accordance with the still image judgment information and controlling th encoding part 100 so that the real information generation quality of a field to be encoded in the encoding part 100 is included within the target information generation quantity. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、動画像である入力画像信号を符号化して伝送する動画像符号化装置に関する。 The present invention relates to a moving image encoding apparatus that encodes and transmits an input image signal that is a moving image.

インターレース画像である動画像の入力画像信号を符号化する構造としては、フレーム構造とフィールド構造の２種類がある。一般的に、フレーム構造は、静止した画像、あるいは動きの少ない画像に有効であり、フィールド構造は、動きの多い画像に有効であるといわれている。 There are two types of structures for encoding an input image signal of a moving image that is an interlaced image, a frame structure and a field structure. In general, the frame structure is effective for a still image or an image with little motion, and the field structure is effective for an image with much motion.

インターレース画像の符号化にフィールド構造を採用した場合には、フレーム構造を採用した場合と比較して、画像入力から符号化開始までの時間が短くできる。図１４は、符号化の対象となる入力画像信号の構造を示す説明図である。この図１４に示すように、インターレース画像は、ＯｄｄフィールドとＥｖｅｎフィールドから構成されており、インターレース画像の符号化においては、先ず、動画像符号化装置にＯｄｄフィールドの画像信号が入力され、次に、Ｅｖｅｎフィールドの画像信号が入力される。 When the field structure is used for encoding the interlaced image, the time from the image input to the start of encoding can be shortened compared to the case where the frame structure is used. FIG. 14 is an explanatory diagram showing the structure of an input image signal to be encoded. As shown in FIG. 14, an interlaced image is composed of an Odd field and an Even field. In encoding of an interlaced image, first, an image signal of the Odd field is input to the moving image encoding device, and then The image signal of the Even field is input.

フレーム構造は、ＯｄｄフィールドとＥｖｅｎフィールドを合成した構造で符号化を行うのに対して、フィールド構造ではＯｄｄフィールドとＥｖｅｎフィールドの符号化は、独立して行なわれる。したがって、フィールド構造の場合、Ｏｄｄフィールドの画像信号が入力され、符号化に必要な最低限の画像信号が入力された時点で符号化開始が可能となる。 The frame structure is encoded with a structure in which the Odd field and the Even field are combined, whereas the Odd field and the Even field are encoded independently in the field structure. Therefore, in the case of the field structure, encoding can be started when an image signal of the Odd field is input and a minimum image signal necessary for encoding is input.

一方、フレーム構造は、Ｏｄｄフィールドの画像信号とＥｖｅｎフィールドの画像信号で構成されるため、Ｏｄｄフィールドの画像信号の入力完了後、フレーム構造での符号化に必要な最低限のＥｖｅｎフィールドの画像信号が入力された時点で符号化開始が可能となる。 On the other hand, since the frame structure is composed of an Odd field image signal and an Even field image signal, after the input of the Odd field image signal is completed, the minimum Even field image signal necessary for encoding in the frame structure is required. Encoding can be started at the time when is input.

このことより、フィールド構造を採用する方が、画像信号の入力を開始した時点から符号化結果の出力を開始する時点までの時間、すなわち、符号化遅延が、フレーム構造を採用する場合に比べて短くなる。 Therefore, the time from when the input of the image signal is started to the time when the output of the encoding result is started, that is, the encoding delay is longer when the field structure is adopted than when the frame structure is adopted. Shorter.

一方、従来の動画像符号化装置では、フィールド構造で符号化した画質と比較して、フレーム構造で符号化した画質の方が良いとされていることから、フレーム構造を採用している場合が多い。フィールド構造を採用した符号化としては、例えば、動きの速いシーンではフィールド構造で符号化を行い、動きの少ないシーンではフレーム構造で符号化を行うように、シーンに応じて適応的にフィールド構造・フレーム構造の切り替えを行う方式が存在する。 On the other hand, in the conventional moving image encoding apparatus, since the image quality encoded with the frame structure is better than the image quality encoded with the field structure, the frame structure may be adopted. Many. For example, the field structure is adaptively adapted to the scene so that the field structure is encoded in a fast-moving scene and the frame structure is encoded in a low-motion scene. There is a method for switching the frame structure.

よって、静止画像、あるいは動きの少ない画像をフィールド構造で符号化した場合の画質を向上させることができれば、従来と比較して、画質は同等で、符号化遅延が短い動画像符号化装置を提供できることになる。 Therefore, if the image quality when a still image or an image with little motion is encoded with a field structure can be improved, a moving image encoding device that has the same image quality and a shorter encoding delay than the conventional one is provided. It will be possible.

しかしながら、上述したように、フレーム構造が有利とされる静止画像、あるいは動きの少ない画像を対象にフィールド構造で符号化を行う場合、フレーム構造を用いた符号化画像の画質と比較して、符号化画像の画質が劣るとされる。 However, as described above, when encoding with a field structure for a still image in which the frame structure is advantageous or an image with little motion, the image quality is compared with the image quality of the encoded image using the frame structure. The image quality of the converted image is inferior.

例えば、デジタル放送サービスに採用されているＭＰＥＧ−２（ＭｏｖｉｎｇＰｉｃｔｕｒｅＥｘｐｅｒｔｓＧｒｏｕｐ−２）を符号化方式に用いる場合、ＧＯＰ（ＧｒｏｕｐＰｉｃｔｕｒｅｓ）単位で符号化が行われる。図１５は、ＧＯＰ構造の一例を示す説明図である。ただし、この図１５に示すＧＯＰは、単なる一例であり、すべてのＧＯＰがこの構成であるとは限らない。 For example, when MPEG-2 (Moving Picture Experts Group-2) adopted in a digital broadcasting service is used as an encoding method, encoding is performed in units of GOP (Group Pictures). FIG. 15 is an explanatory diagram showing an example of the GOP structure. However, the GOP shown in FIG. 15 is merely an example, and not all GOPs have this configuration.

また、図１５では、複数のＧＯＰとして、ＧＯＰ（ｎ−１）、ＧＯＰ（ｎ）、ＧＯＰ（ｎ＋１）を例示している。そして、図１５では、異なるＧＯＰのピクチャ間隔が同一ＧＯＰ内のピクチャ間隔より広く描いているが、これは、ＧＯＰの境界を示すためであり、実際にはすべてのピクチャ間隔は、同一である。ピクチャ間隔とは、各ピクチャが入力される時間間隔を意味する。 In FIG. 15, GOP (n−1), GOP (n), and GOP (n + 1) are illustrated as a plurality of GOPs. In FIG. 15, the picture intervals of different GOPs are drawn wider than the picture intervals in the same GOP, but this is to indicate the boundary of the GOP, and in fact, all picture intervals are the same. The picture interval means a time interval at which each picture is input.

図１５に示すように、ＧＯＰは、複数のピクチャで構成されており、図１５の例では、ＧＯＰ（ｎ）は、ピクチャ０からピクチャ２７までの２８ピクチャ、すなわち、２８フィールドで構成されている。なお、各ピクチャに付されている番号０〜２７は、ピクチャを符号化順に並べ替えた後にナンバリングした数字を示している。これらのピクチャにおいて、Ｏｄｄフィールドは、０と偶数番号のピクチャであり、Ｅｖｅｎフィールドは、奇数番号のピクチャである。 As shown in FIG. 15, GOP is composed of a plurality of pictures, and in the example of FIG. 15, GOP (n) is composed of 28 pictures from picture 0 to picture 27, that is, 28 fields. . Numbers 0 to 27 assigned to each picture indicate numbers numbered after the pictures are rearranged in the encoding order. In these pictures, the Odd field is an even-numbered picture with 0, and the Even field is an odd-numbered picture.

ピクチャには３種類のタイプがあり、具体的には、ピクチャ内で符号化（イントラ符号化）を行うＩピクチャ（Ｉｎｔｒａ−ＣｏｄｅｄＰｉｃｔｕｒｅ）、過去のフィールドを参照して動き補償予測を行うＰピクチャ（Ｐｒｅｄｉｃｔｉｖｅ−ＣｏｄｅｄＰｉｃｔｕｒｅ）、過去と未来のフィールドを参照して動き補償予測を行うＢピクチャ（ＢｉｄｉｒｅｃｔｉｏｎａｌｌｙＰｒｅｄｉｃｔｉｖｅ−ＣｏｄｅｄＰｉｃｔｕｒｅ）の３タイプがある。 There are three types of pictures. Specifically, an I picture (Intra-coded Picture) that performs coding (intra coding) within a picture, and a P picture that performs motion compensation prediction with reference to past fields There are three types of (Predictive-Coded Picture) and B picture (Bidirectionally Predictive-Coded Picture) that performs motion compensation prediction with reference to past and future fields.

また、符号化結果である実情報発生量は、通常、Ｉピクチャが最も多く、次に、Ｐピクチャ、最も少ないのがＢピクチャである。また、フィールド構造のＧＯＰは、符号化順としては、ＯｄｄフィールドがＩピクチャ、ＥｖｅｎフィールドがＰピクチャとして符号化されるＩＰフィールドペアから始まる。図１５のＧＯＰ（ｎ）ではピクチャ０（Ｉピクチャ）とピクチャ１（Ｐピクチャ）がＩＰフィールドペアである。 Also, the actual information generation amount that is the result of encoding is usually the largest for I pictures, then P pictures, and the smallest is B pictures. The field structure GOP starts with an IP field pair in which the Odd field is encoded as an I picture and the Even field is encoded as a P picture. In GOP (n) in FIG. 15, picture 0 (I picture) and picture 1 (P picture) are IP field pairs.

フィールド構造を採用して動画像符号化を行う場合、Ｐピクチャは、過去の２つのフィールドを参照して動き補償予測を行うことができる。例えば、図１５のＧＯＰ（ｎ）を構成しているＰピクチャのピクチャ４は、ＩピクチャであるＯｄｄフィールドのピクチャ０とＰピクチャであるＥｖｅｎフィールドのピクチャ１を、ピクチャ５はＯｄｄフィールドのピクチャ４とＥｖｅｎフィールドのピクチャ１を参照できる。 When moving picture coding is performed using a field structure, a P picture can perform motion compensation prediction with reference to two past fields. For example, picture 4 of P picture making up GOP (n) in FIG. 15 is picture 0 of Odd field that is I picture and picture 1 of Even field that is P picture, and picture 5 is picture 4 of Odd field. And even field picture 1 can be referred to.

図１６は、動き補償予測における参照面の一例を示す説明図である。ただし、ＩＰフィールドペアのＰピクチャであるピクチャ１の参照可能なピクチャは制限されており、Ｏｄｄフィールドのピクチャ０への参照のみ可能である。したがって、図１５において、ＧＯＰ（ｎ）のピクチャ１は、ＧＯＰ（ｎ−１）のピクチャ２５への参照、ＧＯＰ（ｎ＋１）のピクチャ１は、ＧＯＰ（ｎ）のピクチャ２５への参照は禁止されている。すなわち、ＩＰフィールドペアのＰピクチャは、同一パリティのピクチャへの参照が禁止され、異パリティ、すなわちＯｄｄフィールドのＩピクチャへのみ参照が可能である。 FIG. 16 is an explanatory diagram illustrating an example of a reference plane in motion compensation prediction. However, the referenceable picture of picture 1 which is the P picture of the IP field pair is limited, and only reference to picture 0 of the odd field is possible. Accordingly, in FIG. 15, GOP (n) picture 1 is prohibited from reference to GOP (n−1) picture 25, and GOP (n + 1) picture 1 is prohibited from reference to GOP (n) picture 25. ing. That is, the P picture of the IP field pair is prohibited from referring to the picture with the same parity, and can be referred only to the I picture of the different parity, that is, the odd field.

静止画像、または動きが緩慢な画像では、同一パリティのピクチャ間の相関と比べ、異パリティのピクチャ間の相関は、低くなる傾向がある。また、ＩＰフィールドペアのＰピクチャ１は、異パリティのＩピクチャ０のみ参照可能である。このため、その符号化画質は、Ｉピクチャ０の符号化画質と比較して劣化が大きくなり、Ｐピクチャ１を参照するピクチャ、特に、同一パリティであるＥｖｅｎフィールドのピクチャ、例えば、Ｐピクチャ５、Ｂピクチャ７、Ｐピクチャ９などの符号化画質の劣化の原因となる。 In a still image or an image with slow motion, the correlation between pictures of different parity tends to be lower than the correlation between pictures of the same parity. Further, P picture 1 of the IP field pair can refer only to I picture 0 of different parity. Therefore, the encoded image quality is greatly deteriorated as compared with the encoded image quality of I picture 0, and a picture referring to P picture 1, particularly a picture of the Even field having the same parity, for example, P picture 5, This causes deterioration of the encoded image quality of the B picture 7 and the P picture 9.

図１７は、フィールド構造による符号化結果におけるＯｄｄフィールドとＥｖｅｎフィールドのＳＮＲ（ＳｉｇｎａｌｔｏＮｏｉｓｅＲａｔｉｏｎ：信号対雑音比）の遷移を示す図である。より具体的には、ＩＴＥ標準画像の１つである「ＷｅａｔｈｅｒＲｅｐｏｒｔ」（ＨＤＴＶ解像度１０８０ｉ）を対象に、ＴＭ５（ＴｅｓｔＭｏｄｅｌ５）の方式を用いて図１５に示したＧＯＰ構造で符号化を行った結果であるＯｄｄフィールドとＥｖｅｎフィールドのＳＮＲの遷移を示す。 FIG. 17 is a diagram illustrating a transition of an SNR (Signal to Noise Ratio) between the Odd field and the Even field in the encoding result by the field structure. More specifically, “Weather Report” (HDTV resolution 1080i), which is one of the ITE standard images, is encoded using the GOP structure shown in FIG. 15 using the TM5 (Test Model 5) method. The SNR transition of the Odd field and the Even field, which is the result of the above.

図１７において、ＩＰフィールドペアのＳＮＲを示している箇所を○で囲んでいる。この画像は、静止画像ではないものの、背景は静止しており、動きとしては、前面の人物の腕の動きが存在する程度の非常に動きが少ない緩慢な画像である。 In FIG. 17, a part indicating the SNR of the IP field pair is circled. Although this image is not a still image, the background is still, and the motion is a slow image with very little motion to the extent that there is a motion of the arm of the front person.

図１７から明らかな通り、ＩＰフィールドペアのＩピクチャ０とＰピクチャ１のＳＮＲの差が大きく、ＥｖｅｎフィールドであるＰピクチャ１のＳＮＲの方が低いことがわかる。また、各ＧＯＰ内の遷移を見ると、ＥｖｅｎフィールドのＳＮＲが、ＯｄｄフィールドのＳＮＲの下側を遷移しながら徐々にＳＮＲの差が縮まっていることが分かる。なお、ビットレートは、１８Ｍｂｐｓに設定して符号化を行った。 As can be seen from FIG. 17, the SNR difference between I picture 0 and P picture 1 in the IP field pair is large, and the SNR of P picture 1 as the even field is lower. Further, when looking at the transitions within each GOP, it can be seen that the SNR difference of the Even field gradually decreases as the SNR of the Even field transitions below the SNR of the Odd field. The bit rate was set to 18 Mbps for encoding.

図１７に示したＳＮＲの結果から、動きが非常に緩慢な画像を符号化した場合、ＧＯＰ先頭のＩＰフィールドペアのＰピクチャの劣化が映像シーケンス全体のＳＮＲの平均値を下げていることがわかる。なお、この状況は、静止画像を対象とした符号化結果についても同様である。 From the result of the SNR shown in FIG. 17, it can be seen that when an image with very slow motion is encoded, the deterioration of the P picture of the IP field pair at the head of the GOP reduces the average value of the SNR of the entire video sequence. . This situation is the same for the encoding result for still images.

そこで、静止画像、または動きが緩慢な画像の符号化画質を向上するために、従来の動画像符号化装置では、Ｉピクチャに対する目標情報発生量の割合を多くし、Ｐピクチャ、Ｂピクチャに対する目標情報発生量の割合を少なくしているものがある（例えば、特許文献１参照）。 Therefore, in order to improve the encoded image quality of a still image or an image with slow motion, the conventional moving image encoding apparatus increases the ratio of the target information generation amount with respect to the I picture and increases the target information with respect to the P picture and B picture. Some of them reduce the rate of information generation (see, for example, Patent Document 1).

特開２００５−３０３３６２号公報Japanese Patent Laying-Open No. 2005-303362

しかしながら、従来技術には次のような課題がある。
従来の動画像符号化装置は、各ピクチャに対する目標情報発生量の割合を調整するように構成されており、静止画像、または動きが緩慢な画像の符号化画質を向上させる効果を有する。しかしながら、従来の動画像符号化装置は、フレーム画像を対象としており、フィールド画像を用いた符号化に適用する場合には、次の課題を必ずしも解決していない。 However, the prior art has the following problems.
The conventional moving image coding apparatus is configured to adjust the ratio of the target information generation amount for each picture, and has an effect of improving the coding image quality of a still image or an image with slow motion. However, the conventional moving image encoding device is intended for frame images, and does not necessarily solve the following problems when applied to encoding using field images.

すなわち、フレーム構造では、ＯｄｄフィールドとＥｖｅｎフィールドとを合わせて符号化するため、Ｉピクチャの目標情報発生量の割合を多くすることは、ＯｄｄフィールドとＥｖｅｎフィールド両方の符号化画質が向上することになる。 That is, in the frame structure, since the Odd field and the Even field are encoded together, increasing the ratio of the target information generation amount of the I picture improves the encoded image quality of both the Odd field and the Even field. Become.

一方、フィールド構造で符号化を行う場合には、ＯｄｄフィールドとＥｖｅｎフィールドは、独立して符号化される。このため、従来技術のようにＩピクチャの目標情報発生量の割合を多くすると、Ｏｄｄフィールド側の画質は、向上するが、Ｅｖｅｎフィールド側の符号化画質は、必ずしも向上するとは限らない。 On the other hand, when encoding is performed with a field structure, the Odd field and the Even field are encoded independently. For this reason, when the ratio of the target information generation amount of the I picture is increased as in the prior art, the image quality on the odd field side is improved, but the encoded image quality on the even field side is not necessarily improved.

より具体的には、ＩＰペアのＰピクチャは、Ｉピクチャのみ参照するが、異パリティであるため相関が低く、Ｉピクチャの目標情報発生量の割合を多くしてＩピクチャの符号化画質を向上しても、Ｐピクチャの符号化画質が向上するとは限らない。 More specifically, the P picture of the IP pair refers only to the I picture, but has a low correlation because of the different parity, and increases the target information generation amount of the I picture to improve the encoded picture quality of the I picture. Even so, the encoded picture quality of the P picture is not always improved.

さらに、低遅延符号化を行う場合には、通常の符号化の場合と比較して、バッファサイズを小さくする。Ｉピクチャの目標情報発生量の割合が多くなると、遅延量が増加することに加え、バッファ破綻を回避するため、Ｐピクチャの目標情報発生量の割合をより少なくする必要が生じる。したがって、Ｉピクチャの符号化画質の向上がＰピクチャには影響しない可能性があることに加え、Ｐピクチャの目標情報発生量自体を少なくすることによる符号化画質の劣化が生ずることになる。 Furthermore, when performing low-delay encoding, the buffer size is made smaller than in normal encoding. When the ratio of the target information generation amount of the I picture increases, in addition to the increase of the delay amount, it is necessary to reduce the ratio of the target information generation amount of the P picture in order to avoid buffer failure. Therefore, the improvement of the encoded picture quality of the I picture may not affect the P picture, and the encoded picture quality is deteriorated by reducing the target information generation amount of the P picture itself.

本発明は上述のような課題を解決するためになされたもので、フィールド構造で動画像の符号化を行う際（特に、静止画像、または動きの緩慢な画像の符号化を行う際）に、符号化対象の映像シーケンス全体の符号化画像の画質を向上させることのできる動画像符号化装置を得ることを目的とする。 The present invention has been made to solve the above-described problems, and when encoding a moving image with a field structure (especially when encoding a still image or a slow-moving image), It is an object of the present invention to obtain a moving image encoding apparatus capable of improving the image quality of an encoded image of the entire video sequence to be encoded.

本発明に係る動画像符号化装置は、時間的に連続したインターレース画像である入力画像信号をＧＯＰ単位のフィールド構造で圧縮符号化する符号化部を有する動画像符号化装置において、入力画像信号の中から画像特徴量を抽出する特徴量抽出回路と、画像特徴量に基づいて入力画像信号が静止画像または動きが緩慢な画像であるか否かを示す静止画像判定情報を生成する静止画像判定部と、静止画像判定情報に応じて符号化の対象となるピクチャの目標情報発生量を算出し、符号化部で符号化を実行するフィールドの実情報発生量が前記目標情報発生量に納まるように符号化部を制御する符号化制御部とを備えるものである。 A moving image encoding apparatus according to the present invention includes an encoding unit that compresses and encodes an input image signal that is a temporally continuous interlaced image with a field structure in units of GOPs. A feature amount extraction circuit that extracts an image feature amount from the inside, and a still image determination unit that generates still image determination information indicating whether the input image signal is a still image or an image with slow motion based on the image feature amount Then, the target information generation amount of the picture to be encoded is calculated according to the still image determination information, and the actual information generation amount of the field where the encoding is performed by the encoding unit falls within the target information generation amount. And an encoding control unit that controls the encoding unit.

本発明によれば、静止画像判定情報に応じて符号化の対象となるピクチャの目標情報発生量を求め、目標情報発生量に基づいて符号化部を制御し、ＯｄｄフィールドとＥｖｅｎフィールドの符号化画質を同等にすることにより、フィールド構造で動画像の符号化を行う際（特に、静止画像、または動きの緩慢な画像の符号化を行う際）に、符号化対象の映像シーケンス全体の符号化画像の画質を向上させることのできる動画像符号化装置を得ることができる。 According to the present invention, the target information generation amount of the picture to be encoded is obtained according to the still image determination information, the encoding unit is controlled based on the target information generation amount, and the Odd field and the Even field are encoded. Encoding the entire video sequence to be encoded when encoding a moving image with a field structure (particularly when encoding a still image or a slow-moving image) by equalizing the image quality A moving image encoding apparatus capable of improving the image quality of an image can be obtained.

実施の形態１．
図１は、本発明の実施の形態１における動画像符号化装置の構成を示すブロック図である。図１に示した動画像符号化装置１は、符号化部１００、静止画像判定部２００、および符号化制御部３００で構成される。 Embodiment 1 FIG.
FIG. 1 is a block diagram showing a configuration of a moving picture coding apparatus according to Embodiment 1 of the present invention. The moving image encoding apparatus 1 illustrated in FIG. 1 includes an encoding unit 100, a still image determination unit 200, and an encoding control unit 300.

符号化部１００は、時間的に連続したインターレース画像である入力画像信号をフィールド構造で圧縮符号化する。さらに、本発明の実施の形態１においては、この符号化部１００内に、入力画像信号の中から画像特徴量を抽出する特徴量抽出回路を備えており、詳細は後述する。 The encoding unit 100 compresses and encodes an input image signal, which is a temporally continuous interlaced image, with a field structure. Furthermore, in the first embodiment of the present invention, the encoding unit 100 is provided with a feature amount extraction circuit that extracts an image feature amount from an input image signal, which will be described in detail later.

また、静止画像判定部２００は、画像特徴量に基づいて符号化の対象となる入力画像信号が静止画像または動きが緩慢な画像であるか否かを示す静止画像判定情報を生成する。さらに、符号化制御部３００は、静止画像判定部２００により生成された静止画像判定情報に基づいて目標情報発生量を算出し、算出した目標情報発生量に基づいて符号化部１００の圧縮符号化を制御する。 Further, the still image determination unit 200 generates still image determination information indicating whether or not the input image signal to be encoded is a still image or an image with slow motion based on the image feature amount. Further, the encoding control unit 300 calculates a target information generation amount based on the still image determination information generated by the still image determination unit 200, and the compression encoding of the encoding unit 100 based on the calculated target information generation amount. To control.

図２は、本発明の実施の形態１における動画像符号化装置の詳細構成を示すブロック図である。符号化部１００は、フレームメモリ１０１、ブロック化回路１０２、減算器１０３、変換回路１０４、量子化回路１０５、逆量子化回路１０６、逆変換回路１０７、加算器１０８、フレームメモリ１０９、動き補償回路１１０、符号化回路１１１、バッファ１１２、および特徴量抽出回路１１３を備えている。 FIG. 2 is a block diagram showing a detailed configuration of the moving picture coding apparatus according to Embodiment 1 of the present invention. The encoding unit 100 includes a frame memory 101, a blocking circuit 102, a subtractor 103, a conversion circuit 104, a quantization circuit 105, an inverse quantization circuit 106, an inverse conversion circuit 107, an adder 108, a frame memory 109, and a motion compensation circuit. 110, an encoding circuit 111, a buffer 112, and a feature amount extraction circuit 113.

また、静止画像判定部２００は、静止画像判定回路２０１を備えている。さらに、符号化制御部３００は、制御回路３０１およびメモリ３０２を備えている。 The still image determination unit 200 includes a still image determination circuit 201. Furthermore, the encoding control unit 300 includes a control circuit 301 and a memory 302.

次に、これらの個々の構成の機能、動作について、順に説明する。
まず始めに、フレームメモリ１０１には、符号化の対象となる入力画像の画像信号である入力画像信号が格納される。この入力画像信号は、先の図１４に示したように、１つのフレームが２つのフィールドから構成されるインターレース構造であり、ＯｄｄフィールドとＥｖｅｎフィールドより成り立っている。通常は、Ｏｄｄフィールドの画像信号から入力され、続いて、Ｅｖｅｎフィールドの画像信号が入力される。 Next, functions and operations of these individual configurations will be described in order.
First, the frame memory 101 stores an input image signal that is an image signal of an input image to be encoded. As shown in FIG. 14, the input image signal has an interlace structure in which one frame is composed of two fields, and is composed of an Odd field and an Even field. Usually, an image signal in the odd field is input, and then an image signal in the even field is input.

ブロック化回路１０２は、フレームメモリ１０１に格納された入力画像信号を分割する。以降の動作は、この分割された領域（ブロック）を単位に行われる。例えば、デジタル放送やＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）レコーダなどに採用されているＭＰＥＧ−２では、この領域は、マクロブロック（ＭＢ：Ｍａｃｒｏｂｌｏｃｋ）と呼ばれ、大きさは、１６×１６画素である。 The blocking circuit 102 divides the input image signal stored in the frame memory 101. Subsequent operations are performed in units of the divided areas (blocks). For example, in MPEG-2 adopted for digital broadcasting, a DVD (Digital Versatile Disc) recorder, and the like, this area is called a macroblock (MB) and has a size of 16 × 16 pixels.

減算器１０３は、ブロック化回路１０２を介した入力画像信号と、後述する動き補償回路１１０により生成された予測画像信号との誤差を求め、その予測誤差信号を出力する。変換回路１０４は、減算器１０３から出力された予測誤差信号を直交変換して、その変換係数を出力する。この直交変換の一例としては、離散コサイン変換（ＤＣＴ：ＤｉｓｃｒｅｔｅＣｏｓｉｎｅＴｒａｎｓｆｏｒｍ）の適用が考えられる。 The subtractor 103 obtains an error between an input image signal that has passed through the blocking circuit 102 and a predicted image signal generated by a motion compensation circuit 110 described later, and outputs the predicted error signal. The conversion circuit 104 orthogonally transforms the prediction error signal output from the subtracter 103 and outputs the conversion coefficient. As an example of this orthogonal transformation, application of a discrete cosine transformation (DCT) is conceivable.

量子化回路１０５は、変換回路１０４から出力される変換係数を量子化する。逆量子化回路１０６は、量子化回路１０５により量子化された変換係数を逆量子化する。逆変換回路１０７は、逆量子化回路１０６により逆量子化された変換係数を逆変換して予測誤差信号を復号する。この逆変換の一例としては、変換回路１０４で離散コサイン変換が適用された場合には、逆離散コサイン変換を適用することとなる。 The quantization circuit 105 quantizes the conversion coefficient output from the conversion circuit 104. The inverse quantization circuit 106 inversely quantizes the transform coefficient quantized by the quantization circuit 105. The inverse transform circuit 107 inversely transforms the transform coefficient inversely quantized by the inverse quantization circuit 106 and decodes the prediction error signal. As an example of the inverse transform, when the discrete cosine transform is applied in the transform circuit 104, the inverse discrete cosine transform is applied.

加算器１０８は、逆変換回路１０７により復号された予測誤差信号と、動き補償回路１１０により生成された予測画像信号とを加算して参照画像信号を生成する。フレームメモリ１０９は、加算器１０８により生成された参照画像信号が格納される。動き補償回路１１０は、フレームメモリ１０９に格納された参照画像信号と、ブロック化回路１０２で分割出力された入力画像信号から動き補償予測を実施して予測画像信号および動きベクトル情報を生成する。 The adder 108 adds the prediction error signal decoded by the inverse conversion circuit 107 and the prediction image signal generated by the motion compensation circuit 110 to generate a reference image signal. The frame memory 109 stores the reference image signal generated by the adder 108. The motion compensation circuit 110 performs motion compensation prediction from the reference image signal stored in the frame memory 109 and the input image signal divided and output by the blocking circuit 102 to generate a predicted image signal and motion vector information.

符号化回路１１１は、量子化回路１０５により量子化された変換係数、および動き補償回路１１０で生成された動きベクトル情報を符号化し、それらの符号化データを多重化する。また、符号化回路１１１は、多重化した符号化データをバッファ１１２に格納するとともに、外部に伝送する。 The encoding circuit 111 encodes the transform coefficient quantized by the quantization circuit 105 and the motion vector information generated by the motion compensation circuit 110, and multiplexes the encoded data. The encoding circuit 111 stores the multiplexed encoded data in the buffer 112 and transmits it to the outside.

さらに、特徴量抽出回路１１３は、フレームメモリ１０１に格納されている入力画像信号の中から画像特徴量を抽出する。この画像特徴量の一例として、特徴量抽出回路１１３は、静止画像判定に用いるフィールド間差分情報を画像特徴量として抽出することができる。このフィールド間差分情報は、時間軸上で連続する同一パリティのフィールド間差分情報および時間軸上で連続する異パリティのフィールド間差分情報を含んでいる。 Further, the feature amount extraction circuit 113 extracts an image feature amount from the input image signal stored in the frame memory 101. As an example of this image feature quantity, the feature quantity extraction circuit 113 can extract inter-field difference information used for still image determination as an image feature quantity. This inter-field difference information includes inter-field difference information of the same parity continuous on the time axis and inter-field difference information of different parity continuous on the time axis.

これに対して、静止画像判定部２００内の静止画像判定回路２０１は、特徴量抽出回路１１３と接続されており、特徴量抽出回路１１３で抽出された画像特徴量に基づいて静止画像判定情報の算出を行う。例えば、静止画像判定回路２０１は、フィールド間差分情報から静止画像判定情報を生成する。 On the other hand, the still image determination circuit 201 in the still image determination unit 200 is connected to the feature amount extraction circuit 113, and based on the image feature amount extracted by the feature amount extraction circuit 113, the still image determination information Perform the calculation. For example, the still image determination circuit 201 generates still image determination information from inter-field difference information.

また、符号化制御部３００内の制御回路３０１は、バッファ１１２、あるいは静止画像判定回路２０１の出力に基づいて量子化回路１０５を制御する。一例として、制御回路３０１は、バッファ１１２の残量から量子化回路１０５が量子化処理実行時に用いる量子化パラメータを算出することができる。あるいは、制御回路３０１は、静止画像判定回路２０１が生成した静止画像判定情報を基にして、目標情報発生量の補正値を算出することができる。そして、メモリ３０２は、制御回路３０１が算出した目標情報発生量の補正値を一次的に保持する。 Also, the control circuit 301 in the encoding control unit 300 controls the quantization circuit 105 based on the output of the buffer 112 or the still image determination circuit 201. As an example, the control circuit 301 can calculate the quantization parameter used when the quantization circuit 105 performs the quantization process from the remaining amount of the buffer 112. Alternatively, the control circuit 301 can calculate a correction value for the target information generation amount based on the still image determination information generated by the still image determination circuit 201. The memory 302 temporarily holds the correction value of the target information generation amount calculated by the control circuit 301.

ここで、本発明の動画像符号化装置１の特徴を最も示している部分は、符号化部１００内の特徴量抽出回路１１３、静止画像判定部２００内の静止画像判定回路２０１、および符号化制御部３００内の制御回路３０１である。そこで、これらの構成を中心にした一連の動作について、次に説明する。 Here, the most characteristic features of the moving picture encoding apparatus 1 of the present invention are the feature quantity extraction circuit 113 in the encoding unit 100, the still image determination circuit 201 in the still image determination unit 200, and the encoding. This is a control circuit 301 in the control unit 300. A series of operations centering on these configurations will be described next.

図３は、本発明の実施の形態１におけるフレームメモリ１０１に入力される画像信号を入力順に示した説明図である。フレームメモリ１０１には、インターレース画像のＯｄｄフィールドの画像信号から順に入力される。ここで、画像信号は、輝度信号と色差信号から構成されている。そして、フレームメモリ１０１には、必要に応じて複数のフィールドの画像信号が保持されている。 FIG. 3 is an explanatory diagram showing the image signals input to the frame memory 101 according to the first embodiment of the present invention in the order of input. The frame memory 101 is sequentially input from the image signal in the odd field of the interlaced image. Here, the image signal is composed of a luminance signal and a color difference signal. The frame memory 101 holds image signals of a plurality of fields as necessary.

この図３において、フィールド０、２、４、６がＯｄｄフィールドであり、フィールド１、３、５、７がＥｖｅｎフィールドである。動画像符号化においては、符号化効率を向上させるために、画像信号の並べ替えを行う場合がある。例えば、先の図１５に示したＧＯＰの並びは、符号化効率を向上させるために画像信号を並べ替えている。 In FIG. 3, fields 0, 2, 4, and 6 are Odd fields, and fields 1, 3, 5, and 7 are Even fields. In moving picture coding, image signals may be rearranged in order to improve coding efficiency. For example, in the GOP arrangement shown in FIG. 15, the image signals are rearranged in order to improve the encoding efficiency.

そこで、図３に示す順番で入力された画像信号に対して、図１５に示したＧＯＰと同じような並べ替えを行うことを考える。図４は、本発明の実施の形態１における符号化順に並べ替えた画像信号の説明図である。なお、図４における各ピクチャに付されている数字は、ピクチャを符号化順に並べ替えた後にナンバリングした数字ではなく、入力順のままの数字を示している。 Therefore, consider rearranging the image signals input in the order shown in FIG. 3 in the same manner as the GOP shown in FIG. FIG. 4 is an explanatory diagram of image signals rearranged in the encoding order according to Embodiment 1 of the present invention. Note that the numbers given to each picture in FIG. 4 are not numbers numbered after the pictures are rearranged in the coding order, but numbers in the input order.

最初に符号化されるのは、３番目に入力されたフィールド２であり、このフィールド２は、ピクチャ内で符号化（イントラ符号化）を行うＩピクチャである。また、フィールド３、フィールド６、フィールド７は、過去のフィールドを参照して動き補償予測を行うＰピクチャである。また、フィールド０、フィールド１、フィールド４、フィールド５は、過去と未来のフィールドを参照して動き補償予測を行うＢピクチャとなる。 The first field to be encoded is the field 2 that is input third, and this field 2 is an I picture that is encoded (intra-encoded) within the picture. Fields 3, 6, and 7 are P pictures that perform motion compensation prediction with reference to past fields. Field 0, field 1, field 4, and field 5 are B pictures that perform motion compensation prediction with reference to past and future fields.

例えば、フィールド４は、Ｂピクチャとするので、過去の画像であるＩピクチャのフィールド２、およびＰピクチャであるフィールド３とともに、未来の画像であるＰピクチャのフィールド６、７を参照画像とする。また、Ｐピクチャであるフィールド３は、Ｉピクチャであるフィールド２のみを参照画像とする。さらに、Ｐピクチャであるフィールド６は、フィールド２、フィールド３を参照画像とする。 For example, since the field 4 is a B picture, the fields 6 and 7 of the P picture, which are future images, are used as reference images together with the field 2 of the I picture, which is a past image, and the field 3, which is a P picture. In the field 3 that is a P picture, only the field 2 that is an I picture is used as a reference image. Further, field 6 which is a P picture uses field 2 and field 3 as reference images.

次に、フローチャートに基づいて、各構成要素の動作について説明する。図５は、本発明の実施の形態１における特徴量抽出回路１１３および静止画像判定回路２０１の動作を示すフローチャートである。まず始めに、ステップＳ５０１において、特徴量抽出回路１１３は、先の図３に示した入力順にしたがって、同一パリティのフィールド間のフィールド間差分絶対値和ＳＳＡＤを算出する。 Next, the operation of each component will be described based on the flowchart. FIG. 5 is a flowchart showing operations of the feature amount extraction circuit 113 and the still image determination circuit 201 according to Embodiment 1 of the present invention. First, in step S501, the feature amount extraction circuit 113 calculates an inter-field difference absolute value sum SSAD between fields of the same parity according to the input order shown in FIG.

次に、ステップＳ５０２において、特徴量抽出回路１１３は、異パリティのフィールド間のフィールド間差分絶対値和ＤＳＡＤを求める。図６は、本発明の実施の形態１における同一パリティのフィールド間のフィールド間差分絶対値和ＳＳＡＤ、および異パリティのフィールド間のフィールド間差分絶対値和ＤＳＡＤの算出に関する説明図である。 Next, in step S502, the feature amount extraction circuit 113 calculates an inter-field difference absolute value sum DSAD between fields of different parity. FIG. 6 is an explanatory diagram relating to the calculation of the inter-field difference absolute value sum SSAD between fields of the same parity and the inter-field difference absolute value sum DSAD between fields of different parity in Embodiment 1 of the present invention.

図６に示すように、同一パリティのフィールド間差分絶対値和ＳＳＡＤは、例えば、連続するＯｄｄフィールド間のフィールド０とフィールド２との間、フィールド２とフィールド４との間などで求められる。ここで、フィールド間差分絶対値和とは、それぞれのフィールドの同じ位置にある画素の信号における輝度値の差分の絶対値を合計したものである。 As shown in FIG. 6, the inter-field difference absolute value sum SSAD of the same parity is obtained, for example, between field 0 and field 2 between consecutive Odd fields, between field 2 and field 4, and the like. Here, the inter-field difference absolute value sum is a sum of absolute values of differences in luminance values in signals of pixels at the same position in each field.

一方、異パリティのフィールド間差分絶対値和ＤＳＡＤは、時間軸上で連続する２つのフィールド間の差分絶対値和であり、例えば、フィールド０とフィールド１との間、フィールド２とフィールド３との間などで求められる。このＤＳＡＤにおけるフィールド間差分絶対値和も、ＳＳＡＤと同様に、それぞれのフィールドの同じ位置にある画素の信号における輝度値の差分の絶対値を合計したものである。 On the other hand, the inter-field difference absolute value sum DSAD of different parity is the sum of absolute differences between two fields that are continuous on the time axis. For example, between the field 0 and the field 1, the field 2 and the field 3 It is required in between. The sum of absolute differences between fields in the DSAD is the sum of the absolute values of the differences in luminance values in the signals of the pixels at the same position in each field, as in the SSAD.

次に、ステップＳ５０３において、静止画像判定回路２０１は、同一パリティのフィールド間差分絶対値和ＳＳＡＤを、空間的変移を示す値であるとみなし、異パリティのフィールド間差分絶対値和ＤＳＡＤを、時間的変移を示す値であるとみなし、この２つの比Ｓ（Ｓ＝ＤＳＡＤ／ＳＳＡＤ）を求め、制御回路３０１に出力する。 Next, in step S503, the still image determination circuit 201 regards the inter-field difference absolute value sum SSAD of the same parity as a value indicating a spatial shift, and calculates the inter-field difference absolute value sum DSAD of different parity as time. The two ratios S (S = DSAD / SSAD) are obtained and output to the control circuit 301.

図７は、本発明の実施の形態１における画像信号の入力順と符号化順の時間軸での対応関係を示す説明図である。Ｓの算出に関しては、低遅延符号化の場合、図７に示すように、ピクチャ０では、入力後、符号化開始されるまで時間間隔が存在する。これに対し、ピクチャ２では、入力後、直ちに符号化が開始される。このため、図７のケースでは、Ｓの算出は、ピクチャ０とピクチャ２との間のＳＳＡＤと、ピクチャ０とピクチャ１との間のＤＳＡＤを用いて行ってもよい。 FIG. 7 is an explanatory diagram illustrating a correspondence relationship between the input order of image signals and the encoding order on the time axis according to Embodiment 1 of the present invention. Regarding the calculation of S, in the case of low delay encoding, as shown in FIG. 7, there is a time interval in picture 0 until encoding starts after input. In contrast, in picture 2, encoding is started immediately after input. For this reason, in the case of FIG. 7, the calculation of S may be performed using the SSAD between the picture 0 and the picture 2 and the DSAD between the picture 0 and the picture 1.

あるいは、例えば、図１５に示したＧＯＰ（ｎ）の符号化を実施する場合には、ＧＯＰ（ｎ−１）におけるＳＳＡＤの総和とＤＳＡＤの総和との比率をＳとして用いてもよい。すなわち、１つ前のＧＯＰの画像特徴量を用いてもよい。ただし、１つ前のＧＯＰでシーンチェンジが発生している場合は、シーンチェンジ後のＳＳＡＤとＤＳＡＤを用いる。 Alternatively, for example, when encoding GOP (n) shown in FIG. 15, the ratio of the sum of SSAD and the sum of DSAD in GOP (n−1) may be used as S. That is, the image feature amount of the previous GOP may be used. However, when a scene change has occurred in the previous GOP, SSAD and DSAD after the scene change are used.

次に、制御回路３０１の動作をフローチャートに基づいて説明する。図８は、本発明の実施の形態１における符号化制御部３００内の制御回路３０１の動作を示すフローチャートである。まず始めに、ステップＳ８０１において、制御回路３０１は、ＳｔｉｌｌＦｌａｇをＯｆｆに設定（リセット）する。このＳｔｉｌｌＦｌａｇとは、静止画像、あるいは動きが緩慢な画像である場合にＯｎ設定され、それ以外の画像である場合にＯｆｆ設定されるフラグである。 Next, the operation of the control circuit 301 will be described based on a flowchart. FIG. 8 is a flowchart showing the operation of the control circuit 301 in the encoding control unit 300 according to Embodiment 1 of the present invention. First, in step S801, the control circuit 301 sets Still Flag to Off (reset). The Still Flag is a flag that is set to On when the image is a still image or a slow-moving image, and is set to Off when the image is any other image.

次に、ステップＳ８０２において、制御回路３０１は、静止画像判定回路２０１により算出されたＳの値を読み取り、Ｓと所定の閾値ＴＨとを比較する。そして、ＳがＴＨ未満であった場合には、静止画像、あるいは動きが緩慢な画像であると判断し、Ｓ８０３に遷移し、ＳｔｉｌｌＦｌａｇをＯｎに設定した後、ステップＳ８０４に遷移する。一方、ＳがＴＨ以上であった場合には、ＳｔｉｌｌＦｌａｇはＯｆｆのままで、ステップＳ８０４に遷移する。なお、ＴＨの値は、例えば、０．４である。 Next, in step S802, the control circuit 301 reads the value of S calculated by the still image determination circuit 201 and compares S with a predetermined threshold value TH. If S is less than TH, it is determined that the image is a still image or a slow-moving image, the process proceeds to S803, Still Flag is set to On, and then the process proceeds to step S804. On the other hand, if S is greater than or equal to TH, StillFlag remains Off and the process proceeds to step S804. Note that the value of TH is, for example, 0.4.

次に、ステップＳ８０４において、制御回路３０１は、ｎに０をセットする。ここで、ｎは、ＧＯＰ内のピクチャのカウントに用いる。次に、ステップＳ８０５において、制御回路３０１は、ＧＯＰ内のすべてのピクチャに対する処理が終了したかを判断する。Ｎは、ＧＯＰ内のピクチャ枚数であり、例えば、先の図１５に示したＧＯＰでは、Ｎの値は、２８となる。 Next, in step S804, the control circuit 301 sets 0 to n. Here, n is used to count pictures in the GOP. Next, in step S805, the control circuit 301 determines whether the processing for all the pictures in the GOP has been completed. N is the number of pictures in the GOP. For example, the value of N is 28 in the GOP shown in FIG.

そして、このステップＳ８０５において、ＧＯＰ内のすべてのピクチャに対する処理が終了したと判断した場合（すなわち、ｎ＜Ｎでない場合）には、一連の処理を終了する。一方、ＧＯＰ内のすべてのピクチャに対する処理が終了していないと判断した場合（すなわち、ｎ＜Ｎである場合）には、次のステップＳ８０６に遷移する。 If it is determined in step S805 that the processing for all the pictures in the GOP has been completed (that is, if n <N), the series of processing ends. On the other hand, when it is determined that the processing for all the pictures in the GOP has not been completed (that is, when n <N), the process proceeds to the next step S806.

次に、ステップＳ８０６において、制御回路３０１は、符号化対象ピクチャの符号化に先立ち、ピクチャ単位の目標情報発生量を計算する。目標情報発生量の計算方式として、ここでは、ＭＰＥＧ−２のＴＭ５に採用された方式を記載しておく。 Next, in step S806, the control circuit 301 calculates a target information generation amount in units of pictures prior to encoding of the encoding target picture. Here, as a calculation method of the target information generation amount, a method adopted in MPEG-2 TM5 is described here.

ＴＭ５に採用された方式では、先ず、ＧＯＰを構成するピクチャにおいて、まだ符号化されていないピクチャに対して割り当てられている目標情報発生量Ｒを基に、各ピクチャの目標情報発生量を算出する。ＧＯＰを構成するピクチャが１枚も符号化されていない状態では、目標情報発生量Ｒは、ＧＯＰ全体の目標情報発生量に等しい。具体的なピクチャ単位の目標情報発生量の算出は、下式（１）〜（３）を用いて行う。 In the scheme adopted in TM5, first, in the pictures constituting the GOP, the target information generation amount of each picture is calculated based on the target information generation amount R assigned to a picture that has not yet been encoded. . In a state where no picture constituting the GOP is encoded, the target information generation amount R is equal to the target information generation amount of the entire GOP. The calculation of the specific target information generation amount in units of pictures is performed using the following equations (1) to (3).

式（１）〜（３）を用いて、それぞれＩピクチャの目標情報発生量Ｔｉ、Ｐピクチャの目標情報発生量Ｔｐ、そしてＢピクチャの目標情報発生量Ｔｂが算出される。
Ｔｉ＝Ｒ／｛１＋Ｎｐ・Ｘｐ／（Ｘｉ・Ｋｐ）＋Ｎｂ・Ｘｂ／（Ｘｉ・Ｋｂ）｝（１）
Ｔｐ＝Ｒ／｛Ｎｐ＋Ｎｂ・Ｋｐ・Ｘｂ／（Ｋｂ・Ｘｐ）｝（２）
Ｔｂ＝Ｒ／｛Ｎｂ＋Ｎｐ・Ｋｂ・Ｘｐ／（Ｋｐ・Ｘｂ）｝（３） Using the equations (1) to (3), the target information generation amount Ti for the I picture, the target information generation amount Tp for the P picture, and the target information generation amount Tb for the B picture are calculated.
Ti = R / {1 + Np · Xp / (Xi · Kp) + Nb · Xb / (Xi · Kb)} (1)
Tp = R / {Np + Nb · Kp · Xb / (Kb · Xp)} (2)
Tb = R / {Nb + Np · Kb · Xp / (Kp · Xb)} (3)

ここで、Ｎｐは、ＧＯＰ内でまだ符号化されていないＰピクチャの枚数を表す。また、Ｎｂは、ＧＯＰ内でまだ符号化されていないＢピクチャの枚数を表す。例えば、図１５に示したＧＯＰでは、Ｎｐの値は１３、Ｎｂの値は１４となる。ここで、ＧＯＰに含まれるＩピクチャの枚数は１であるため、Ｉピクチャの枚数を表す変数は存在しない。 Here, Np represents the number of P pictures that have not been encoded in the GOP. Nb represents the number of B pictures that have not yet been encoded in the GOP. For example, in the GOP shown in FIG. 15, the value of Np is 13 and the value of Nb is 14. Here, since the number of I pictures included in the GOP is 1, there is no variable representing the number of I pictures.

また、Ｘｉ、Ｘｐ、Ｘｂは、各ピクチャタイプの画面の複雑度を示しており、それぞれ、ＸｉがＩピクチャの複雑度、ＸｐがＰピクチャの複雑度、ＸｂがＢピクチャの複雑度を表している。この複雑度の値が相対的に大きいピクチャでは、情報発生量が相対的に多く、複雑度が相対的に小さいピクチャでは、情報発生量が相対的に少ない。 Xi, Xp, and Xb indicate the complexity of the screen of each picture type. Xi represents the complexity of the I picture, Xp represents the complexity of the P picture, and Xb represents the complexity of the B picture. Yes. A picture with a relatively large complexity value has a relatively large amount of information generation, and a picture with a relatively small complexity value has a relatively small amount of information generation.

また、ＫｐとＫｂは、それぞれＩピクチャの量子化スケールコードを基準とした場合のＰピクチャとＢピクチャの量子化スケールコードの比率を表している。 Kp and Kb represent the ratio of the quantization scale code of the P picture and the B picture when the quantization scale code of the I picture is used as a reference.

上述したように、制御回路３０１は、ステップＳ８０６において、Ｉピクチャの場合は式（１）を用いてＴｉを、Ｐピクチャの場合は式（２）を用いてＴｐを、Ｂピクチャの場合は式（３）を用いてＴｂをそれぞれ算出し、その後、ステップＳ８０７に遷移する。次に、ステップＳ８０７において、制御回路３０１は、ＳｔｉｌｌＦｌａｇがＯｎであるかを判断する。 As described above, in step S806, the control circuit 301 uses Equation (1) for I pictures, Ti for Equations (2) for P pictures, and Equations (2) for B Pictures. Tb is calculated using (3), and then the process proceeds to step S807. Next, in step S807, the control circuit 301 determines whether StillFlag is On.

そして、ＳｔｉｌｌＦｌａｇがＯｎでない場合には、後述するステップＳ８１０に遷移する。一方、ＳｔｉｌｌＦｌａｇがＯｎである場合には、ステップＳ８０８に遷移し、制御回路３０１は、Ｉピクチャであるかを判定する。そして、Ｉピクチャである場合には、ステップＳ８０９に遷移し、制御回路３０１は、先のステップＳ８０６で算出したＴｉに補正を加える。 And when StillFlag is not On, it changes to Step S810 mentioned below. On the other hand, if StillFlag is On, the process proceeds to step S808, and the control circuit 301 determines whether the picture is an I picture. If it is an I picture, the process proceeds to step S809, and the control circuit 301 corrects Ti calculated in the previous step S806.

補正方法としては、Ｔｉ＝（１−α）Ｔｉ（０＜α＜１）とし、ＳｔｉｌｌＦｌａｇがＯｎである場合のＩピクチャの目標情報発生量Ｔｉが、元の値よりもα・Ｔｉ分だけ小さい値に補正される。また、制御回路３０１は、α・Ｔｉの値を第１の補正量としてメモリ３０２に一時記憶する。そして、ステップＳ８０９の処理が終了すると、ステップＳ８１０に遷移する。 As a correction method, Ti = (1−α) Ti (0 <α <1), and when the Still Flag is On, the target information generation amount Ti of the I picture is smaller by α · Ti than the original value. It is corrected to the value. The control circuit 301 temporarily stores the value of α · Ti in the memory 302 as the first correction amount. Then, when the process of step S809 ends, the process proceeds to step S810.

一方、先のステップＳ８０８において、Ｉピクチャでないと判断した場合には、ステップＳ８１１に遷移し、制御回路３０１は、現在の符号化対象のピクチャがＰピクチャであるかを判断する。 On the other hand, if it is determined in step S808 that the current picture is not an I picture, the process proceeds to step S811, and the control circuit 301 determines whether the current picture to be encoded is a P picture.

Ｐピクチャである場合には、ステップＳ８１２に遷移し、制御回路３０１は、現在の符号化対象ピクチャがＥｖｅｎフィールドで、かつ、カウンタｎの値がＫ（０＜Ｋ＜Ｎ）未満であるかを判定する。この条件は、ＥｖｅｎフィールドでＧＯＰの先頭からＫ番目未満のＰピクチャのＴｐに補正を加えることを意味している。 If it is a P picture, the process proceeds to step S812, and the control circuit 301 determines whether the current encoding target picture is the Even field and the value of the counter n is less than K (0 <K <N). judge. This condition means that correction is applied to Tp of the P picture less than the Kth from the top of the GOP in the Even field.

この条件を満たした場合には、ステップＳ８１３に遷移し、制御回路３０１は、Ｔｐに補正を加える。補正方法は、先のステップＳ８０９でメモリ３０２に一時記憶している第１の補正量α・Ｔｉを、ステップＳ８１２の条件を満たすＰピクチャに分配する。 When this condition is satisfied, the process proceeds to step S813, and the control circuit 301 corrects Tp. In the correction method, the first correction amount α · Ti temporarily stored in the memory 302 in the previous step S809 is distributed to the P pictures that satisfy the condition in step S812.

具体的には、制御回路３０１は、ステップＳ８１２の条件を満たした各Ｐピクチャに、重み係数Ｗ（ｎ）を用いてα・Ｔｉ・Ｗ（ｎ）により分配を行う。各Ｗ（ｎ）は、１未満の正の値であり、Ｗ（ｎ）＞Ｗ（ｎ＋１）の関係にある。すなわち、時間的にＩピクチャに近いＰピクチャほど多くの情報発生量が分配される。また、Ｗ（ｎ）の合計は、１である。 Specifically, the control circuit 301 distributes to each P picture satisfying the condition of step S812 by α · Ti · W (n) using the weight coefficient W (n). Each W (n) is a positive value less than 1, and has a relationship of W (n)> W (n + 1). That is, a larger amount of information generation is distributed to P pictures that are closer to I pictures in terms of time. The total of W (n) is 1.

図９は、本発明の実施の形態１における重み係数Ｗに基づく目標情報発生量の分配に関する説明図であり、ステップＳ８１３におけるα・Ｔｉ・Ｗ（ｎ）の分配例を示したものである。 FIG. 9 is an explanatory diagram relating to the distribution of the target information generation amount based on the weighting factor W in the first embodiment of the present invention, and shows an example of the distribution of α · Ti · W (n) in step S813.

Ｋの値として１０を設定した場合には、ＥｖｅｎフィールドのＰピクチャでカウンタｎの値が１０未満となるＰピクチャ（すなわち、Ｐ１、Ｐ５、Ｐ９）に第１の補正量α・Ｔｉが分配される。Ｐ１に対しては、α・Ｔｉ・Ｗ（１）が、Ｐ５に対しては、α・Ｔｉ・Ｗ（５）が、Ｐ９に対しては、α・Ｔｉ・Ｗ（９）がそれぞれ分配される。このとき、Ｗ（１）＋Ｗ（５）＋Ｗ（９）＝１となる。 When K is set to 10, the first correction amount α · Ti is distributed to P pictures (that is, P1, P5, P9) in which the value of the counter n is less than 10 in the P picture of the Even field. The Α · Ti · W (1) is distributed to P1, α · Ti · W (5) is distributed to P5, and α · Ti · W (9) is distributed to P9. The At this time, W (1) + W (5) + W (9) = 1.

この分配により、Ｐ１の目標情報発生量は、図８のＳ８０６で算出されたＴｐ（１）にα・Ｔｉ・Ｗ（１）を加算した値となる。また、Ｐ５の目標情報発生量は、図８のＳ８０６で算出されたＴｐ（５）にα・Ｔｉ・Ｗ（５）を加算した値となる。さらに、Ｐ９の目標情報発生量は、図８のＳ８０６で算出されたＴｐ（９）にα・Ｔｉ・Ｗ（９）を加算した値となる。 By this distribution, the target information generation amount of P1 becomes a value obtained by adding α · Ti · W (1) to Tp (1) calculated in S806 of FIG. Further, the target information generation amount of P5 is a value obtained by adding α · Ti · W (5) to Tp (5) calculated in S806 of FIG. Further, the target information generation amount of P9 is a value obtained by adding α · Ti · W (9) to Tp (9) calculated in S806 of FIG.

また、このとき、図示していないが、バッファ１１２の空き容量と補正を加えた後のＴｐとを比較し、バッファ破綻が発生する可能性がある場合には、例えば、バッファ１１２の空き容量の９０％をＴｐの値として更新する。なお、重み係数Ｗ（ｎ）の値は、例えば、Ｗ（１）が０.７、Ｗ（５）が０.２、Ｗ（９）が０.１である。 At this time, although not shown in the figure, the free space of the buffer 112 is compared with the Tp after the correction, and if there is a possibility that a buffer failure may occur, for example, the free space of the buffer 112 90% is updated as the value of Tp. The values of the weighting factor W (n) are, for example, 0.7 for W (1), 0.2 for W (5), and 0.1 for W (9).

Ｓ８１３の処理が終了すると、Ｓ８１０に遷移する。また、先のステップＳ８０９の処理が終了した場合、先のステップＳ８１１の条件を満たさない場合、あるいは、先のステップＳ８１２の条件を満たさない場合にも、ステップＳ８１０に遷移する。そして、ステップＳ８１０において、制御回路３０１は、ＧＯＰ内のピクチャのカウント値であるｎに１を加えて、ｎの値を更新する。 When the process of S813 ends, the process proceeds to S810. In addition, when the process of the previous step S809 is completed, when the condition of the previous step S811 is not satisfied, or when the condition of the previous step S812 is not satisfied, the process proceeds to step S810. In step S810, the control circuit 301 updates the value of n by adding 1 to n that is the count value of the picture in the GOP.

ｎの値を更新した後、ステップＳ８０５に遷移し、制御回路３０１は、上述したステップＳ８０５〜ステップＳ８１３の処理を繰り返し行う。そして、最終的にステップＳ８０５において、制御回路３０１は、ＧＯＰ内のすべてのピクチャに対する処理が終了したと判断した場合には、一連の処理を終了する。この終了とは、１ＧＯＰ分の処理が終了することを意味する。また、図示していないが、目標情報発生量の算出が完了する、または補正が完了すると、各ピクチャ毎に前述の符号化動作が行われる。 After updating the value of n, the process proceeds to step S805, and the control circuit 301 repeats the processes of steps S805 to S813 described above. Finally, in step S805, when the control circuit 301 determines that the processing for all the pictures in the GOP has been completed, the control circuit 301 ends the series of processing. This termination means that processing for one GOP is terminated. Although not shown, when the calculation of the target information generation amount is completed or the correction is completed, the above-described encoding operation is performed for each picture.

目標情報発生量は、ピクチャ単位、すなわち、Ｉ０、Ｐ１、Ｂ２などの１フィールド単位に算出される。符号化動作においては、制御回路３０１が目標情報発生量を基に量子化パラメータをＭＢ単位で算出し、符号化中のフィールドの実情報発生量が目標情報発生量に納まるように、主に量子化回路１０５を制御する。この制御においては、実情報発生量が目標情報発生量を超えることを防ぐことと、ＧＯＰ全体の画質が均一となることを目的として量子化パラメータにリミット値を設定してもよい。 The target information generation amount is calculated in units of pictures, that is, in units of one field such as I0, P1, and B2. In the encoding operation, the control circuit 301 calculates a quantization parameter in MB units based on the target information generation amount, and the quantum information is mainly set so that the actual information generation amount of the field being encoded falls within the target information generation amount. The control circuit 105 is controlled. In this control, a limit value may be set in the quantization parameter for the purpose of preventing the actual information generation amount from exceeding the target information generation amount and making the image quality of the entire GOP uniform.

以上のように、実施の形態１によれば、静止画像判定情報に応じて符号化の対象となるピクチャの目標情報発生量を求め、目標情報発生量に基づいて符号化部を制御することにより、ＧＯＰ全体としての符号化画質を向上させることができる。 As described above, according to the first embodiment, the target information generation amount of the picture to be encoded is obtained according to the still image determination information, and the encoding unit is controlled based on the target information generation amount. Therefore, it is possible to improve the encoded image quality of the entire GOP.

具体的には、ＧＯＰ先頭のＯｄｄフィールドのＩピクチャは、情報発生量が削減されるため、符号化画質を示すＳＮＲは劣化する。しかしながら、このＩピクチャの次に符号化されるＥｖｅｎフィールドのＰピクチャにおいては、情報発生量が増加することにより符号化画質を示すＳＮＲが改善する。このため、このＩＰペアを含むＧＯＰを構成するピクチャの符号化画質を示すＳＮＲの平均値が改善し、ＧＯＰ全体としての符号化画質の向上を実現できる。 Specifically, since the amount of information generated is reduced in the I picture in the Odd field at the beginning of the GOP, the SNR indicating the encoded image quality deteriorates. However, in the P picture of the Even field encoded next to the I picture, the SNR indicating the encoded image quality is improved by increasing the information generation amount. For this reason, the average value of the SNR indicating the encoded image quality of the pictures constituting the GOP including the IP pair is improved, and the encoded image quality as a whole GOP can be improved.

さらに、先の図１７に示したようなＯｄｄフィールドとＥｖｅｎフィールドとのＳＮＲの差が縮小し、ＯｄｄフィールドとＥｖｅｎフィールドのＳＮＲの遷移がほぼ等しくなるという効果が得られる。 Further, the SNR difference between the Odd field and the Even field as shown in FIG. 17 is reduced, and the SNR transition between the Odd field and the Even field is almost equal.

これに対して、従来の動画像符号化装置では、Ｉピクチャの目標情報発生量を増加するため、フィールド構造に適応した場合、Ｏｄｄフィールド側のＳＮＲは向上するが、Ｅｖｅｎフィールド側のＳＮＲは劣化する。このため、図１７に示したＳＮＲの差が大きくなりＧＯＰ全体としての画質の改善がなされない。 On the other hand, in the conventional video encoding device, the target information generation amount of the I picture is increased, so that when applied to the field structure, the SNR on the odd field side is improved, but the SNR on the even field side is degraded. To do. For this reason, the difference in SNR shown in FIG. 17 becomes large, and the image quality as a whole GOP is not improved.

実施の形態２．
本実施の形態２においては、符号化制御部３００におけるＰピクチャの目標情報発生量Ｔｐの補正処理が、先の実施の形態１と異なる動画像符号化装置について説明する。なお、本実施の形態２における動画像符号化装置の構成は、図１に示す先の実施の形態１の構成と同一である。 Embodiment 2. FIG.
In the second embodiment, a moving picture encoding apparatus in which the correction processing of the target information generation amount Tp of the P picture in the encoding control unit 300 is different from that of the first embodiment will be described. Note that the configuration of the moving picture encoding apparatus according to the second embodiment is the same as that of the first embodiment shown in FIG.

図１０は、本発明の実施の形態２における符号化制御部３００内の制御回路３０１の動作を示すフローチャートである。図１０に示したステップの内、ステップＳ１００１〜ステップＳ１０１１の動作は、先の図８におけるステップＳ８０１〜ステップＳ８１１の動作と同じである。そこで、図８に示した先の実施の形態１とは異なるステップＳ１０１２およびステップＳ１０１３によるＰピクチャの目標情報発生量Ｔｐの補正処理について、詳細に説明する。 FIG. 10 is a flowchart showing the operation of the control circuit 301 in the encoding control unit 300 according to Embodiment 2 of the present invention. Of the steps shown in FIG. 10, the operations in steps S1001 to S1011 are the same as the operations in steps S801 to S811 in FIG. Therefore, the correction processing of the target information generation amount Tp of the P picture in step S1012 and step S1013 different from the first embodiment shown in FIG. 8 will be described in detail.

Ｐピクチャである場合には、ステップＳ１０１２に遷移し、制御回路３０１は、現在符号化対象のＰピクチャがＧＯＰ先頭のＩピクチャからカウントしてＫ（０＜Ｋ＜Ｎ）番目未満であるかを判定する。具体的には、カウンタｎの値をＫと比較し、ｎ＜Ｋの場合には、ステップＳ１０１３に遷移し、ｎ≧Ｋである場合には、ステップＳ１０１０に遷移する。 If it is a P picture, the process proceeds to step S1012, and the control circuit 301 counts whether or not the current P picture to be encoded is less than K (0 <K <N) from the I picture at the head of the GOP. judge. Specifically, the value of the counter n is compared with K. If n <K, the process proceeds to step S1013. If n ≧ K, the process proceeds to step S1010.

先の図８におけるステップＳ８１２では、現在の符号化対象ピクチャがＥｖｅｎフィールドであることも条件として判定していたが、このステップＳ１０１２においては、ｎとＫとの比較のみにより判定を行っており、ＥｖｅｎフィールドとＯｄｄフィールドの両方を補正の対象としている。 In step S812 in FIG. 8, it is determined as a condition that the current encoding target picture is an Even field, but in this step S1012, the determination is made only by comparing n and K. Both the Even field and the Odd field are subject to correction.

ステップＳ１０１２の条件を満たしている場合には、ステップＳ１０１３において、制御回路３０１は、Ｐピクチャの目標情報発生量に変更を加える。Ｓ１０１３における変更方法において、制御回路３０１は、バッファ１１２の占有量を基に計算を行う。 If the condition of step S1012 is satisfied, in step S1013, the control circuit 301 changes the target information generation amount of the P picture. In the changing method in S1013, the control circuit 301 performs a calculation based on the occupation amount of the buffer 112.

ここで、バッファ１２の占有量をバッファの最大サイズから減算した値、すなわち、ｎ番目のピクチャの符号化後のバッファの空き容量をＲＭＮ（ｎ）とした場合に、ｎ番目のピクチャの次に符号化される（ｎ＋１）番目のＰピクチャの目標情報発生量Ｔｐは、ＲＭＮ（ｎ）と重み係数ｗ（ｎ＋１）との積とする。ここで、本実施の形態２における重み係数ｗは、先の実施の形態１における重み係数Ｗとは異なり、符号化順に従ってあらかじめ決められている重み係数である。 Here, when the value obtained by subtracting the occupation amount of the buffer 12 from the maximum size of the buffer, that is, the free space of the buffer after the encoding of the nth picture is RMN (n), the next to the nth picture. The target information generation amount Tp of the (n + 1) th P picture to be encoded is the product of RMN (n) and the weight coefficient w (n + 1). Here, the weighting factor w in the second embodiment is a weighting factor that is determined in advance according to the coding order, unlike the weighting factor W in the first embodiment.

図１１は、本発明の実施の形態２におけるバッファの空き容量に基づく目標情報発生量の分配に関する説明図である。図１１の上段は、バッファ１１２の空き容量の遷移の一例を示しており、下段は、空き容量と重み係数ｗとの積から計算される目標発生情報量の遷移の一例を示している。 FIG. 11 is an explanatory diagram relating to the distribution of the target information generation amount based on the free space of the buffer according to the second embodiment of the present invention. The upper part of FIG. 11 shows an example of the transition of the free capacity of the buffer 112, and the lower part shows an example of the transition of the target generated information amount calculated from the product of the free capacity and the weighting factor w.

具体的には、Ｉ０の符号化終了後のバッファ空き容量は、ＲＭＮ（０）で示されており、このＲＭＮ（０）とｗ（１）の積が、Ｐ１の目標情報発生量Ｔｐとなる。同様にＰ４の場合には、ＲＭＮ（３）・ｗ（４）が目標情報発生量Ｔｐとなり、Ｐ５の場合には、ＲＭＮ（４）・ｗ（５）が目標情報発生量Ｔｐとなる。 Specifically, the buffer free space after the encoding of I0 is indicated by RMN (0), and the product of RMN (0) and w (1) is the target information generation amount Tp of P1. . Similarly, in the case of P4, RMN (3) · w (4) is the target information generation amount Tp, and in the case of P5, RMN (4) · w (5) is the target information generation amount Tp.

ステップＳ１０１３の処理は、ステップＳ１０１２の条件を満たすすべてのＰピクチャに対して適用される。先の実施の形態１では、ＥｖｅｎフィールドのＰピクチャのみを対象にＴｐの補正を加えたが、本実施の形態２では、ＯｄｄフィールドとＥｖｅｎフィールドの両方のＰピクチャのＴｐに変更を加える。このため、本実施の形態２によれば、両フィールドのＳＮＲの遷移がより近くなる。 The process of step S1013 is applied to all P pictures that satisfy the condition of step S1012. In the first embodiment, the Tp correction is applied only to the P picture in the Even field, but in the second embodiment, the Tp of the P picture in both the Odd field and the Even field is changed. For this reason, according to the second embodiment, the SNR transitions of both fields become closer.

図１２は、本発明の実施の形態２のフィールド構造による符号化結果におけるＯｄｄフィールドとＥｖｅｎフィールドのＳＮＲの遷移を示す図である。先の図１７と比較して、図１２では、ＩＰフィールドにおけるＳＮＲの差がなく、全体を通してＯｄｄフィールドとＥｖｅｎフィールドの差が縮小されており、加えて、ＳＮＲの値が改善されていることが分かる。 FIG. 12 is a diagram showing the transition of the SNR between the Odd field and the Even field in the encoding result by the field structure according to the second embodiment of the present invention. Compared to the previous FIG. 17, in FIG. 12, there is no SNR difference in the IP field, the difference between the Odd field and the Even field is reduced throughout, and in addition, the SNR value is improved. I understand.

図１３は、インターレース画像の構成を示す図である。フィールド構造のピクチャは、表示される際に、この図１３に示すように、ＯｄｄフィールドとＥｖｅｎフィールドが１ライン単位で交互に表示される。このため、両フィールドのＳＮＲの差がより少なくなることにより、主観画質がさらに向上するという効果が得られる。 FIG. 13 is a diagram illustrating a configuration of an interlaced image. When the field structure picture is displayed, as shown in FIG. 13, the Odd field and the Even field are alternately displayed in units of one line. For this reason, the effect that the subjective image quality is further improved is obtained by reducing the difference in SNR between the two fields.

以上のように、実施の形態２によれば、ＯｄｄフィールドとＥｖｅｎフィールドの両方のＰピクチャのＴｐに変更を加えることにより、ＧＯＰ全体としての符号化画質をさらに向上させることができる。さらに、先の実施の形態１と比較して、ＯｄｄフィールドとＥｖｅｎフィールドのＳＮＲの遷移がより近くなるという効果が得られる。 As described above, according to the second embodiment, it is possible to further improve the encoded image quality of the entire GOP by changing the Tp of the P picture in both the Odd field and the Even field. Furthermore, the effect that the SNR transition between the Odd field and the Even field becomes closer compared to the first embodiment is obtained.

なお、上述の実施の形態１、２においては、静止画像判定を同一パリティのフィールド間のフィールド間差分絶対値和と、異パリティのフィールド間のフィールド間差分絶対値和の比率で判定を行ったが、他の静止画像判定方法を用いても本発明は同様の効果を得ることができる。 In the first and second embodiments described above, still image determination is performed based on the ratio of the inter-field difference absolute value sum between fields of the same parity and the inter-field difference absolute value sum between fields of different parity. However, the present invention can obtain the same effect even if other still image determination methods are used.

また、上述の実施の形態１、２においては、特徴量抽出回路１１３を符号化部１００内に設けた場合について説明したが、本発明の動画像符号化装置は、このような構成に限定されるものではない。例えば、この特徴量抽出回路１１３の機能を静止画像判定部２００内に持たせることも可能であり、同様の効果を得ることができる。 In the first and second embodiments, the case where the feature amount extraction circuit 113 is provided in the encoding unit 100 has been described. However, the moving image encoding apparatus of the present invention is limited to such a configuration. It is not something. For example, the function of the feature amount extraction circuit 113 can be provided in the still image determination unit 200, and the same effect can be obtained.

また、上述の実施の形態１においては、ＥｖｅｎフィールドのＰピクチャのみを対象にＰピクチャの目標情報発生量Ｔｐの補正を行う場合を説明し、実施の形態２においては、ＥｖｅｎフィールドとＯｄｄフィールドの両方のＰピクチャを対象にＰピクチャの目標情報発生量Ｔｐの補正を行う場合を説明したが、本発明の動画像符号化装置は、このような処理に限定されるものではない。 Further, in the above-described first embodiment, a case where the target information generation amount Tp of the P picture is corrected only for the P picture in the Even field will be described. In the second embodiment, the Even field and the Odd field are corrected. Although the case where the target information generation amount Tp of the P picture is corrected for both P pictures has been described, the moving picture encoding apparatus of the present invention is not limited to such processing.

実施の形態１においても、ＥｖｅｎフィールドとＯｄｄフィールドの両方のＰピクチャを対象とすることができ、実施の形態２と同様の効果を得ることができる。また、実施の形態２においても、ＥｖｅｎフィールドのＰピクチャのみを対象とすることができ、実施の形態１と同様の効果を得ることができる。 Also in the first embodiment, P pictures in both the Even field and the Odd field can be targeted, and the same effect as in the second embodiment can be obtained. Also in the second embodiment, only the P picture in the Even field can be targeted, and the same effect as in the first embodiment can be obtained.

本発明の実施の形態１における動画像符号化装置の構成を示すブロック図である。It is a block diagram which shows the structure of the moving image encoder in Embodiment 1 of this invention. 本発明の実施の形態１における動画像符号化装置の詳細構成を示すブロック図である。It is a block diagram which shows the detailed structure of the moving image encoder in Embodiment 1 of this invention. 本発明の実施の形態１におけるフレームメモリに入力される画像信号を入力順に示した説明図である。It is explanatory drawing which showed the image signal input into the frame memory in Embodiment 1 of this invention in the order of input. 本発明の実施の形態１における符号化順に並べ替えた画像信号の説明図である。It is explanatory drawing of the image signal rearranged in the encoding order in Embodiment 1 of this invention. 本発明の実施の形態１における特徴量抽出回路および静止画像判定回路の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the feature-value extraction circuit and still image determination circuit in Embodiment 1 of this invention. 本発明の実施の形態１における同一パリティのフィールド間のフィールド間差分絶対値和ＳＳＡＤ、および異パリティのフィールド間のフィールド間差分絶対値和ＤＳＡＤの算出に関する説明図である。It is explanatory drawing regarding calculation of the inter-field difference absolute value sum SSAD between the fields of the same parity and the inter-field difference absolute value sum DSAD between the fields of the different parity in Embodiment 1 of the present invention. 本発明の実施の形態１における画像信号の入力順と符号化順の時間軸での対応関係を示す説明図である。It is explanatory drawing which shows the correspondence on the time axis of the input order of an image signal and encoding order in Embodiment 1 of this invention. 本発明の実施の形態１における符号化制御部内の制御回路の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the control circuit in the encoding control part in Embodiment 1 of this invention. 本発明の実施の形態１における重み係数Ｗに基づく目標情報発生量の分配に関する説明図である。It is explanatory drawing regarding distribution of the target information generation amount based on the weighting coefficient W in Embodiment 1 of this invention. 本発明の実施の形態２における符号化制御部内の制御回路の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the control circuit in the encoding control part in Embodiment 2 of this invention. 本発明の実施の形態２におけるバッファの空き容量に基づく目標情報発生量の分配に関する説明図である。It is explanatory drawing regarding distribution of the target information generation amount based on the free capacity of the buffer in Embodiment 2 of this invention. 本発明の実施の形態２のフィールド構造による符号化結果におけるＯｄｄフィールドとＥｖｅｎフィールドのＳＮＲの遷移を示す図である。It is a figure which shows the transition of the SNR of the Odd field and Even field in the encoding result by the field structure of Embodiment 2 of this invention. インターレース画像を示す図である。It is a figure which shows an interlace image. 符号化の対象となる入力画像信号の構造を示す説明図である。It is explanatory drawing which shows the structure of the input image signal used as the object of an encoding. ＧＯＰ構造の一例を示す説明図である。It is explanatory drawing which shows an example of a GOP structure. 動き補償予測における参照面の一例を示す説明図である。It is explanatory drawing which shows an example of the reference surface in motion compensation prediction. フィールド構造による符号化結果におけるＯｄｄフィールドとＥｖｅｎフィールドのＳＮＲの遷移を示す図である。It is a figure which shows the transition of SNR of the Odd field and Even field in the encoding result by a field structure.

符号の説明Explanation of symbols

１動画像符号化装置、１００符号化部、１０１フレームメモリ、１０２ブロック化回路、１０３減算器、１０４変換回路、１０５量子化回路、１０６逆量子化回路、１０７逆変換回路、１０８加算器、１０９フレームメモリ、１１０動き補償回路、１１１符号化回路、１１２バッファ、１１３特徴量抽出回路、２００静止画像判定部、２０１静止画像判定回路、３００符号化制御部、３０１制御回路、３０２メモリ。 DESCRIPTION OF SYMBOLS 1 Moving image encoder, 100 encoding part, 101 frame memory, 102 block circuit, 103 subtractor, 104 conversion circuit, 105 quantization circuit, 106 inverse quantization circuit, 107 inverse conversion circuit, 108 adder, 109 Frame memory, 110 motion compensation circuit, 111 encoding circuit, 112 buffer, 113 feature extraction circuit, 200 still image determination unit, 201 still image determination circuit, 300 encoding control unit, 301 control circuit, 302 memory

Claims

時間的に連続したインターレース画像である入力画像信号をＧＯＰ単位のフィールド構造で圧縮符号化する符号化部を有する動画像符号化装置において、
前記入力画像信号の中から画像特徴量を抽出する特徴量抽出回路と、
前記画像特徴量に基づいて前記入力画像信号が静止画像または動きが緩慢な画像であるか否かを示す静止画像判定情報を生成する静止画像判定部と、
前記静止画像判定情報に応じて符号化の対象となるピクチャの目標情報発生量を算出し、前記符号化部で符号化を実行するフィールドの実情報発生量が前記目標情報発生量に納まるように前記符号化部を制御する符号化制御部と
を備えることを特徴とする動画像符号化装置。 In a video encoding device having an encoding unit that compresses and encodes an input image signal, which is an interlaced image that is temporally continuous, with a field structure in units of GOPs,
A feature amount extraction circuit for extracting an image feature amount from the input image signal;
A still image determination unit that generates still image determination information indicating whether the input image signal is a still image or a slow-moving image based on the image feature amount;
A target information generation amount of a picture to be encoded is calculated according to the still image determination information, and an actual information generation amount of a field in which encoding is performed by the encoding unit falls within the target information generation amount. A video encoding apparatus comprising: an encoding control unit that controls the encoding unit.

請求項１に記載の動画像符号化装置において、
前記特徴量抽出回路は、時間軸上で連続する同一パリティのフィールド間差分情報、および時間軸上で連続する異パリティのフィールド間差分情報を前記画像特徴量として抽出し、
前記静止画像判定部は、前記同一パリティのフィールド間差分情報と前記異パリティのフィールド間差分情報との比率を前記静止画像判定情報として生成し、
前記符号化制御部は、前記比率と所定閾値との比較結果に応じて前記目標情報発生量を算出する
ことを特徴とする動画像符号化装置。 The moving image encoding device according to claim 1,
The feature amount extraction circuit extracts the difference information between fields of the same parity continuous on the time axis and the difference information between fields of the different parity continuous on the time axis as the image feature amounts,
The still image determination unit generates a ratio between the difference information between fields of the same parity and the difference information between fields of different parity as the still image determination information,
The encoding control unit calculates the target information generation amount according to a comparison result between the ratio and a predetermined threshold value.

請求項１または２に記載の動画像符号化装置において、
前記符号化制御部は、
符号化の対象となるピクチャがＩピクチャであり、前記Ｉピクチャに対応する静止画像判定情報が静止画像または動きが緩慢な画像であることを示す情報である場合には、前記Ｉピクチャに対する目標情報発生量を小さくするように第１の補正量で補正し、
符号化の対象となるピクチャがＰピクチャであり、前記Ｐピクチャに対応する静止画像判定情報が静止画像または動きが緩慢な画像であることを示す情報であり、かつ前記Ｐピクチャが前記ＧＯＰ単位内の先頭からカウントして所定枚数未満の場合には、該当する１以上のＰピクチャに対する目標情報発生量を、前記Ｉピクチャの目標情報発生量の補正に用いた前記第１の補正量を分配して大きくなるように補正する
ことを特徴とする動画像符号化装置。 The moving picture encoding apparatus according to claim 1 or 2,
The encoding control unit
When the picture to be encoded is an I picture, and the still picture determination information corresponding to the I picture is information indicating that the picture is a still picture or a slow-moving picture, target information for the I picture Correct with the first correction amount to reduce the generation amount,
The picture to be encoded is a P picture, the still picture determination information corresponding to the P picture is information indicating that the picture is a still picture or a slow-moving picture, and the P picture is within the GOP unit. If the number is less than the predetermined number, the target information generation amount for the corresponding one or more P pictures is distributed as the first correction amount used for correcting the target information generation amount of the I picture. The moving picture coding apparatus is characterized in that correction is performed so as to increase.

請求項３に記載の動画像符号化装置において、
前記符号化制御部は、Ｐピクチャに対する補正後の目標情報発生量を、前記Ｐピクチャの１つ前のピクチャの符号化完了後における前記符号化部のバッファ空き容量に基づいてさらに補正することを特徴とする動画像符号化装置。 In the moving image encoding device according to claim 3,
The encoding control unit further corrects the corrected target information generation amount for the P picture based on the buffer free space of the encoding unit after the encoding of the picture immediately before the P picture is completed. A moving image encoding device.

請求項１または２に記載の動画像符号化装置において、
前記符号化制御部は、
符号化の対象となるピクチャがＩピクチャであり、前記Ｉピクチャに対応する静止画像判定情報が静止画像または動きが緩慢な画像であることを示す情報である場合には、前記Ｉピクチャに対する目標情報発生量を小さくするように第１の補正量で補正し、
符号化の対象となるピクチャがＰピクチャであり、前記Ｐピクチャに対応する静止画像判定情報が静止画像または動きが緩慢な画像であることを示す情報であり、かつ前記Ｐピクチャが前記ＧＯＰ単位内の先頭からカウントして所定枚数未満の場合には、該当する前記Ｐピクチャに対する目標情報発生量を、前記Ｐピクチャの１つ前のピクチャの符号化完了後における前記符号化部のバッファ空き容量に基づいて補正する
ことを特徴とする動画像符号化装置。 The moving picture encoding apparatus according to claim 1 or 2,
The encoding control unit
When the picture to be encoded is an I picture, and the still picture determination information corresponding to the I picture is information indicating that the picture is a still picture or a slow-moving picture, target information for the I picture Correct with the first correction amount to reduce the generation amount,
The picture to be encoded is a P picture, the still picture determination information corresponding to the P picture is information indicating that the picture is a still picture or a slow-moving picture, and the P picture is within the GOP unit. Is less than a predetermined number, the target information generation amount for the corresponding P picture is set to the buffer free space of the encoding unit after the encoding of the picture immediately before the P picture is completed. A moving picture encoding apparatus, wherein correction is performed based on the correction.

請求項３ないし５のいずれか１項に記載の動画像符号化装置において、
前記符号化制御部は、符号化の対象となるピクチャがＥｖｅｎフィールドのＰピクチャに対してのみ、Ｐピクチャに対する目標情報発生量の補正を行うことを特徴とする動画像符号化装置。 In the moving image encoder according to any one of claims 3 to 5,
The moving picture coding apparatus, wherein the coding control unit corrects the target information generation amount for a P picture only when the picture to be coded is a P picture in the Even field.