JP4662969B2

JP4662969B2 - Image processing apparatus and method

Info

Publication number: JP4662969B2
Application number: JP2007290754A
Authority: JP
Inventors: 雅彦吉本; 勇一郎村地; 博川口; 祐貴福山; 亮山本; 吉雄松田; 正幸深山
Original assignee: 株式会社半導体理工学研究センター
Priority date: 2007-11-08
Filing date: 2007-11-08
Publication date: 2011-03-30
Anticipated expiration: 2027-11-08
Also published as: JP2009116730A

Description

The present invention relates to an image processing apparatus and method for calculating optical flow on hierarchized image data.

Optical flow is the per-pixel motion vector between two consecutive frames of a moving image, and is used as a building block of moving-image recognition. That is, optical flow is expressed as a per-pixel velocity vector representing image motion, and pixel motion is detected by obtaining this velocity vector. It is applied to moving-image recognition in the automotive field (white line detection, obstacle recognition), the robotics field (object and moving-body recognition, motion detection for manipulation), and the multimedia communication field (multimedia content delivery with high image quality and high compression). Because automotive and robotic applications in particular require real-time processing, speeding up the optical flow calculation is essential.

The calculation of optical flow by the conventional hierarchical Lucas & Kanade method is described below. Let I be the image of the current frame and J the image of the next frame, and let I(x, y) and J(x, y) be their luminance values at position X = (x, y). The displacement d = (d_x, d_y) from a point u = (u_x, u_y) on I to the most similar point v = u + d = (u_x + d_x, u_y + d_y) on J is the optical flow, and it is defined as the vector that minimizes the matching error function given by equation (1). In this specification, equations entered as images are numbered in black lenticular brackets and equations entered as text are numbered in square brackets, and the two kinds are used together. In addition, a serial equation number of the form "equation (1)" is appended to the end of each equation (some equations are unnumbered).

Here, the matching error function is computed over the neighboring pixels in a window of integer size (2w_x + 1, 2w_y + 1). Values of 2, 3, 4, 5, 6, or 7 pixels are mainly used for w_x and w_y.
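The equation image for the matching error function is not reproduced in this text. Assuming the usual Lucas & Kanade formulation as given in Non-Patent Document 2, it would take the following form, summing squared luminance differences over the window around u:

```latex
\epsilon(d) = \epsilon(d_x, d_y)
  = \sum_{x = u_x - w_x}^{u_x + w_x} \; \sum_{y = u_y - w_y}^{u_y + w_y}
    \bigl( I(x, y) - J(x + d_x,\, y + d_y) \bigr)^2
  \tag{1}
```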

To obtain optical flow accurately even for large motion, the image is hierarchized as shown in FIG. 20. In FIG. 20, hierarchy level 0 corresponds to the original image and has the highest resolution. The luminance values of the next higher level are obtained by, for example, the filter shown in FIG. 21 followed by 1/2 subsampling. Repeating this from lower to higher levels yields successively higher levels of the hierarchized image. Practical values for the number of levels are 2, 3, and 4.
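As a minimal sketch of this level-generation step, the following Python builds each higher level by low-pass filtering and keeping every second pixel. The separable [1/4, 1/2, 1/4] kernel and the border replication are assumptions borrowed from common pyramidal Lucas & Kanade implementations; the actual filter of FIG. 21 is not reproduced in this text, and the function names are illustrative.

```python
def downsample(image):
    """One pyramid step: 3x3 low-pass filter, then 1/2 subsampling.

    `image` is a list of rows of luminance values. The kernel is an
    assumption, not the filter of FIG. 21.
    """
    h, w = len(image), len(image[0])

    def px(y, x):
        # Replicate border pixels by clamping the coordinates.
        return image[min(max(y, 0), h - 1)][min(max(x, 0), w - 1)]

    k = (0.25, 0.5, 0.25)
    out = []
    for y in range(0, h, 2):          # keep every second row
        row = []
        for x in range(0, w, 2):      # keep every second column
            acc = 0.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    acc += k[dy + 1] * k[dx + 1] * px(y + dy, x + dx)
            row.append(acc)
        out.append(row)
    return out


def build_pyramid(image, levels):
    """Return [level 0 (original), level 1, ...] with `levels` entries."""
    pyramid = [image]
    for _ in range(levels - 1):
        pyramid.append(downsample(pyramid[-1]))
    return pyramid
```

With `levels` set to 2, 3, or 4 this matches the practical range stated above; each level has half the width and height of the one below it.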

In optical flow derivation with hierarchization, the optical flow is first calculated at the highest level L_m. The result is propagated to the next lower level L_{m-1} as the initial value of its optical flow, and the result at level L_{m-1} is in turn propagated to level L_{m-2}. Repeating this down to the lowest level yields the final optical flow. Taking the optical flow g^L calculated at the higher levels as the initial estimate, the matching error function at the current level is defined by equation (2).
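The equation image for the level-L matching error function is not reproduced in this text. Assuming it follows the pyramidal formulation of Non-Patent Document 2, with u^L denoting the position of the point of interest at level L (a symbol not defined in the text above), it would read:

```latex
\epsilon^{L}(d^{L})
  = \sum_{x = u^{L}_x - w_x}^{u^{L}_x + w_x} \; \sum_{y = u^{L}_y - w_y}^{u^{L}_y + w_y}
    \bigl( I^{L}(x, y) - J^{L}(x + g^{L}_x + d^{L}_x,\; y + g^{L}_y + d^{L}_y) \bigr)^2
  \tag{2}
```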

Here, d^L (the residual) is the optical flow to be found at the current level: the large motion g^L is calculated at the higher levels, and the small motion d^L at the current level. The residual optical flow is propagated into the initial vector g^{L-1} of the next level L-1. That is, the sum of the optical flow g^L obtained at the higher levels and the residual optical flow d^L obtained at the current level, scaled by 2 for the doubled resolution, becomes the initial value g^{L-1} of the flow at the lower level L-1. Propagation to the lower level is expressed by the following equation.

[Equation 1]
g^{L-1} = 2 (g^{L} + d^{L})    (3)

The residual optical flow d^{L-1} at the next level is calculated in the same way, and this process is repeated down to the lowest level, finally yielding the optical flow d of equation (1).
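The coarse-to-fine propagation of equation (3) can be sketched in a few lines of Python. The sketch assumes that the initial estimate at the highest level is zero and that the final flow is g^0 + d^0 at level 0, as in Non-Patent Document 2; neither detail is stated explicitly in the text above, and the function name is illustrative.

```python
def propagate(residuals_top_down):
    """Combine per-level residual flows d^L into the level-0 optical flow.

    `residuals_top_down` lists (dx, dy) residuals from the highest level
    down to level 0. Assumes g = (0, 0) at the highest level.
    """
    if not residuals_top_down:
        raise ValueError("at least one level is required")
    gx, gy = 0.0, 0.0
    # Equation (3): g^{L-1} = 2 (g^L + d^L) for every level above level 0.
    for dx, dy in residuals_top_down[:-1]:
        gx, gy = 2.0 * (gx + dx), 2.0 * (gy + dy)
    # Level 0: the final flow is the initial estimate plus the residual.
    dx0, dy0 = residuals_top_down[-1]
    return (gx + dx0, gy + dy0)
```

A residual of one pixel found two levels up therefore contributes four pixels to the final flow, which is how a large motion is handled as a small one at coarse resolution.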

Calculating optical flow requires solving a set of independent equations for each pixel; for a CIF 30 fps sequence the computational load exceeds several tens of GOPS. Software processing on a general-purpose processor must therefore restrict the calculation range, and thus lower the optical flow detection accuracy, in order to sustain real-time processing. Dedicated hardware is indispensable for high-accuracy, full-frame processing in real time. The HOE processor (see, for example, Non-Patent Document 1), one such optical flow processor, achieves a maximum resolution of CIF at 30 fps, but its chip area is large and its cost burden is high. Since moving-image recognition will have to support high-resolution video in the future, an optical flow processor that supports higher resolution with a small chip area is needed.

To realize a high-speed optical flow processor, use of the PLK algorithm has been proposed (see, for example, Non-Patent Document 2). The PLK (Pyramidal Lucas & Kanade) algorithm, published in Non-Patent Document 2, introduces a hierarchization technique for handling abrupt motion into the Lucas & Kanade method. Because optical flow algorithms generally rely on a linear approximation, flow detection accuracy degrades in sequences containing large motion. With a hierarchization technique such as that shown in FIG. 20, a large displacement can be treated as a small displacement, so that large motion can be handled. Compared with other algorithms (see, for example, Non-Patent Documents 3 to 6), the PLK algorithm requires less computation and less memory and is more accurate, which makes it well suited to VLSI implementation.

FIG. 22 is a flowchart showing optical flow calculation processing (hereinafter, "OPF (OPtical Flow) calculation") by the PLK algorithm according to the prior art. In FIG. 22, the overall OPF calculation processing by the PLK algorithm (the processing after the image data of the two frames has been loaded into memory) includes, as is well known:
(a) hierarchized image generation (hereinafter, "PIC (Pyramidal Image Creation) calculation") processing (steps S2 and S3);
(b) spatial luminance gradient matrix calculation (hereinafter, "SGM (Spatial Gradient Matrix) calculation") processing (step S4);
(c) mismatch vector calculation (hereinafter, "MMV (Mismatch Vector) calculation") processing (step S5); and
(d) OPF calculation processing (step S6).
In FIG. 22, itr is the iteration count parameter and itrmax is the maximum number of iterations. L is the hierarchy level parameter and Lmax is its maximum value. Step S1 is the initialization that sets these maximum values.

In the PLK algorithm, the optical flow u at coordinates (x, y) is calculated by the following equation.

[Equation 2]
G u = b    (4)

Here, I_x, I_y, and I_t are the luminance gradients in the x, y, and t directions, respectively. G is called the spatial luminance gradient matrix and b the mismatch vector, and w is a weighting coefficient. G and b are calculated within a small region (window) centered on the pixel of interest, and the optical flow u is obtained by solving equation (4). Σ denotes summation over all pixels in the window. The solution can be refined iteratively by the Newton-Raphson method: the center pixel is redefined using the optical flow just obtained, and equation (4) is solved repeatedly so that the result converges to a more accurate solution.
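The definitions of G and b appear in the original as equation images that are not reproduced in this text. In the standard Lucas & Kanade formulation, which the description above matches, they would be written as follows. The sign of b depends on how the temporal gradient I_t is defined, so it is an assumption here (I_t is taken as next frame minus current frame):

```latex
G = \sum w
    \begin{pmatrix}
      I_x^2   & I_x I_y \\
      I_x I_y & I_y^2
    \end{pmatrix},
\qquad
b = -\sum w
    \begin{pmatrix}
      I_x I_t \\
      I_y I_t
    \end{pmatrix}
```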

Patent Document 1: Japanese Patent Application Laid-Open No. 11-195108.
Patent Document 2: Japanese Patent Application Laid-Open No. 2000-155831 (JP 2000-155831 A).
Non-Patent Document 1: N. Minegishi et al., "VLSI Architecture Study of a Real-Time Scalable Optical Flow Processor for Video Segmentation", IEICE Transactions on Electronics, Vol. E89-C, No. 3, pp. 230-242, 2006.
Non-Patent Document 2: Jean-Yves Bouguet, "Pyramidal Implementation of the Lucas Kanade Feature Tracker: Description of the algorithm", OpenCV Documentation, Intel Corporation Microprocessor Research Laboratories, 1999.
Non-Patent Document 3: T. Yamamoto et al., "Improvement of Optical Flow by Moving Object Detection using Temporal Correlation", Transactions IIITE, Vol. 55, No. 6, pp. 907-911, 2001.
Non-Patent Document 4: B.K.P. Horn et al., "Determining Optical Flow", Artificial Intelligence, Vol. 17, pp. 185-204, 1981.
Non-Patent Document 5: J.L. Barron et al., "Performance of Optical Flow Techniques", International Journal of Computer Vision, Vol. 12, No. 1, pp. 43-77, 1994.
Non-Patent Document 6: B. Lucas et al., "An Iterative Image Registration Technique With An Application to Stereo Vision", Proceedings of DARPA Image Understanding Workshop, pp. 121-130, 1981.

However, because no restriction is placed on the update value when obtaining the optical flow, the cache memory inside the arithmetic circuit becomes large and the calculation accuracy also degrades. In addition, the circuit scale and cycle count of the division circuit are large.

An object of the present invention is to solve the above problems and to provide an image processing apparatus and method that can calculate optical flow in a shorter time than the prior art, thereby reducing computational cost, while also improving calculation accuracy.

The image processing apparatus according to the first invention comprises:
hierarchization calculation means for generating hierarchized image data by performing subsampling and filtering on the image data of two frames; and
optical flow calculation means for calculating a spatial luminance gradient matrix and a mismatch vector based on the hierarchized image data generated by the hierarchization calculation means, and for calculating an optical flow based on the calculated spatial luminance gradient matrix and mismatch vector,
wherein, when propagating the calculated optical flow from a higher level to a lower level, the optical flow calculation means updates the optical flow subject to a predetermined upper limit value, thereby restricting the movable range of the optical flow to a predetermined range.

In the above image processing apparatus, the optical flow calculation means performs multiplication on the calculated spatial luminance gradient matrix and mismatch vector and then, when performing division using a restoring division algorithm, deletes the extension bits from the result of the division.
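For readers unfamiliar with restoring division, the following Python sketch shows the textbook unsigned bit-serial form: one quotient bit is produced per step by a trial subtraction, and the partial remainder is kept unchanged ("restored") when the subtraction would go negative. It is an illustration of the general algorithm only. It does not model the division circuit of this invention or its handling of extension bits, and the function name and the `bits` parameter are assumptions.

```python
def restoring_divide(dividend, divisor, bits):
    """Unsigned restoring division; returns (quotient, remainder).

    `bits` is the word length of the dividend, i.e. the number of steps.
    """
    if divisor <= 0:
        raise ValueError("divisor must be positive")
    if dividend < 0 or dividend >= (1 << bits):
        raise ValueError("dividend does not fit in the given word length")
    quotient = 0
    remainder = 0
    for i in range(bits - 1, -1, -1):
        # Shift the next dividend bit into the partial remainder.
        remainder = (remainder << 1) | ((dividend >> i) & 1)
        trial = remainder - divisor
        if trial >= 0:
            remainder = trial        # subtraction succeeded: quotient bit is 1
            quotient |= 1 << i
        # Otherwise the partial remainder is left as it was (restored)
        # and the quotient bit stays 0.
    return quotient, remainder
```

Because each quotient bit costs one step, the cycle count grows with the word length, which is why trimming bits in the divider reduces both area and cycles.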

Further, in the above image processing apparatus, the hierarchization calculation means and the optical flow calculation means execute pipeline processing using a predetermined plurality of arithmetic circuits, and start calculating the optical flow of the next region without waiting for the calculation of the optical flow of a given region of the image data to finish.

The image processing method according to the second invention is a method in which an arithmetic processor device executes:
a hierarchization calculation step of generating hierarchized image data by performing subsampling and filtering on the image data of two frames; and
an optical flow calculation step of calculating a spatial luminance gradient matrix and a mismatch vector based on the hierarchized image data generated in the hierarchization calculation step, and of calculating an optical flow based on the calculated spatial luminance gradient matrix and mismatch vector,
wherein, when the calculated optical flow is propagated from a higher level to a lower level, the optical flow calculation step updates the optical flow subject to a predetermined upper limit value, thereby restricting the movable range of the optical flow to a predetermined range.

In the above image processing method, the optical flow calculation step performs multiplication on the calculated spatial luminance gradient matrix and mismatch vector and then, when performing division using a restoring division algorithm, deletes the extension bits from the result of the division.

Further, in the above image processing method, the hierarchization calculation step and the optical flow calculation step execute pipeline processing using a predetermined plurality of arithmetic circuits, and start calculating the optical flow of the next region without waiting for the calculation of the optical flow of a given region of the image data to finish.

According to the image processing apparatus and method of the present invention, when the calculated optical flow is propagated from a higher level to a lower level, the optical flow is updated subject to a predetermined upper limit value, thereby restricting its movable range to a predetermined range. Preferably, after multiplication is performed on the calculated spatial luminance gradient matrix and mismatch vector, the extension bits are deleted from the division result when the division is performed using a restoring division algorithm. More preferably, the hierarchization calculation step and the optical flow calculation step execute pipeline processing using a predetermined plurality of arithmetic circuits, and calculation of the optical flow of the next region starts without waiting for the calculation for a given region of the image data to finish. As a result, the cache capacity of the arithmetic processor, the area of the division circuit, and the number of processing cycles can be greatly reduced compared with the prior art, which lowers the power consumption, raises the speed, and reduces the area of the processor hardware. Accordingly, optical flow can be calculated in a shorter time than with the prior art, and the hardware of an optical flow processor can be made much smaller.

Embodiments of the present invention are described below with reference to the drawings. In the following embodiments, like components are denoted by like reference numerals.

First embodiment.
FIG. 1 is a block diagram showing the detailed configuration of the OPF arithmetic processor 20 according to the first embodiment of the present invention. FIG. 2 is a diagram of an image frame 100 showing that a limited movable range 101 is provided for window B of the image in the optical flow calculation executed by the OPF arithmetic processor 20 of FIG. 1. The OPF arithmetic processor 20 according to the first embodiment is characterized by:
(A) setting an upper limit on the update value of the optical flow, for example truncating values of 3 pixels or more in the x and y directions;
(B) interleaving the pixel processing order in the optical flow calculation so as to avoid idle periods in the pipeline processing; and
(C) reducing the number of extension bits of the division circuit 32 (see FIG. 6) in the OPF calculation circuit 30.

In FIG. 1, the arithmetic processor integrated circuit including the OPF arithmetic processor 20 is implemented as, for example, a VLSI, and comprises a CPU 10 for overall control, a CPU bus 20a, an image memory 11 that stores in advance the moving image data to be processed, a memory bus 20b, the OPF arithmetic processor 20, a calculation result data memory (external memory) 12 connected to the CPU bus 20a, an interface 13, and a display 14. The OPF arithmetic processor 20 comprises:
(a) a sequence controller 21 that is connected to the CPU bus 20a and controls the processing sequence within the OPF arithmetic processor 20 under the control of the CPU 10;
(b) an address generator 22 that is connected to the memory bus 20b and, under control from the image memory 11, generates addresses within the OPF arithmetic processor 20 and outputs them to each circuit;
(c) a frame A memory 23 that stores the image data of the first frame image from the image memory 11;
(d) a frame B memory 24 that stores the image data of the second frame image from the image memory 11;
(e) a PIC calculation circuit 25 that generates the image data of the hierarchized images by calculation from the image data of each frame stored in the memories 23 and 24;
(f) a hierarchized image A memory 26 that stores the first hierarchized image data generated by the PIC calculation circuit 25;
(g) a hierarchized image B memory 27 that stores the second hierarchized image data generated by the PIC calculation circuit 25;
(h) an SGM calculation circuit 28 that generates the spatial luminance gradient matrix by calculation from the image data stored in the hierarchized image A memory 26;
(i) an MMV calculation circuit 29 that generates the mismatch vector by calculation from the image data stored in the hierarchized image B memory 27; and
(j) an OPF calculation circuit 30 that calculates the optical flow from the calculated spatial luminance gradient matrix and mismatch vector.
The PIC calculation circuit 25, the SGM calculation circuit 28, the MMV calculation circuit 29, and the OPF calculation circuit 30 each perform pipeline processing in parallel.

Based on the image data of the two frames stored in the memories 23 and 24, the PIC calculation circuit 25 performs subsampling and Gaussian filtering as shown in FIG. 20, generates the hierarchized image data, and outputs it to the memories 26 and 27 for storage. The SGM calculation circuit 28 calculates the spatial luminance gradient matrix G from the hierarchized image data stored in the memory 26. The SGM calculation circuit 28 first interpolates luminance values at fractional (sub-pixel) positions from the hierarchized image data of the current frame and creates window A, the small processing region in the current frame. Next, it obtains the spatial luminance gradients in the x and y directions within window A. Using these gradients, it calculates the elements of the spatial luminance gradient matrix for each pixel. Finally, it sums the resulting elements along the row direction. Repeating these steps as many times as the window size yields the spatial luminance gradient matrix G summed over the window.

The MMV calculation circuit 29 calculates the mismatch vector b from the hierarchized image data stored in the memory 27. It first uses the optical flow of the higher level to interpolate, from the hierarchized image of the next frame, the luminance values at the fractional (sub-pixel) destination positions. Next, it creates window B and obtains the temporal luminance gradient from window B and the window A obtained by the SGM calculation circuit 28. It then calculates the mismatch vector of each pixel from the temporal luminance gradient and the spatial luminance gradients obtained by the SGM calculation circuit 28. Finally, as in the SGM calculation circuit 28, the results are summed to give the mismatch vector b summed over the window.

The OPF calculation circuit 30 calculates the optical flow u from the spatial luminance gradient matrix G and the mismatch vector b. It solves equation (4) using the G and b calculated by the SGM calculation circuit 28 and the MMV calculation circuit 29. That is, the optical flow u can be calculated by the following equation.

[Equation 3]
u = G^{-1} b    (7)
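The SGM, MMV, and OPF steps for a single window can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the circuit described here: it uses integer pixel positions only (no sub-pixel interpolation), central differences for the spatial gradients, a uniform weight w = 1, a zero initial flow, and the sign convention I_t = J - I. All function and variable names are illustrative.

```python
def gradient_sums(img_i, img_j, cx, cy, w):
    """Accumulate G (gxx, gxy, gyy) and b (bx, by) over a (2w+1) x (2w+1) window.

    The window centered on (cx, cy) must lie at least one pixel inside
    the image so that the central differences are defined.
    """
    gxx = gxy = gyy = bx = by = 0.0
    for y in range(cy - w, cy + w + 1):
        for x in range(cx - w, cx + w + 1):
            ix = (img_i[y][x + 1] - img_i[y][x - 1]) / 2.0  # gradient in x
            iy = (img_i[y + 1][x] - img_i[y - 1][x]) / 2.0  # gradient in y
            it = img_j[y][x] - img_i[y][x]                  # gradient in t
            gxx += ix * ix
            gxy += ix * iy
            gyy += iy * iy
            bx -= it * ix
            by -= it * iy
    return gxx, gxy, gyy, bx, by


def solve_flow(gxx, gxy, gyy, bx, by, eps=1e-12):
    """Solve G u = b with the closed-form 2x2 inverse.

    Returns None when G is (nearly) singular, e.g. in a textureless window.
    """
    det = gxx * gyy - gxy * gxy
    if abs(det) < eps:
        return None
    return ((gyy * bx - gxy * by) / det, (gxx * by - gxy * bx) / det)


# Example: a quadratic test pattern shifted by (0.5, -0.25) pixels.
frame_i = [[(x - 4) ** 2 + 2 * (y - 4) ** 2 for x in range(9)] for y in range(9)]
frame_j = [[(x - 4.5) ** 2 + 2 * (y - 3.75) ** 2 for x in range(9)] for y in range(9)]
u = solve_flow(*gradient_sums(frame_i, frame_j, cx=4, cy=4, w=2))
```

For this symmetric test pattern a single solve recovers the shift. On real images the linear approximation is only approximate, which is why the solve is iterated and combined with the hierarchy.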

Finally, the calculated optical flow is added to the optical flow obtained in the previous calculation to update the optical flow. The calculated optical flow u is stored in the calculation result data memory 12 and then output via the interface 13 to the display 14 for display.

When real-time processing is attempted with a conventional OPF processor, problems such as increased cache capacity and increased cycle count make it very difficult to reconcile real-time processing with small area and low power consumption. In contrast, the first embodiment of the present invention proposes two elemental techniques for achieving real-time processing, small area, and low power consumption at the same time, through improved accuracy, reduced cache capacity, reduced cycle count, and the like.

In the first embodiment of the present invention, two new techniques are proposed as elemental technologies of the OPF processor 20. The first is setting an upper limit on the optical flow update value, and the second is window interleaving. Each is described below.

First, the setting of the upper limit on the optical flow according to the first proposed technique is described. The conventional OPF processor sets no upper limit on the magnitude of the optical flow, so even when noise or the like produces an implausibly large update value, the calculation simply proceeds. The first proposed technique instead sets an upper limit on the magnitude of the optical flow: when an optical flow of length 3 or more is detected, it is truncated to 3 (FIG. 2). In FIG. 2, when the OPF calculation is performed for the next window B following window B in the image frame 100, a limited movable range 101 is set for window B and the update value is restricted to that range (that is, an upper limit is set). Specifically, the flowchart of FIG. 3 is used.

FIG. 3 is a flowchart showing the optical flow calculation processing by the PLK algorithm executed by the OPF arithmetic processor 20 of FIG. 1. Compared with the prior-art processing of FIG. 22, the processing of FIG. 3 is characterized by the added steps S11 to S14. In step S11, which follows the OPF calculation processing of step S6, if the x component u_x of the calculated optical flow u is 3 or less it is used as is; if it exceeds 3, u_x is updated to the upper limit 3 in step S12. Next, in step S13, if the y component u_y of the calculated optical flow u is 3 or less it is used as is; if it exceeds 3, u_y is updated to the upper limit 3 in step S14. Processing then proceeds to step S7. Although the upper limit is 3 here, the present invention is not limited to this value, and another predetermined upper limit may be used.
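Steps S11 to S14 amount to a per-component clamp, sketched below in Python. The default branch follows the text literally and limits only values above the upper limit. The `symmetric` option, which also limits large negative components, is an assumption added for illustration, since the text does not say how negative components are treated; the function name is likewise illustrative.

```python
def limit_flow(u, limit=3.0, symmetric=False):
    """Apply the upper limit of steps S11 to S14 to a flow vector (ux, uy)."""
    ux, uy = u
    if symmetric:
        # Assumption: bound the magnitude of each component in both directions.
        ux = max(-limit, min(limit, ux))
        uy = max(-limit, min(limit, uy))
    else:
        if ux > limit:   # S11: does the x component exceed the limit?
            ux = limit   # S12: update it to the upper limit
        if uy > limit:   # S13: does the y component exceed the limit?
            uy = limit   # S14: update it to the upper limit
    return (ux, uy)
```

For example, `limit_flow((4.2, 1.0))` leaves the y component untouched and returns `(3.0, 1.0)`.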

The first proposed method offers the following three advantages over the conventional method. The first advantage is improved accuracy. A correct optical flow never exceeds a certain magnitude, so accepting an extremely large optical flow that was obtained erroneously, for example under the influence of noise, degrades accuracy.

FIG. 4 shows simulation results for the OPF calculation processor 20 according to the prior art and the first embodiment, giving the mean absolute error (MAE) for each parameter setting. FIG. 5 shows the corresponding simulation results for the mean squared error (MSE). In FIGS. 4 and 5, L denotes the number of layers, W the window size, and I the number of iterations. As is apparent from FIGS. 4 and 5, when the upper limit is set, the MAE is almost unchanged while the MSE is greatly improved.

The second advantage is a reduction in cache capacity. Without an upper limit on the optical flow, the pixels of the entire frame must be held in the cache, so the cache capacity tends to be large. With an upper limit, only part of the frame's pixels need to be held in the cache, so the cache capacity can be reduced. The cache capacity, which is 1200 kilobytes when the entire frame is held, can be reduced by 98.5% to 18 kilobytes.
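The stated reduction can be checked with simple arithmetic. The two capacities are taken from the description; the variable names are illustrative only.

```python
full_frame_kb = 1200   # cache needed when the whole frame is held
limited_kb = 18        # cache needed once the flow has an upper limit

# Relative reduction in cache capacity.
reduction = (full_frame_kb - limited_kb) / full_frame_kb
print(f"{reduction:.1%}")   # 98.5%
```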

The third advantage is a reduction in the number of cycles and the circuit scale of the division circuits 32-1 to 32-4 (collectively denoted by reference numeral 32) in the OPF calculation circuit 30, which use the restoring (recovery-method) division algorithm. FIG. 6 is a block diagram showing the configuration of the OPF calculation circuit 30 of FIG. 1. In FIG. 6, the OPF calculation circuit 30 comprises a 32-bit multiplication circuit 31 and four division circuits 32-1 to 32-4 for pipeline processing, in order to calculate the optical flow u using Equation (4). The matrix G' and the vector b' after multiplication by the 32-bit multiplication circuit 31 are expressed by the following equations.

[Equation 4]
G' = [XXX...X000...0] (8)
Here, [XXX...X] = G and [000...0] = extension bits.
[Equation 5]
b' = [000...0YYY...Y] (9)
Here, [000...0] = extension bits and [YYY...Y] = b.
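The bit layouts of Equations (8) and (9) can be pictured with plain integers. The sketch below is illustrative only: the function name `extend_operands` and the parameter `ext_bits` are hypothetical, and unsigned operands are assumed for simplicity.

```python
def extend_operands(G, b, ext_bits):
    """Return (G', b') laid out as in Equations (8) and (9).

    G' places G in the upper bits with `ext_bits` zeros appended below it;
    b' keeps b in the lower bits, so the extension bits above it are zero.
    Unsigned integers are an assumption of this sketch.
    """
    G_ext = G << ext_bits   # [XXX...X000...0]
    b_ext = b               # [000...0YYY...Y]: the leading zeros are implicit
    return G_ext, b_ext


n = 8
G, b = 0b00000101, 0b11111111            # 8-bit divisor and dividend
G_ext, b_ext = extend_operands(G, b, ext_bits=n)
print(f"{G_ext:0{2 * n}b}")              # 0000010100000000
print(f"{b_ext:0{2 * n}b}")              # 0000000011111111
print(G_ext > b_ext)                     # True
```

With n extension bits, the shifted divisor exceeds any n-bit dividend as long as G is at least 1, which is why a full 2n-bit width always satisfies the ordering condition.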

FIG. 7 is a block diagram showing the configuration of the division circuits 32-1 to 32-4 (32) of FIG. 6. The division circuit 32 of FIG. 7 is an architecture based on the restoring method. In FIG. 7, the division circuit 32 comprises a multiplexer 33, two registers 34 and 35, a subtractor 36, and a multiplexer 37. The multiplexers 33 and 37 switch on the basis of the clock CL so as to alternately select one of their input data. The matrix G' is input to the subtractor 36 via the register 35, while the vector b' is input to the subtractor 36 via the multiplexer 33 and the register 34. The subtractor 36 subtracts the data from the register 35 from the data from the register 34, outputs the one-bit sign of the subtraction result as the sign bit of the partial quotient, and outputs the subtraction result data (excluding the sign) to the multiplexer 37. In accordance with the clock CL, as described above, the multiplexer 37 selects one of its two input data and outputs it to the multiplexer 33.

In the restoring division algorithm, with divisor G and dividend b, the condition G > b must be satisfied. To satisfy this condition, the prior-art architecture must add n extension bits to each of the n-bit divisor and the n-bit dividend, giving 2n bits. In the present embodiment, by contrast, setting an upper limit on the optical flow, that is, on the division result, allows every value at or above the upper limit to be rounded to the upper limit (specifically, the number of bits in the division result data from each division circuit 32 is limited), so the number of extension bits can be reduced. When the upper limit of the optical flow is restricted to m, the number of extension bits becomes √(m). In an architecture using the restoring method, the number of bits after extension equals the number of calculation cycles, so the number of calculation cycles can be reduced from 2n to n + √(m). Because the number of bits being operated on decreases, the circuit scale is reduced as well. With the proposed method, the circuit scale of the division circuit 32 using the restoring division algorithm can be reduced by 47%.
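The link between the number of quotient bits and the number of cycles can be seen in a textbook restoring division loop. The sketch below is a generic software model under stated assumptions (unsigned integers, one trial subtraction per cycle, a hypothetical function name `restoring_divide`); it is not a description of the circuit of FIG. 7 and it does not model the extension-bit count given above.

```python
def restoring_divide(dividend, divisor, quotient_bits):
    """Unsigned restoring division, one trial subtraction per cycle.

    Produces `quotient_bits` quotient bits, most significant first, so the
    loop runs exactly `quotient_bits` times. Returns (quotient, remainder).
    """
    if divisor <= 0:
        raise ValueError("divisor must be positive")
    remainder = dividend
    quotient = 0
    for shift in range(quotient_bits - 1, -1, -1):
        trial = remainder - (divisor << shift)   # trial subtraction
        if trial >= 0:
            remainder = trial                    # keep the result, bit = 1
            quotient |= 1 << shift
        # otherwise "restore": leave the remainder unchanged, bit = 0
    return quotient, remainder


print(restoring_divide(7, 2, 4))     # (3, 1): 7 = 2 * 3 + 1, four cycles
print(restoring_divide(100, 3, 2))   # (3, 91): only two cycles
```

In this model, when the true quotient does not fit in `quotient_bits` bits, every trial subtraction succeeds and the quotient comes out as all ones, which behaves like a result capped at 2**quotient_bits - 1. That is a property of this sketch, offered only as intuition for why a bounded division result needs fewer cycles.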

Next, the window interleaving technique of the second proposed method is described. When a hierarchical optical flow is calculated, a conventional optical flow processor cannot start the lower-layer calculation until the upper-layer result is available, so pipeline stalls occur. The proposed method instead uses a window interleaving technique: the upper-layer results of multiple pixels whose results do not affect one another are calculated consecutively, and the lower-layer calculations are performed afterwards. FIG. 8 is a flow diagram showing the calculation timing of each calculation circuit of the OPF calculation processor in pipeline processing according to the prior art. FIG. 9 is a flow diagram showing the calculation timing of each calculation circuit of the OPF calculation processor 20 of FIG. 1 in pipeline processing using the window interleaving method according to the first embodiment. In FIGS. 8 and 9, each rectangle represents the image data in each circuit, where L3 denotes layer 3, p0 denotes pixel column 0, row0 denotes pixel row 0, and so on.

As is apparent from FIGS. 8 and 9, pipeline processing is executed in which the PIC calculation circuit 25, the MV calculator in the MMV calculation circuit 29, the adder in the MMV calculation circuit 29, and the OPF calculation circuit 30 operate in parallel. However, as FIG. 8 shows, in the prior art the next pixel row is not processed until the optical flow u of the current pixel row has been calculated, so wasted "idle periods" occur. In the present embodiment, by contrast, under the control of the sequence controller 21 and the address generator 22 of FIG. 1, the processing of the PIC calculation circuit 25 and the MV calculator of the MMV calculation circuit 29 for the next pixel row starts without waiting for the calculation of the optical flow u of the current pixel row to finish, so no "idle period" occurs. The number of pipeline stall cycles can therefore be reduced compared with the prior art. As a result, the window interleaving technique reduces the required number of cycles by 65% compared with the prior art.
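The effect of interleaving on stalls can be illustrated with a toy scheduling model. Everything in it is an assumption made for illustration: each task (one window at one layer) passes through `stages` pipeline stages of one cycle each, one task can be issued per cycle, and a window's task at a given layer can only be issued once its task at the layer above has finished. The model does not reproduce the 65% figure, which depends on the actual circuits and parameters.

```python
def cycles_sequential(windows, layers, stages):
    """One window at a time: every task waits for the previous one to drain."""
    return windows * layers * stages


def cycles_interleaved(windows, layers, stages):
    """Issue one layer for all windows back to back, then the next layer."""
    finish = {}        # (window, layer) -> cycle at which its result is ready
    next_issue = 0     # earliest cycle at which the pipeline accepts a task
    for layer in range(layers - 1, -1, -1):          # top layer first
        for w in range(windows):
            ready = finish.get((w, layer + 1), 0)    # wait for the layer above
            start = max(next_issue, ready)
            finish[(w, layer)] = start + stages
            next_issue = start + 1
    return max(finish.values())


print(cycles_sequential(8, 3, 4))    # 96
print(cycles_interleaved(8, 3, 4))   # 27
```

With a single window there is nothing to interleave, and both functions return the same count, which matches the intuition that the gain comes from filling stall cycles with independent work.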

As described above, according to the present embodiment, (A) an upper limit is set on the update value of the optical flow, for example by clipping values of 3 pixels or more in the x and y directions; (B) the processing order of pixels in the optical flow calculation is interleaved, which avoids idle periods in the pipeline processing; and (C) the number of extension bits in the division circuit 32 of the OPF calculation circuit 30 is reduced. Together these greatly reduce the cache capacity of the OPF processor 20, the area of the division circuit 32, and the number of processing cycles compared with the prior art. The hardware of the OPF processor 20 can thereby be made lower in power consumption, faster, and smaller in area.

Second embodiment.
FIG. 10 is a diagram showing the calculation method for the first to fourth pixel orders in the optical flow calculation according to the prior art, and FIG. 11 is a diagram showing the calculation method for the first pixel order in the optical flow calculation according to the second embodiment of the present invention. In the hierarchization process executed by the PIC calculation circuit 25 of FIG. 1, the upper-layer image is created by subsampling the lower-layer image. Consequently, the hierarchized image used to obtain the optical flow of the pixel at lower-layer coordinates (2x, 2y) is the same as the one used for lower-layer coordinates (2x+1, 2y), (2x, 2y+1), and (2x+1, 2y+1). In the prior art of FIG. 10, where the screen is scanned row by row from upper left to lower right, the hierarchized images created while processing an even row are therefore created all over again for the following odd row. That is, in FIG. 10 the contents of the internal cache memory that stores the hierarchized image (the internal cache memory in the PIC calculation circuit 25) are updated in turn to hierarchized images A, B, and C, but when the next odd row is processed, hierarchized images A through C must be created again. This duplicated processing wastes cycles, lowering performance and increasing power consumption. Providing a cache memory that stores one row's worth of hierarchized images A, B, and C to avoid this would in turn require an enormous increase in hardware.

In the second embodiment of the present invention, by contrast, the optical flow is obtained after repeated subsampling in, for example, the "Z-shaped" pixel order shown in FIG. 11. Because processing is performed in units of four adjacent pixels that use the same hierarchized image, duplication of the hierarchization process is avoided and the number of cycles can be reduced. FIG. 12 is a diagram showing the first pixel-order calculation method of FIG. 11 for L = 3, and FIG. 13 is a diagram showing it for L = 4. That is, FIGS. 12 and 13 show cases in which the number of layers L is greater than 2, where the layer iteration is performed in units of 4^(L-1) pixels.
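A Z-shaped visiting order over a block of 4^(L-1) pixels can be generated by de-interleaving the bits of a running index (Morton order). The sketch below assumes that the Z pattern nests recursively for L greater than 2; the figures are not reproduced here, so that nesting and the function name `z_order` are assumptions of this illustration.

```python
def z_order(levels):
    """Yield (x, y) over a 2**levels x 2**levels block in Z (Morton) order.

    For L layers, call z_order(L - 1) to cover 4**(L - 1) pixels.
    Even index bits form x, odd index bits form y.
    """
    for index in range(4 ** levels):
        x = y = 0
        for bit in range(levels):
            x |= ((index >> (2 * bit)) & 1) << bit
            y |= ((index >> (2 * bit + 1)) & 1) << bit
        yield x, y


print(list(z_order(1)))        # [(0, 0), (1, 0), (0, 1), (1, 1)]
print(len(list(z_order(2))))   # 16
```

For L = 2 the four pixels (2x, 2y), (2x+1, 2y), (2x, 2y+1), (2x+1, 2y+1) that share one hierarchized image are visited consecutively, which is the property the paragraph above relies on.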

FIGS. 14 to 16 are modifications of FIG. 11, showing the second to fourth pixel-order calculation methods, respectively, in the optical flow calculation according to the second embodiment of the present invention. In FIG. 14, the optical flow is obtained after repeated subsampling in a "U-shaped" pixel order. In FIG. 15, the optical flow is obtained after subsampling in a pixel order shaped like the katakana "コ", that is, a left-right mirrored "C". In FIG. 16, the optical flow is obtained after subsampling in an "X-shaped" pixel order, or one shaped like a left-right mirrored "α". In each case, processing is performed in units of four adjacent pixels that use the same hierarchized image, so duplication of the hierarchization process is avoided and the number of cycles can be reduced.

Next, improvements to the method of propagating the optical flow from an upper layer to a lower layer in the OPF calculation processor 20 according to the second embodiment are described.

Because the hierarchization process of the PIC calculation circuit 25 performs 1/2 subsampling, one pixel in layer L corresponds to the amount of information of four pixels in layer L-1. FIG. 17 is a diagram showing the first optical flow propagation method according to the second embodiment of the present invention. In this propagation method, as shown in FIG. 17, three kinds of interpolated pixels, L(1,0), L(0,1), and L(1,1), corresponding to the coordinates of the four pixels of layer L-1, are created from the one pixel at coordinate L(0,0) in layer L; the optical flow of each is obtained and propagated to layer L-1. That is, this propagation method calculates the optical flow as follows. Here, coordinate L(0,0) denotes pixel coordinate (0,0) of layer L, coordinate L-1(0,0) denotes pixel coordinate (0,0) of layer L-1, and likewise below.

(1) Derivation of the optical flow at L(0,0): it is obtained using the luminance value I(0,0) of L(0,0) and propagated to the lower layer.

(2) Derivation of the optical flow at L(1,0): the luminance value I(1,0) of L(1,0) is interpolated from I(0,0) and the luminance value I(2,0) of L(2,0) by the following equation.
[Equation 6]
I(1,0) = (1-s) I(0,0) + s I(2,0) (10)
Here, s is the interpolation coefficient in the x direction and is set to s = 1/2. Interpolation is needed because I(1,0) does not exist as a result of the 2:1 subsampling. The optical flow is calculated using this luminance value and then propagated to the lower layer.

(3) Derivation of the optical flow at L(0,1): the luminance value I(0,1) is interpolated from I(0,0) and I(0,2).
[Equation 7]
I(0,1) = (1-t) I(0,0) + t I(0,2) (11)
Here, t is the interpolation coefficient in the y direction and is set to t = 1/2.

(4) Derivation of the optical flow at L(1,1): the luminance value I(1,1) is interpolated from I(0,0), I(0,2), I(2,0), and I(2,2).
[Equation 8]
I(1,1)
= (1-s)(1-t) I(0,0) + s(1-t) I(2,0)
+ (1-s)t I(0,2) + st I(2,2) (12)

As described above, according to the first optical flow propagation method, among the optical flows calculated in the upper layer, the optical flows of the two or four pixels adjacent to the coordinate to be calculated are interpolated, and the resulting optical flow is propagated to the lower layer to calculate the lower-layer optical flow.
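The interpolation steps (2) to (4) are all instances of one bilinear formula: Equation (12) reduces to the x-only case of Equation (10) when t = 0 and to the y-only case of step (3) when s = 0. A minimal sketch follows; the function name `bilinear` and the sample luminance values are illustrative and do not come from the source.

```python
def bilinear(i00, i20, i02, i22, s, t):
    """Bilinear interpolation between four luminance samples.

    i00 = I(0,0), i20 = I(2,0), i02 = I(0,2), i22 = I(2,2);
    s and t are the interpolation coefficients in x and y.
    """
    return ((1 - s) * (1 - t) * i00 + s * (1 - t) * i20
            + (1 - s) * t * i02 + s * t * i22)


# Hypothetical luminance values, chosen only to make the arithmetic visible.
i00, i20, i02, i22 = 10.0, 20.0, 30.0, 40.0

print(bilinear(i00, i20, i02, i22, 0.5, 0.0))   # I(1,0) = 15.0
print(bilinear(i00, i20, i02, i22, 0.0, 0.5))   # I(0,1) = 20.0
print(bilinear(i00, i20, i02, i22, 0.5, 0.5))   # I(1,1) = 25.0
```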

FIG. 18 is a diagram showing the second optical flow propagation method according to the second embodiment. In this propagation method, only one interpolated pixel is created, at coordinate L(1/2, 1/2); the optical flow calculation is performed using that one pixel, and the resulting value is propagated in common to the four pixels L-1(0,0), L-1(1,0), L-1(0,1), and L-1(1,1) as the initial value in layer L-1. That is, the center coordinate is taken to be (1/2, 1/2), I(1/2, 1/2) is obtained by interpolation according to the following equation, and the optical flow is calculated from it. That optical flow is used in common for the four pixels of the lower layer.

[Equation 9]
I(1/2, 1/2)
= (1-s)(1-t) I(0,0) + s(1-t) I(2,0)
+ (1-s)t I(0,2) + st I(2,2) (13)
Here, s = t = 1/4.

As described above, according to the second optical flow propagation method, each optical flow calculated in the upper layer is propagated in common to a plurality of pixels in the lower layer and used by them in common as the initial value for calculating the lower-layer optical flow. Here, the optical flow calculated in the upper layer is obtained by dividing the upper-layer image region into units each containing four mutually adjacent pixels and interpolating the optical flows of the four pixels in each unit, which yields the optical flow at the average coordinate of that unit. Interpolated-pixel creation is thereby reduced to the cycles for a single pixel, and the optical flow calculation is likewise needed for only one pixel. Because the optical flow is calculated at a point equidistant from all four pixels, the number of cycles can be reduced while the accuracy of the optical flow is maintained, improving performance.

FIG. 19 is a diagram showing the third optical flow propagation method according to the second embodiment. In this propagation method, the optical flow for one pixel is obtained from the pixel at coordinate L(0,0), and that value is propagated in common to the four pixels L-1(0,0), L-1(1,0), L-1(0,1), and L-1(1,1) as the initial value in layer L-1. That is, the optical flow is obtained using I(0,0) and propagated in common to the four pixels of the lower layer.

As described above, according to the third optical flow propagation method, each optical flow calculated in the upper layer is propagated in common to a plurality of pixels in the lower layer and used by them in common as the initial value for calculating the lower-layer optical flow. Here, the optical flow calculated in the upper layer is the optical flow of one of the four pixels in each unit obtained by dividing the upper-layer image region into units each containing four mutually adjacent pixels. Accordingly, although the accuracy deteriorates slightly, the interpolated-pixel creation process becomes unnecessary, and the optical flow calculations for the other three pixels become unnecessary as well. The number of calculators and the number of cycles can therefore be reduced further, improving performance and reducing hardware size.
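Propagating one upper-layer flow in common to its four lower-layer pixels can be sketched as a simple expansion of a flow field. The function name `propagate_shared` is hypothetical, and the scale factor of 2 applied when moving to the finer layer is an assumption taken from the usual pyramidal Lucas-Kanade formulation for 1/2 subsampling; the passage above does not state it.

```python
def propagate_shared(upper_flow, scale=2.0):
    """Give each lower-layer pixel the flow of its parent upper-layer pixel.

    upper_flow: list of rows, each row a list of (dx, dy) tuples.
    scale: factor applied when moving to the finer layer (assumed to be 2).
    Returns a field with twice as many rows and columns, to be used as the
    initial value for the lower-layer calculation.
    """
    lower = []
    for row in upper_flow:
        expanded = []
        for dx, dy in row:
            child = (scale * dx, scale * dy)
            expanded.extend([child, child])   # pixels (2x, .) and (2x+1, .)
        lower.append(expanded)                # row 2y
        lower.append(list(expanded))          # row 2y+1
    return lower


upper = [[(0.5, -1.0), (1.5, 0.0)]]
lower = propagate_shared(upper)
print(lower[0])   # [(1.0, -2.0), (1.0, -2.0), (3.0, 0.0), (3.0, 0.0)]
print(lower[1] == lower[0])   # True
```

The same expansion applies to the second propagation method; only the way the single upper-layer flow is obtained differs (from an interpolated pixel rather than from I(0,0)).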

As described above, according to the present embodiment, the hierarchization process of the PIC calculation circuit 25 is performed in units of four adjacent pixels that use the same hierarchized image, so duplication of the hierarchization process is avoided and the number of cycles can be reduced. In addition, because the optical flow obtained in the upper layer is propagated in common to a plurality of lower-layer pixels to calculate the lower-layer optical flow, the cycles for extrapolating the optical flow and the associated hardware, such as the calculation circuit for it, become unnecessary. Compared with the prior art, the optical flow calculation time can therefore be shortened and the hardware of the OPF calculation processor can be made smaller.

As described in detail above, according to the image processing apparatus and method of the present invention, when the calculated optical flow is propagated from an upper layer to a lower layer, the optical flow is updated subject to a predetermined upper limit, which restricts the movable range of the optical flow to a predetermined range. Preferably, after a multiplication is performed on the calculated spatial luminance gradient matrix and mismatch vector, extension bits in the result of the division are deleted when the division is performed using the restoring division algorithm. More preferably, the hierarchization calculation step and the optical flow calculation step execute pipeline processing using a predetermined plurality of calculation circuits, and calculation of the optical flow for the next region starts without waiting for the calculation of the optical flow for the current region of the image data to finish. The cache capacity of the calculation processor, the area of the division circuit, and the number of processing cycles can thereby be greatly reduced compared with the prior art, so the hardware of the calculation processor can be made lower in power consumption, faster, and smaller in area. Consequently, the optical flow can be calculated in a shorter time than with the prior art, and the hardware of an optical flow calculation processor can be made substantially smaller.

FIG. 1 is a block diagram showing the detailed configuration of the OPF calculation processor 20 according to the first embodiment of the present invention.
FIG. 2 is a diagram of an image frame 100 showing that a limited movable range 101 is provided for the image window B in the optical flow calculation executed by the OPF calculation processor 20 of FIG. 1.
FIG. 3 is a flowchart showing the optical flow calculation process based on the PLK algorithm, executed by the OPF calculation processor 20 of FIG. 1.
FIG. 4 is a diagram showing simulation results for the OPF calculation processor 20 according to the prior art and the first embodiment, giving the mean absolute error (MAE) for each parameter.
FIG. 5 is a diagram showing simulation results for the OPF calculation processor 20 according to the prior art and the first embodiment, giving the mean squared error (MSE) for each parameter.
FIG. 6 is a block diagram showing the configuration of the OPF calculation circuit 30 of FIG. 1.
FIG. 7 is a block diagram showing the configuration of the division circuits 32-1 to 32-4 (32) of FIG. 6.
FIG. 8 is a flow diagram showing the calculation timing of each calculation circuit of the OPF calculation processor in pipeline processing according to the prior art.
FIG. 9 is a flow diagram showing the calculation timing of each calculation circuit of the OPF calculation processor 20 of FIG. 1 in pipeline processing using the window interleaving method according to the first embodiment.
FIG. 10 is a diagram showing the pixel-order calculation method in the optical flow calculation according to the prior art.
FIG. 11 is a diagram showing the first pixel-order calculation method in the optical flow calculation according to the second embodiment of the present invention.
FIG. 12 is a diagram showing the first pixel-order calculation method of FIG. 11 (when L = 3).
FIG. 13 is a diagram showing the first pixel-order calculation method of FIG. 11 (when L = 4).
FIG. 14 is a diagram showing the second pixel-order calculation method in the optical flow calculation according to the second embodiment of the present invention.
FIG. 15 is a diagram showing the third pixel-order calculation method in the optical flow calculation according to the second embodiment of the present invention.
FIG. 16 is a diagram showing the fourth pixel-order calculation method in the optical flow calculation according to the second embodiment of the present invention.
FIG. 17 is a diagram showing the first optical flow propagation method according to the second embodiment of the present invention.
FIG. 18 is a diagram showing the second optical flow propagation method according to the second embodiment of the present invention.
FIG. 19 is a diagram showing the third optical flow propagation method according to the second embodiment of the present invention.
FIG. 20 is a diagram showing hierarchization in the optical flow calculation according to the prior art.
FIG. 21 is a diagram showing an example of filtering in the optical flow calculation according to the prior art.
FIG. 22 is a flowchart showing the optical flow calculation process based on the PLK algorithm according to the prior art.

Explanation of symbols

10 ... CPU,
11 ... image memory,
12 ... calculation result data memory,
13 ... interface,
14 ... display,
20 ... OPF calculation processor,
21 ... sequence controller,
22 ... address generator,
23 ... frame A memory,
24 ... frame B memory,
25 ... PIC calculation circuit,
26 ... hierarchized image A memory,
27 ... hierarchized image B memory,
28 ... SGM calculation circuit,
29 ... MMV calculation circuit,
30 ... OPF calculation circuit,
31 ... 32-bit multiplication circuit,
32-1 to 32-4 ... division circuits,
33, 37 ... multiplexers,
34, 35 ... registers,
36 ... subtractor,
100 ... image frame.

Claims

An image processing apparatus comprising:
hierarchization calculation means for generating hierarchized image data by performing subsampling and filtering on image data of two frames; and
optical flow calculation means for calculating a spatial luminance gradient matrix and a mismatch vector based on the hierarchized image data generated by the hierarchization calculation means, and calculating an optical flow based on the calculated spatial luminance gradient matrix and mismatch vector,
wherein the optical flow calculation means, when propagating the calculated optical flow from an upper layer to a lower layer, updates the optical flow subject to a predetermined upper limit, thereby restricting the movable range of the optical flow to a predetermined range, and
wherein the hierarchization calculation means and the optical flow calculation means execute pipeline processing using a predetermined plurality of calculation circuits, and start calculating the optical flow of a next region without waiting for the calculation of the optical flow of a predetermined region of the image data, for which the optical flow is being calculated, to finish.

The image processing apparatus according to claim 1, wherein the optical flow calculation means performs a multiplication operation on the calculated spatial luminance gradient matrix and mismatch vector and then, when executing a division operation using a restoring division algorithm, deletes the extension bits in the result of the division operation.

An image processing method executed by an arithmetic processor device, the method comprising:
a hierarchization calculation step of generating hierarchical image data by performing subsampling and filtering on the image data of two frames; and
an optical flow calculation step of calculating a spatial luminance gradient matrix and a mismatch vector based on the hierarchical image data generated in the hierarchization calculation step, and calculating an optical flow based on the calculated spatial luminance gradient matrix and mismatch vector,
wherein, in the optical flow calculation step, when the calculated optical flow is propagated from an upper layer to a lower layer, the optical flow is updated while being limited to a predetermined upper limit value, thereby limiting the range of motion of the optical flow to a predetermined range of motion, and
wherein the hierarchization calculation step and the optical flow calculation step execute pipeline processing using a plurality of predetermined calculation circuits, and start calculating the optical flow of the next region of the image data without waiting for the calculation of the optical flow of a predetermined region, for which the optical flow is calculated, to finish.
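For illustration only, the limited propagation recited above might be sketched as follows. This is a minimal sketch under stated assumptions, not the patented circuit: the function name `propagate_flow`, the factor of 2 between layers (taken from the usual pyramidal Lucas and Kanade scheme with 2:1 subsampling), and the symmetric limit of plus or minus `limit` are all assumptions introduced here.

```python
def propagate_flow(guess, residual, limit):
    """Carry a flow estimate from an upper (coarser) layer to the next lower layer.

    guess    -- (gx, gy) initial flow used on the upper layer
    residual -- (dx, dy) flow refinement computed on the upper layer
    limit    -- hypothetical upper bound on each flow component, in pixels
    """
    def clamp(value):
        # Restrict the component to the assumed range [-limit, +limit].
        return max(-limit, min(limit, value))

    gx, gy = guess
    dx, dy = residual
    # Assumption: each lower layer has twice the resolution, so the flow doubles.
    return (clamp(2 * (gx + dx)), clamp(2 * (gy + dy)))


if __name__ == "__main__":
    # The y component would be -8 without the limit; it is held at -4.
    print(propagate_flow((1, -1), (0.5, -3), limit=4))
```

Bounding the propagated flow also bounds the image area that the lower layer has to read, which is one plausible reason such a limit suits a fixed-size hardware memory.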

The image processing method according to claim 3, wherein, in the optical flow calculation step, a multiplication operation is performed on the calculated spatial luminance gradient matrix and mismatch vector and then, when a division operation is executed using a restoring division algorithm, the extension bits in the result of the division operation are deleted.
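As a rough illustration of the division recited above, the following sketch implements textbook unsigned restoring division, one quotient bit per iteration. It is not taken from the patent: the names `restoring_divide` and `drop_extension_bits` are hypothetical, and since this section does not define the extension bits, treating their deletion as keeping only the low `keep_bits` bits of the result is an assumption.

```python
def restoring_divide(dividend, divisor, bits):
    """Unsigned restoring division; returns (quotient, remainder)."""
    if divisor == 0:
        raise ZeroDivisionError("divisor must be non-zero")
    remainder = 0
    quotient = 0
    for i in range(bits - 1, -1, -1):
        # Shift the next dividend bit into the partial remainder.
        remainder = (remainder << 1) | ((dividend >> i) & 1)
        trial = remainder - divisor
        if trial >= 0:
            # Subtraction succeeded: keep it and set this quotient bit.
            remainder = trial
            quotient |= 1 << i
        # Otherwise the partial remainder is left as it was (the "restore").
    return quotient, remainder


def drop_extension_bits(value, keep_bits):
    """Assumed reading of deleting extension bits: keep the low keep_bits bits."""
    return value & ((1 << keep_bits) - 1)


if __name__ == "__main__":
    q, r = restoring_divide(100, 7, bits=8)
    print(q, r)
    print(drop_extension_bits(0x1FF, 8))
```

Each loop iteration corresponds to one shift-and-subtract stage, which is why this scheme maps naturally onto a small hardware divider.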