JP4690971B2

JP4690971B2 - Shape estimation apparatus and shape estimation program

Info

Publication number: JP4690971B2
Application number: JP2006234613A
Authority: JP
Inventors: 俊彦三須; 正樹高橋; 昌秀苗村; 真人藤井; 伸行八木
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2006-08-30
Filing date: 2006-08-30
Publication date: 2011-06-01
Anticipated expiration: 2026-08-30
Also published as: JP2008059224A

Abstract

<P>PROBLEM TO BE SOLVED: To provide a three-dimensional shape estimation device and its program capable of reducing an amount of operations and an amount of memory and applying to an object which can be moved or deformed. <P>SOLUTION: The shape estimation device 10 for estimating a three-dimensional shape of the target object by a particle filter method comprises a particle filter means 3 having a silhouette extracting means 2<SB>1</SB>-2<SB>N</SB>for extracting silhouettes S<SB>1</SB>-S<SB>N</SB>of the target objects from input images I<SB>1</SB>-I<SB>N</SB>, a particle creating means 21 for creating the designated number of particles which are two or more in a searching range D, a weighted operation means for changing a weighting factor of the particle on the basis of the silhouettes S<SB>1</SB>-S<SB>N</SB>, a re-extracting means for re-extracting the particles on the basis of the weighting factor, and a state transition means for transiting a state vector of the re-extracting particles, and a metaball creating means 4 for creating a three-dimensional model of a metaball expression from the designated number of particles. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、オブジェクトを撮影して生成された画像中の映像オブジェクトに基づいて、オブジェクトの３次元形状を推定する形状推定装置及びそのプログラムに関する。 The present invention relates to a shape estimation apparatus and a program for estimating a three-dimensional shape of an object based on a video object in an image generated by photographing the object.

従来、実写画像から対象物体の３次元形状をモデリングする場合には、ステレオマッチ法、視体積交差法もしくはそれらを複合させた手法により形状推定を行っている。これらの手法では、対象物体の周囲に複数のカメラを設置したり、１台のカメラを移動させたり、あるいは対象物体を回転台などで移動させたりすることによって、カメラと対象物体との間の相対位置を変化させて撮影した画像を用いている。 Conventionally, when modeling a three-dimensional shape of a target object from a photographed image, shape estimation is performed by a stereo match method, a view volume intersection method, or a method combining them. In these methods, a plurality of cameras are installed around the target object, one camera is moved, or the target object is moved on a turntable or the like. An image taken by changing the relative position is used.

ステレオマッチ法では、異なる視点の画像間の各局所領域に対し、例えば、画素値の相互相関を計算することにより、点対応を求める。点対応は、視差量、奥行きあるいは３次元座標に変換することが可能である。これら３次元座標の相互間を網状に接続することで、ポリゴン表現による３次元形状を得ることが可能である。 In the stereo match method, for each local region between images of different viewpoints, for example, a point correspondence is obtained by calculating a cross-correlation of pixel values. Point correspondence can be converted into parallax amount, depth, or three-dimensional coordinates. By connecting these three-dimensional coordinates to each other in a net shape, it is possible to obtain a three-dimensional shape by polygon representation.

視体積交差法は、ボクセルカービングとも呼ばれる。３次元空間に仮想的にボクセルを配置し、対象物体の画像上における２次元領域と視点とを通る錐体の内部に属するボクセルのみを削り出す操作を、複数の視点に対し適用することで、ボクセル表現による３次元形状が得られる。 The view volume intersection method is also called voxel carving. By applying voxels virtually in a three-dimensional space and scraping only the voxels belonging to the inside of a cone passing through the two-dimensional region and the viewpoint on the target object image, to a plurality of viewpoints, A three-dimensional shape by voxel expression is obtained.

また、視体積交差法の一種として、３次元空間における前記した錐体の内部に対して、視点毎に投票操作を行う手法がある（特許文献１）。この手法には、得票数の多寡に応じて物体表面を抽出し、これをポリゴン表現等へ変換する手法も含まれている。 As a kind of view volume intersection method, there is a method of performing a voting operation for each viewpoint on the inside of the above-mentioned cone in a three-dimensional space (Patent Document 1). This method includes a method of extracting an object surface according to the number of votes and converting it to a polygon representation or the like.

さらに、視体積交差法に基づき対象物体の概形を得た後、ステレオマッチにより詳細形状を得る手法がある（特許文献２）。この手法では、ステレオマッチのための探索領域の削減、ステレオマッチにおける誤対応の影響の軽減、そして視体積交差法では推定不能な凹形状への対応が可能となっている。 Furthermore, there is a method of obtaining a detailed shape by stereo matching after obtaining the rough shape of the target object based on the visual volume intersection method (Patent Document 2). With this method, it is possible to reduce the search area for stereo matching, reduce the influence of erroneous correspondence in stereo matching, and cope with concave shapes that cannot be estimated by the visual volume intersection method.

また、生物や機械などのモデリングには、関節や筋肉などの配置や運動の拘束条件（キネマティクス）を前提条件に用いるモデルベースの形状推定手法もある。この場合、画像上の対象物体のシルエットと適合するように、関節角などのパラメータを調整することで物体の形状が得られる。モデルベースの手法は、プリミティブ表現のほか、ポリゴン表現やメタボール表現などさまざまな表現形式への適用が考えられている。 There is also a model-based shape estimation method that uses the arrangement of joints and muscles and the constraint condition of motion (kinematics) as a precondition for modeling of organisms and machines. In this case, the shape of the object can be obtained by adjusting parameters such as the joint angle so as to match the silhouette of the target object on the image. The model-based method is considered to be applied to various expression formats such as polygon expression and metaball expression in addition to primitive expression.

一方、状態量の任意の統計分布を効率的に記述できるパーティクルフィルタを用いて、物体の位置を推定する手法が知られている。このパーティクルフィルタでは、物体の位置などを表す状態量（パーティクル又は粒子とよぶ）が複数存在し、これらのパーティクルのうち、物体の観測結果に合致するパーティクルの重みを増加させ、最終的には全パーティクルの状態の期待値を計算することで、物体の位置が推定される（例えば、特許文献３）。また、パーティクルフィルタでは、状態遷移（パーティクルの状態量の時間変化）が、状態量の確率密度により定義できるため、非常に複雑なダイナミクスをモデリングすることが可能である。
特開平１０−１２４７０４号公報（段落００３９、図３）特開２００３−２７１９２８号公報（段落００５０〜段落００６１、図７、図８）特開２００５−４４３５２号公報（段落００８６〜段落００９０、図５） On the other hand, a technique for estimating the position of an object using a particle filter that can efficiently describe an arbitrary statistical distribution of state quantities is known. In this particle filter, there are multiple state quantities (referred to as particles or particles) that represent the position of an object, and among these particles, the weight of particles that match the observation result of the object is increased, and eventually all By calculating the expected value of the particle state, the position of the object is estimated (for example, Patent Document 3). In the particle filter, since state transition (time change of the state quantity of particles) can be defined by the probability density of the state quantity, it is possible to model very complicated dynamics.
Japanese Patent Laid-Open No. 10-124704 (paragraph 0039, FIG. 3) Japanese Patent Laying-Open No. 2003-271928 (paragraphs 0050 to 0061, FIGS. 7 and 8) Japanese Patent Laying-Open No. 2005-44352 (paragraph 0086 to paragraph 0090, FIG. 5)

しかしながら、ステレオマッチ法では、十分な分解能のモデルを得るためには数多くの点対応を求める必要があり、演算コストが少なくなかった。また、点対応の精度が形状精度に大きく影響するため、対象物体にテクスチャが少ない場合には形状誤差が大きくなるという問題があった。 However, in the stereo match method, in order to obtain a model with sufficient resolution, it is necessary to obtain a large number of point correspondences, and the calculation cost is not small. In addition, since the accuracy of point correspondence greatly affects the shape accuracy, there is a problem that the shape error increases when the target object has few textures.

一方、視体積交差法では、十分な分解能のモデルを得るためにはボクセルを細かくする必要があり、膨大なメモリ領域が必要であった。また、視点毎に錐体状の領域を切り出す演算操作も、各ボクセルの錐体状の領域に対する内外判定を要するため、演算コストも大きかった。 On the other hand, in the view volume intersection method, in order to obtain a model with sufficient resolution, it is necessary to make the voxel finer, and an enormous memory area is required. In addition, the calculation operation for cutting out the cone-shaped region for each viewpoint also requires calculation inside / outside of the cone-shaped region of each voxel, and thus the calculation cost is high.

モデルベースの手法では、対象物体の取り得る形状を十分に記述できるだけの関節等のモデル化が必要であるため、対象物体が限定されていた。また、対象物体が想定外の３次元形状をとった場合や、入力画像にノイズや乱れが生じた場合に、形状推定が不可能になるおそれもあった。 In the model-based method, modeling of joints or the like that can sufficiently describe the shapes that can be taken by the target object is necessary, so that the target object is limited. In addition, when the target object has an unexpected three-dimensional shape, or when noise or disturbance occurs in the input image, there is a possibility that shape estimation may become impossible.

また、パーティクルフィルタの手法は、例えば、ボールなどの移動する物体の重心位置を追跡する用途、経済指標等の動的な機構のパラメータ学習、このような指標等の予測、２次元形状輪郭の追跡などに用いられているが、メタボール表現による３次元形状をパーティクルの状態から生成する用途には適用されていなかった。 In addition, the particle filter method includes, for example, the use of tracking the center of gravity of a moving object such as a ball, dynamic mechanism parameter learning such as an economic index, prediction of such an index, etc., tracking of a two-dimensional shape contour However, it has not been applied to the use of generating a three-dimensional shape by metaball expression from the state of particles.

本発明は、このような課題を解決するためになされたもので、演算量やメモリ消費量を低減する３次元の形状推定装置及びその方法を提供することを目的とする。 The present invention has been made to solve such a problem, and an object of the present invention is to provide a three-dimensional shape estimation apparatus and method for reducing the amount of calculation and memory consumption.

そのために、請求項１に記載の形状推定装置は、オブジェクトが撮影された画像中の映像オブジェクトに基づき、３次元座標を含む状態ベクトルと重み係数とを有した情報であるパーティクルを用いたパーティクルフィルタによって、オブジェクトの３次元形状を推定する形状推定装置であって、シルエット抽出手段と、パーティクルフィルタ手段と、メタボール生成手段とを備え、前記パーティクルフィルタ手段は、パーティクル生成手段と、重み係数変更手段と、再サンプリング手段と、状態遷移手段とを有する構成とした。 For this purpose, the shape estimation apparatus according to claim 1 is a particle filter using particles, which are information having a state vector including three-dimensional coordinates and a weighting coefficient, based on a video object in an image in which the object is photographed. Is a shape estimation device for estimating a three-dimensional shape of an object, comprising a silhouette extraction unit, a particle filter unit, and a metaball generation unit, wherein the particle filter unit includes a particle generation unit, a weighting factor changing unit, The re-sampling means and the state transition means are provided.

かかる構成によれば、形状推定装置は、シルエット抽出手段によって、入力された画像に含まれる映像オブジェクトの領域をシルエットとして抽出する。また、形状推定装置は、パーティクルフィルタ手段のパーティクル生成手段によって、オブジェクトを探索する予め定められた探索領域内を示す３次元座標を有する、２以上である所定数のパーティクルを生成する。パーティクル生成手段で生成された所定数のパーティクルは、重み係数変更手段によって、シルエット抽出手段で抽出されたシルエットに基づいて、重み係数が変更される。 According to such a configuration, the shape estimation apparatus extracts the region of the video object included in the input image as a silhouette by the silhouette extraction unit. In addition, the shape estimation apparatus generates a predetermined number of particles equal to or greater than 2 having three-dimensional coordinates indicating a predetermined search area in which an object is searched by the particle generation unit of the particle filter unit. The weight coefficient of the predetermined number of particles generated by the particle generating means is changed by the weight coefficient changing means based on the silhouette extracted by the silhouette extracting means.

ここで、シルエット（画像）の画素値は、映像オブジェクトの領域らしさが高いほど大きな値（例えば、“１”に近い値）を有し、映像オブジェクトの領域らしさが低いほど小さな値（例えば、“０”に近い値）を有する。そして、パーティクルの重み係数は、例えば、当該パーティクルの３次元位置座標の入力画像の画像座標への投影変換位置におけるシルエットの画素値を、元の重み係数に乗算した値に更新される。そのため、パーティクルの位置に対応するシルエットが、映像オブジェクトらしくないほど、その重み係数は小さな値に変更される。 Here, the pixel value of the silhouette (image) has a larger value (for example, a value closer to “1”) as the region likelihood of the video object is higher, and a smaller value (for example, “ A value close to 0 ″). Then, for example, the particle weighting coefficient is updated to a value obtained by multiplying the original weighting coefficient by the pixel value of the silhouette at the projection conversion position of the three-dimensional position coordinate of the particle to the image coordinate of the input image. Therefore, the weight coefficient is changed to a smaller value so that the silhouette corresponding to the position of the particle does not look like a video object.

重み係数変更手段で重み係数を変更されたパーティクルは、再サンプリング手段によって、３次元座標について再サンプリングして同じ個数のパーティクルを再生成すると共に、それらのパーティクルの重み係数を均一にする。次に、状態遷移手段によって、再サンプリング手段によって再サンプリングされたパーティクルの状態ベクトルを遷移させる。
そして、パーティクルフィルタ手段によって、重み係数変更手段と再サンプリング手段と状態遷移手段とによるパーティクルに対する処理を繰り返し行う。 The particles whose weighting coefficient has been changed by the weighting coefficient changing means resample the three-dimensional coordinates by the resampling means to regenerate the same number of particles, and make the weighting coefficients of those particles uniform. Next, the state transition unit causes the state vector of the particles resampled by the resampling unit to transition.
Then, the particle filter means repeatedly performs processing on the particles by the weight coefficient changing means, the resampling means, and the state transition means.

また、メタボール生成手段によって、パーティクルフィルタ手段で処理された所定数のパーティクルに基づいて、メタボール表現による３次元形状モデルを生成する。
これによって、形状推定装置は、パーティクルフィルタ手段において使用するパーティクルの数に応じた分解能、精度、メモリ消費量及び演算量において、オブジェクトの形状情報を取得し、メタボール表現への変換によって当該オブジェクトの形状を推定することができる。 In addition, the metaball generation unit generates a three-dimensional shape model based on the metaball expression based on a predetermined number of particles processed by the particle filter unit.
Thereby, the shape estimation device acquires object shape information with resolution, accuracy, memory consumption, and calculation amount according to the number of particles used in the particle filter means, and converts the object shape into a metaball representation. Can be estimated.

請求項２に記載の形状推定装置は、請求項１に記載の形状推定装置において、前記メタボール生成手段は、前記重み係数変更手段によって重み係数が変更された前記所定数のパーティクルに基づいて、メタボール表現による３次元形状モデルを生成するように構成した。 The shape estimation device according to claim 2 is the shape estimation device according to claim 1, wherein the metaball generation unit is configured to generate a metaball based on the predetermined number of particles whose weighting factors are changed by the weighting factor changing unit. A three-dimensional shape model by expression is generated.

かかる構成によれば、重み係数変更手段から出力される所定数のパーティクルは、それぞれオブジェクト内に分散された位置におけるオブジェクトが存在する確からしさに応じた重み係数を有しており、より正確にオブジェクトの形状に関する情報を有している。そのため、形状推定装置は、メタボール生成手段によって、各パーティクルを重み係数に応じた半径のメタボールに変換してオブジェクトの３次元形状を推定する。
これによって、オブジェクトの形状を精度よく推定することができる。 According to such a configuration, each of the predetermined number of particles output from the weighting factor changing unit has a weighting factor corresponding to the probability that the object exists at a position dispersed in the object, and more accurately It has information about the shape. Therefore, the shape estimation device estimates the three-dimensional shape of the object by converting each particle into a metaball having a radius corresponding to the weighting coefficient by the metaball generation unit.
Thereby, the shape of the object can be estimated with high accuracy.

請求項３に記載の形状推定装置は、請求項１に記載の形状推定装置において、前記メタボール生成手段は、前記再サンプリング手段によって再サンプリングされた前記所定数のパーティクルに基づいて、メタボール表現による３次元形状モデルを生成するように構成した。 The shape estimation apparatus according to claim 3 is the shape estimation apparatus according to claim 1, wherein the metaball generation unit is configured to perform metaball representation 3 based on the predetermined number of particles resampled by the resampler. It was configured to generate a dimensional shape model.

かかる構成によれば、再サンプリング手段から出力される所定数のパーティクルは、重み係数が均一で、オブジェクトが存在する確からしさに応じて個数に変換された分布を有している。そのため、形状推定装置は、メタボール生成手段によって、各パーティクルを均一な半径のメタボールに変換してオブジェクトの３次元形状を推定する。
これによって、オブジェクトの形状の推定に必要な演算量を低減することができる。 According to such a configuration, the predetermined number of particles output from the resampling means have a distribution in which the weight coefficient is uniform and is converted into the number according to the probability that the object exists. Therefore, the shape estimation device estimates the three-dimensional shape of the object by converting each particle into a metaball having a uniform radius by the metaball generation means.
As a result, the amount of calculation required for estimating the shape of the object can be reduced.

請求項４に記載の形状推定装置は、請求項１乃至請求項３の何れか一項に記載の形状推定装置において、前記メタボール生成手段によって生成された前記メタボール表現による３次元形状モデルに対して、収縮又は膨張を行う３次元多値モルフォロジ処理によって３次元形状モデルを修正するモルフォロジ処理手段をさらに備えて構成した。 The shape estimation device according to claim 4 is the shape estimation device according to any one of claims 1 to 3, wherein the shape estimation device according to the metaball expression generated by the metaball generation unit is a three-dimensional shape model. The apparatus further includes a morphology processing means for correcting the three-dimensional shape model by the three-dimensional multi-value morphology processing for contraction or expansion.

かかる構成によれば、モルフォロジ処理手段によって、メタボール生成手段で生成された３次元形状モデルにおいて、例えば、真の形状より拡大あるいは縮小して推定された形状を、収縮あるいは膨張を行う３次元多値モルフォロジ処理を適用して修正する。
これによって、メタボール生成手段による推定形状の誤差を軽減することができる。 According to such a configuration, in the three-dimensional shape model generated by the metaball generation unit by the morphology processing unit, for example, a three-dimensional multivalue that contracts or expands a shape estimated by enlarging or reducing the true shape. Modify by applying morphological processing.
Thereby, the error of the estimated shape by the metaball generating means can be reduced.

請求項５に記載の形状推定装置は、請求項１乃至請求項４の何れか一項に記載の形状推定装置において、さらに１以上のシルエット抽出手段と、当該シルエット抽出手段に対応する重み係数変更手段とを備えるように構成した。 The shape estimation apparatus according to claim 5 is the shape estimation apparatus according to any one of claims 1 to 4, further comprising at least one silhouette extraction unit and a weight coefficient change corresponding to the silhouette extraction unit. Means.

かかる構成によれば、複数のシルエット抽出手段と、対応する重み係数変更手段とによって、例えば、複数の視点から撮影された入力画像に基づき、パーティクルの重み係数を変更する。
これによって、後段の再サンプリング手段によって、パーティクルの存在範囲を効果的に絞り込んで再編成することができ、形状推定の精度および推定値算出における収束速度を向上させることができる。 According to such a configuration, the particle weighting coefficient is changed by the plurality of silhouette extracting units and the corresponding weighting factor changing unit based on, for example, input images taken from a plurality of viewpoints.
As a result, the resampling means at the subsequent stage can effectively narrow down and reorganize the existence range of the particles, thereby improving the accuracy of shape estimation and the convergence speed in calculating the estimated value.

請求項６に記載の形状推定プログラムは、オブジェクトが撮影された画像中の映像オブジェクトに基づき、３次元座標を含む状態ベクトルと重み係数とを有した情報であるパーティクルを用いたパーティクルフィルタによって、前記オブジェクトの３次元形状を推定するために、コンピュータを、シルエット抽出手段、パーティクルフィルタ手段、メタボール生成手段、として機能させる形状推定プログラムであって、前記パーティクルフィルタ手段は、パーティクル生成手段、重み係数変更手段、再サンプリング手段、状態遷移手段、を含むこととした。 The shape estimation program according to claim 6 is based on a video object in an image in which an object is photographed, and a particle filter that uses particles that are information having a state vector including a three-dimensional coordinate and a weighting coefficient. A shape estimation program that causes a computer to function as a silhouette extraction unit, a particle filter unit, and a metaball generation unit in order to estimate a three-dimensional shape of an object, wherein the particle filter unit includes a particle generation unit and a weight coefficient change unit , Re-sampling means, and state transition means.

かかる構成によれば、形状推定プログラムは、シルエット抽出手段によって、入力された画像に含まれる映像オブジェクトの領域をシルエットとして抽出する。また、形状推定プログラムは、パーティクルフィルタ手段のパーティクル生成手段によって、オブジェクトを探索する予め定められた探索領域内を示す３次元座標を有する、２以上である所定数のパーティクルを生成する。パーティクル生成手段で生成された所定数のパーティクルは、重み係数変更手段によって、シルエット抽出手段で抽出されたシルエットに基づいて、重み係数が変更される。重み係数変更手段で重み係数を変更されたパーティクルは、再サンプリング手段によって、３次元座標について再サンプリングして同じ個数のパーティクルを再生成すると共に、それらのパーティクルの重み係数を均一にする。次に、状態遷移手段によって、再サンプリング手段によって再サンプリングされたパーティクルの状態ベクトルを遷移させる。
そして、パーティクルフィルタ手段によって、重み係数変更手段と再サンプリング手段と状態遷移手段とによるパーティクルに対する処理を繰り返し行う。
また、メタボール生成手段によって、パーティクルフィルタ手段で処理された所定数のパーティクルに基づいて、メタボール表現による３次元形状モデルを生成する。
これによって、形状推定装置は、パーティクルフィルタ手段において使用するパーティクルの数に応じた分解能、精度、メモリ消費量及び演算量において、オブジェクトの形状情報を取得し、メタボール表現への変換によって当該オブジェクトの形状を推定することができる。 According to this configuration, the shape estimation program extracts the region of the video object included in the input image as a silhouette by the silhouette extraction unit. In addition, the shape estimation program generates a predetermined number of particles equal to or greater than 2 having three-dimensional coordinates indicating a predetermined search area in which an object is searched by the particle generation unit of the particle filter unit. The weight coefficient of the predetermined number of particles generated by the particle generating means is changed by the weight coefficient changing means based on the silhouette extracted by the silhouette extracting means. The particles whose weighting coefficient has been changed by the weighting coefficient changing means resample the three-dimensional coordinates by the resampling means to regenerate the same number of particles, and make the weighting coefficients of those particles uniform. Next, the state transition unit causes the state vector of the particles resampled by the resampling unit to transition.
Then, the particle filter means repeatedly performs processing on the particles by the weight coefficient changing means, the resampling means, and the state transition means.
In addition, the metaball generation unit generates a three-dimensional shape model based on the metaball expression based on a predetermined number of particles processed by the particle filter unit.
Thereby, the shape estimation device acquires object shape information with resolution, accuracy, memory consumption, and calculation amount according to the number of particles used in the particle filter means, and converts the object shape into a metaball representation. Can be estimated.

請求項１又は請求項６に記載の発明によれば、演算量やメモリ量を低減して、オブジェクトの３次元形状を推定することができる。
請求項２に記載の発明によれば、より正確にオブジェクトの３次元形状を推定することができる。
請求項３に記載の発明によれば、より少ない演算量でオブジェクトの３次元形状を推定することができる。
請求項４に記載の発明によれば、メタボール変換による形状推定の誤差を修正し、より正確にオブジェクトの３次元形状を推定することができる。
請求項５に記載の発明によれば、より精度よく、また、より迅速にオブジェクトの３次元形状を推定することができる。 According to the invention described in claim 1 or claim 6, it is possible to estimate the three-dimensional shape of the object by reducing the calculation amount and the memory amount.
According to invention of Claim 2, the three-dimensional shape of an object can be estimated more correctly.
According to the third aspect of the invention, the three-dimensional shape of the object can be estimated with a smaller amount of calculation.
According to the fourth aspect of the invention, it is possible to correct the shape estimation error due to the metaball transformation and more accurately estimate the three-dimensional shape of the object.
According to the fifth aspect of the present invention, the three-dimensional shape of the object can be estimated more accurately and more quickly.

以下、本発明の実施の形態について適宜図面を参照して詳細に説明する。
［形状推定装置の構成］
まず、図１を参照して、実施形態の形状推定装置１０の構成について説明する。ここで、図１は本発明にかかる実施形態の形状推定装置の構成を示すブロック図である。
図１に示した実施形態の形状推定装置１０は、Ｎ個（Ｎは１以上の整数）の入力手段１（１_１〜１_Ｎ）と、Ｎ個のシルエット抽出手段２（２_１〜２_Ｎ）と、パーティクルフィルタ手段３と、メタボール生成手段４と、モルフォロジ処理手段５と、出力手段６と、を含んで構成されている。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings as appropriate.
[Configuration of shape estimation device]
First, with reference to FIG. 1, the structure of the shape estimation apparatus 10 of embodiment is demonstrated. Here, FIG. 1 is a block diagram showing the configuration of the shape estimation apparatus according to the embodiment of the present invention.
The shape estimation apparatus 10 of the embodiment shown in FIG. 1 includes N input means 1 (1 ₁ to 1 _N ) and N silhouette extraction means 2 (2 ₁ to 2 _N). ), Particle filter means 3, metaball generation means 4, morphology processing means 5, and output means 6.

入力手段１（１_１〜１_Ｎ）は、図示しないＮ台のカメラによって撮影された対象物体（オブジェクト）の入力画像Ｉ_１〜Ｉ_Ｎを、それぞれ対応するシルエット抽出手段２（２_１〜２_Ｎ）に入力する入力インターフェースである。また、この入力手段１_１〜１_Ｎは、それぞれ入力画像Ｉ_１〜Ｉ_Ｎを撮影した際のカメラパラメータＣＰ_１〜ＣＰ_Ｎをパーティクルフィルタ手段３に出力する。ここで、カメラパラメータＣＰ_１〜ＣＰ_Ｎとは、それぞれ入力画像Ｉ_１〜Ｉ_Ｎを撮影した際のカメラの撮影位置の座標を含み、姿勢情報（例えば、パン角、チルト角等で表される撮影方向）、画角情報（例えば、ズーム量等）などである。 The input means 1 (1 ₁ to 1 _N ) converts the input images I _{1 to} I _N of target objects (objects) photographed by N cameras (not shown) into the corresponding silhouette extraction means 2 (2 ₁ to 2 _N). ) Input interface. Further, the input unit ₁ 1 to 1 _N outputs the camera parameters _CP 1 ~ CP _N at the time of photographing the input image _I 1 ~I _N each particle filter unit 3. Here, the camera parameters CP ₁ ~ CP _N, a camera coordinate of the imaging position of the time of photographing the input image I ₁ ~I _N respectively, orientation information (e.g., pan angle is represented by a tilt angle, etc. Shooting direction), angle of view information (for example, zoom amount, etc.).

シルエット抽出手段２（２_１〜２_Ｎ）は、Ｎ箇所の視点で撮影された対象物体（オブジェクト）の入力画像Ｉ_１〜Ｉ_Ｎから対象物体像（映像オブジェクト）の領域を抽出し、シルエット画像（以下、シルエット）Ｓ_１〜Ｓ_Ｎを生成する。シルエットとは、入力画像（例えばＩ_１）を構成する各画素に関して、当該画素が対象物体像に属するか否かを数値的に区別した画像である。
なお、特許請求の範囲の記載において、オブジェクトとは対象物体のことであり、映像オブジェクトとは対象物体を撮影した入力画像中の対象物体像のことである。 The silhouette extraction means 2 (2 _{1 to} 2 _N ) extracts the region of the target object image (video object) from the input images I _{1 to} I _{N of} the target object (object) photographed from N viewpoints, and the silhouette image (Hereinafter, silhouettes) S _{1 to} S _N are generated. A silhouette is an image that numerically distinguishes whether each pixel constituting an input image (for example, I ₁ ) belongs to a target object image.
In the description of the claims, an object is a target object, and a video object is a target object image in an input image obtained by photographing the target object.

例えば、注目する画素が対象物体像上にある場合には値“１”を、対象物体像上にない場合には値“０”を割り当てた２値画像をシルエットとして用いることができる。
また、例えば、注目する画素が対象物体像上にある可能性が高い場合には大きな値（例えば“１”に近い値）を、対象物体像上にない可能性が高い場合には小さな値（例えば“０”に近い値）を設定し、前記の何れとも判別できない場合には中間的な値（例えば“０．５”）を設定した多値の画像であってもよい。 For example, a binary image to which a value “1” is assigned when the pixel of interest is on the target object image and a value “0” is assigned when the pixel is not on the target object image can be used as the silhouette.
In addition, for example, a large value (for example, a value close to “1”) when the pixel of interest is highly likely to be on the target object image, a small value (for example, a value close to “1”) is high. For example, a multi-valued image in which an intermediate value (for example, “0.5”) is set may be used.

以下、ｎ番目（ｎ＝１，２，……，Ｎ）の視点の入力画像をＩ_ｎ、シルエットをＳ_ｎと表記する。また、（ｘ，ｙ）は入力画像における画素位置を表し、ｘは水平座標を、ｙは垂直座標を表す。さらに、画素位置（ｘ，ｙ）における入力画像Ｉ_ｎおよびシルエットＳ_ｎの画素値を、それぞれＩ_ｎ（ｘ，ｙ）およびＳ_ｎ（ｘ，ｙ）と表記する。 Hereinafter, n-th (n = 1,2, ......, N ) the input image of the view of _{I n,} the silhouette is denoted as _{S n.} Further, (x, y) represents a pixel position in the input image, x represents a horizontal coordinate, and y represents a vertical coordinate. Furthermore, the pixel values of the input image _{I n} and silhouettes _{S n} at the pixel position (x, y), respectively _I n (x, y) and _S n (x, y) and notation.

各入力画像Ｉ_ｎを得る手段はそれぞれ任意であり、例えば、デジタルスチルカメラやビデオカメラで電子的に撮影された画像であってもよいし、銀塩写真をイメージスキャナやデジタルスチルカメラ、ビデオカメラなどによって電子化した画像であっても構わない。また、これらの手段を組み合わせて用いてもよい。
入力画像Ｉ_ｎはカラー画像であってもモノクロ画像であっても多バンド画像であってもよいが、以下では各画素が赤、緑及び青の３原色により構成されるカラー画像であるものとして説明する。一方、シルエットＳ_ｎは、各画素が“０”以上“１”以下の値をとる多値画像とし、対象物体像上である可能性が高いほど大きな値をとるものとして説明する。 Each input image I _n means for obtaining is arbitrary respectively, for example, a digital still camera and a electronically captured image may be a video camera, an image scanner or a digital still camera to silver halide photograph, a video camera For example, the image may be digitized. Moreover, you may use combining these means.
As the input image I _n is may be a multi-band image may be a monochrome image may be a color image, the following is a color image composed of the three primary colors of each pixel red, green and blue explain. On the other hand, the silhouette _Sn is assumed to be a multi-valued image in which each pixel has a value of “0” or more and “1” or less, and takes a larger value as the possibility of being on the target object image increases.

シルエット抽出手段２_ｎは、例えば、クロマキー法による対象物体像の抽出手法を用いることができる。この場合は、対象物体を区別可能な色の背景の前に対象物体を配置して撮影し、入力画像Ｉ_ｎを取得する。例えば、青い布など単色の人工的な素材を背景として用いたり、芝生のような単色に近い自然物を背景として用いることができる。クロマキーによる場合、入力画像Ｉ_ｎの画素位置（画像座標）（ｘ，ｙ）における画素値Ｉ_ｎ（ｘ，ｙ）が背景色に類似している場合ほど、対象物体像である可能性が低いため、シルエットＳ_ｎ（ｘ，ｙ）の値を小さくするものとする。 The silhouette extraction unit 2 _n can use, for example, a target object image extraction method by a chroma key method. In this case, it is taken place the object in front of the distinct color of the background of the target object, and acquires the input image I _n. For example, a monochromatic artificial material such as a blue cloth can be used as a background, or a natural object close to a single color such as lawn can be used as a background. If by chroma key, as in the case where the pixel value I _{n (x,} y) at the pixel position of the input image I _n (image coordinates) (x, y) are similar to the background color, it is likely to be the object image Therefore, the value of the silhouette S _n (x, y) is assumed to be small.

ここで、図２を参照して、シルエットＳ_ｎの生成について説明する。図２の（ａ）は、対象物体をカメラで撮影する様子を示した図、（ｂ）はカメラによって撮影された入力画像、（ｃ）はシルエットである。
図２（ａ）に示したように、形状を探索する領域として予め定められた領域Ｄ内に配置された対象物体Ｏ（この例では四角錐）を、カメラＣ_１により側方から、カメラＣ_２により上方から撮影することにより、それぞれ、図２（ｂ）に示したように、対象物体像Ｏ’が含まれる入力画像Ｉ_１及び入力画像Ｉ_２を得ている。これらの入力画像Ｉ_１及び入力画像Ｉ_２に対し、図２（ｂ）の網掛部の色を背景色としてクロマキー処理を行った結果が、図２（ｃ）に示したシルエットＳ_１及びシルエットＳ_２である。図２（ｃ）では、“１”又は“１”に近い大きな値を白で、“０”又は“０”に近い小さな値を黒で表示している。 Here, with reference to FIG. 2, the generation of silhouette S _n. FIG. 2A shows a state where the target object is photographed by the camera, FIG. 2B shows an input image photographed by the camera, and FIG. 2C shows a silhouette.
As shown in FIG. 2 (a), the target object disposed within a predetermined area D as an area for searching a shape O (four-sided pyramid in this example), from the side by the camera C _1, the camera C by photographing from above by _2, respectively, as shown in FIG. 2 (b), to obtain an input image I ₁ and the input image I ₂ including the target object image O '. The result of performing chroma key processing on the input image I ₁ and the input image I _{2 using} the shaded color in FIG. 2B as the background color is the result of the silhouette S ₁ and the silhouette S shown in FIG. ₂ . In FIG. 2C, a large value close to “1” or “1” is displayed in white, and a small value close to “0” or “0” is displayed in black.

また、シルエット抽出手段２_ｎは、例えば、背景差分法により実装してもよい。この場合、シルエット抽出手段２_ｎには、対象物体像がないときの画像を背景画像Ｂ_ｎとして、図示しない記憶手段に予め記憶しておく。そして、入力画像の各画素の画素値Ｉ_ｎ（ｘ，ｙ）が、対応する画素位置の背景画像の画素値Ｂ_ｎ（ｘ，ｙ）に類似している場合ほど、対象物体像である可能性が低いため、シルエットＳ_ｎ（ｘ，ｙ）の値を小さくするものとする。
このほか、入力画像Ｉ_ｎから対象物体像らしさを表すシルエットＳ_ｎが得られる手法であれば、他の手法を用いてもよい。 Further, the silhouette extraction unit 2 _n may be implemented by, for example, the background difference method. In this case, the silhouette extraction unit 2 _n stores an image when there is no target object image as a background image B _n in a storage unit (not shown) in advance. Then, the pixel image I _n (x, y) of each pixel of the input image is more likely to be a target object image as the pixel value B _n (x, y) of the background image at the corresponding pixel position is more similar. Since the property is low, the value of the silhouette S _n (x, y) is assumed to be small.
In addition, as long as it is a method for silhouette S _n is obtained from the input image I _n represents the object image likelihood may use other techniques.

パーティクルフィルタ手段３は、シルエット抽出手段２_１〜２_Ｎから入力されるシルエットＳ_１〜Ｓ_Ｎと、入力手段１_１〜１_Ｎから入力されるカメラパラメータＣＰ_１〜ＣＰ_Ｎを用いて、パーティクルフィルタ手法におけるパーティクル（粒子）に対するフィルタ処理を行い、パーティクル群（粒子群）をメタボール生成手段４に出力する。
ここで、パーティクルとは、３次元座標（位置成分）を含む状態ベクトルと重み係数とを有する情報であり、パーティクルフィルタとは、このようなパーティクルを対象物体内に適宜散布し、複数のパーティクルの集合からなるパーティクル群の状態ベクトルと重み係数とを重ね合わせることで物体の３次元形状を表現する手法である。 Particle filter unit 3, using the silhouette _S 1 to S _N inputted from the silhouette extraction unit ₂ 1 to 2 _N, the camera parameters _CP 1 ~ CP _N input from the input means ₁ 1 to 1 _N, the particle filter Filter processing is performed on particles (particles) in the technique, and a particle group (particle group) is output to the metaball generation unit 4.
Here, the particle is information having a state vector including a three-dimensional coordinate (position component) and a weighting coefficient, and the particle filter appropriately scatters such particles in the target object, This is a technique for expressing a three-dimensional shape of an object by superimposing a state vector of a group of particles and a weighting factor.

次に、図３を参照して、パーティクルフィルタ手段３の詳細な構成について説明する。ここで、図３は、パーティクルフィルタ手段の構成を示したブロック図である。
図３に示したように、パーティクルフィルタ手段３は、パーティクル生成手段２１と、スイッチ２２と、Ｎ個の加重演算手段（重み係数変更手段）２３（２３_１〜２３_Ｎ）と、再サンプリング手段２４と、遅延手段２５と、状態遷移手段２６と、から構成されている。 Next, the detailed configuration of the particle filter means 3 will be described with reference to FIG. Here, FIG. 3 is a block diagram showing the configuration of the particle filter means.
As shown in FIG. 3, the particle filter unit 3 includes a particle generation unit 21, a switch 22, N weight calculation units (weight coefficient changing units) 23 (23 ₁ to 23 _N ), and a resampling unit 24. And a delay means 25 and a state transition means 26.

パーティクル生成手段２１は、パーティクルフィルタ手段３におけるパーティクルの初期化のための手段であり、シルエット抽出手段２_１から入力されるシルエットＳ_１と入力手段１_１を介して入力されるカメラパラメータＣＰ_１とに基づき、対象物体の存在候補領域内に分布する所定数（Ｍ個）のパーティクルを生成するパーティクルの初期化手段である。 Particle generation unit 21 is a means for particles initialization of the particle filter unit 3, a camera parameter CP ₁ inputted via the silhouette S ₁ and the input means 1 ₁ inputted from the silhouette extraction unit 2 ₁ Is a particle initialization means for generating a predetermined number (M) of particles distributed within the target object existence candidate region.

生成するパーティクルの個数Ｍは２以上の整数であり、例えば、Ｍ＝１００００とすることができる。個々のパーティクルは、それぞれ状態ベクトルと重み係数とからなるベクトル量である。以下、ｍ番目（ｍ＝１，２，……，Ｍ）のパーティクルの状態ベクトルをｓ_ｍ、重み係数をｗ_ｍと表記する。 The number M of particles to be generated is an integer greater than or equal to 2, for example, M = 10000. Each particle is a vector quantity composed of a state vector and a weight coefficient. Hereinafter, the state vector of the m-th particle (m = 1, 2,..., M) is expressed as s _m , and the weight coefficient is expressed as w _m .

状態ベクトルｓ_ｍは、その成分に少なくとも３次元空間における座標（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）を含むものとする。状態ベクトルｓ_ｍは、例えば、式（１）に示すように３次元空間における座標（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）のみで構成することができる。 The state vector s _m includes at least coordinates (X _m , Y _m , Z _m ) in a three-dimensional space as its components. For example, the state vector s _m can be configured by only coordinates (X _m , Y _m , Z _m ) in a three-dimensional space as shown in the equation (1).

あるいは、状態ベクトルｓ_ｍは、例えば、式（２）に示すように、３次元空間における座標及び速度からなる６次元のベクトルとしてもよい。式（２）において、・（ドット）は時間微分を表し、右辺の４〜６行目の要素は、それぞれ、ｍ番目のパーティクルのＸ、Ｙ、Ｚ方向の速度を表す。 Alternatively, the state vector s _m, for example, as shown in equation (2) may be a six-dimensional vector consisting of the coordinates and velocities in the three-dimensional space. In the formula (2), • (dot) represents time differentiation, and the elements on the 4th to 6th lines on the right side represent the velocities in the X, Y, and Z directions of the mth particle, respectively.

さらに、状態ベクトルｓ_ｍに、例えば、加速度を導入して９次元のベクトルとしてもよいし、過去の座標、速度、加速度などを導入してもよい。 Furthermore, the state vector s _m, for example, may be a 9-dimensional vector by introducing acceleration, it may be introduced past coordinates, velocity, acceleration and the like.

ここで、図４及び図２を参照してパーティクル生成手段２１の機能について説明する。図４は、パーティクルを生成する様子を模式的に示した図である。
パーティクル生成手段２１は、まず、図２（ａ）に示した例のように、対象物体Ｏが領域Ｄに完全に包含されるように、３次元空間上の閉領域である領域Ｄを設定する。すなわち、領域Ｄは形状推定を行う探索領域である。
そして、式（３）に示すように、シルエットＳ_１に応じて定められる確率密度関数ｐ（Ｘ，Ｙ，Ｚ｜Ｓ_１）に従うＭ個のサンプル（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）（ｍ＝１，２，…，Ｍ）を生成する。 Here, the function of the particle generation means 21 is demonstrated with reference to FIG.4 and FIG.2. FIG. 4 is a diagram schematically showing how particles are generated.
First, the particle generation unit 21 sets a region D that is a closed region in the three-dimensional space so that the target object O is completely included in the region D, as in the example illustrated in FIG. . That is, the region D is a search region for performing shape estimation.
Then, as shown in Expression (3), M samples (X _m , Y _m , Z _m ) (m) according to the probability density function p (X, Y, Z | S ₁ ) determined according to the silhouette S ₁ = 1, 2, ..., M).

（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）〜ｐ（Ｘ，Ｙ，Ｚ｜Ｓ_１） …（３） (X _m , Y _m , Z _m ) to p (X, Y, Z | S ₁ ) (3)

式（３）の確率密度関数ｐ（Ｘ，Ｙ，Ｚ｜Ｓ_１）として、例えば、式（４）の関数を用いることができる。 As the probability density function p (X, Y, Z | S ₁ ) of Expression (3), for example, the function of Expression (4) can be used.

式（４）において、ｆ_ｎ（Ｘ，Ｙ，Ｚ）及びｇ_ｎ（Ｘ，Ｙ，Ｚ）は入力画像Ｉ_ｎを撮影したカメラＣ_ｎ（例えば、図２のカメラＣ_１、Ｃ_２）による投影変換を表す関数である。すなわち、３次元座標（Ｘ，Ｙ，Ｚ）で表される点は、カメラＣ_ｎによって撮影された２次元画像内において、２次元画像座標（ｆ_ｎ（Ｘ，Ｙ，Ｚ），ｇ_ｎ（Ｘ，Ｙ，Ｚ））に変換される。ｆ_ｎ（Ｘ，Ｙ，Ｚ）及びｇ_ｎ（Ｘ，Ｙ，Ｚ）はカメラやレンズの特性、カメラ位置及びカメラ姿勢等のカメラパラメータに応じて決まる関数であり、例えば、ピンホールや一般的なレンズの場合は透視投影変換となり、魚眼レンズの場合は等距離投影変換や正射影変換となる。 In the formula _{(4), f n (X} , Y, Z) and _{g n (X, Y, Z} ) is due to the camera _{C n} obtained by photographing the input image _{I n} (e.g., camera _C 1, _{C 2} in FIG. 2) It is a function representing projection transformation. That is, the point represented by 3-dimensional coordinates (X, Y, Z), in the two-dimensional image captured by the camera _{C n,} 2-dimensional image coordinates _{(f n (X, Y,} Z), g n ( X, Y, Z)). f _n (X, Y, Z) and g _n (X, Y, Z) are functions determined according to camera parameters such as camera and lens characteristics, camera position, and camera posture. In the case of a simple lens, perspective projection conversion is performed, and in the case of a fisheye lens, equidistant projection conversion or orthographic projection conversion is performed.

パーティクルＰは、対象物体Ｏの最大の探索領域である領域Ｄ内であって、かつシルエットＳ_１及びカメラパラメータＣＰ_１から推定される対象物体Ｏの存在候補領域に分布するように生成される。シルエットＳ_１及びカメラパラメータＣＰ_１から推定される対象物体Ｏの存在候補領域とは、カメラＣ_１の第一光学主点（視点）から、シルエットＳ_１の０でない画素を通して観測する方位に含まれる領域のことである。簡単のためにシルエットＳ_１が２値（“０”及び“１”）の場合について説明すると、図４のシルエットＳ_１の白い部分が“０”でない画素、すなわち“１”の画素とすると、（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）は、カメラＣ_１の第一光学主点と画像面上のシルエットＳ_１のＳ_１（ｘ，ｙ）＝１なる領域とを通る錐体状領域（この例では三角錐）の内部にほぼ均一に分布する。ただし、最大の探索領域である領域Ｄ内に限るものとする。
また、パーティクルＰの重み係数ｗ_ｍは、初期値として、すべて均一の値（例えば、１／Ｍ）を設定する。 The particles P are generated so as to be distributed within the region D that is the maximum search region of the target object O and distributed in the existence candidate regions of the target object O estimated from the silhouette S ₁ and the camera parameter CP ₁ . The existence candidate region of the target object O estimated from the silhouette S ₁ and the camera parameter CP ₁ is included in the orientation observed from the first optical principal point (viewpoint) of the camera C ₁ through the non-zero pixel of the silhouette S _1. It is an area. For the sake of simplicity, the case where the silhouette S ₁ is binary (“0” and “1”) will be described. If the white portion of the silhouette S _{1 in} FIG. 4 is a non- “0” pixel, that is, a “1” pixel, (X _m , Y _m , Z _m ) is a cone-shaped region (this one that passes through the first optical principal point of the camera C ₁ and the region S ₁ (x, y) = 1 of the silhouette S ₁ on the image plane. In the example, it is distributed almost uniformly inside the triangular pyramid). However, it is limited to the area D which is the maximum search area.
Further, the weighting coefficient w _m of the particles P is all set to a uniform value (for example, 1 / M) as an initial value.

なお、最初に生成するパーティクルＰは、最大の探索領域である領域Ｄ内に一様に分布するように生成してもよいが、最初に設定する探索領域を、シルエットＳ_１から推定される対象物体の存在領域に限定することで、形状推定のための収束までの繰り返し演算処理の回数を低減することができる。 Incidentally, the particles P generated first, but may be generated so as to be distributed uniformly in the area D is the maximum search area, the search region set initially, is estimated from the silhouette S ₁ object By limiting to the region where the object exists, the number of iterative calculation processes until convergence for shape estimation can be reduced.

スイッチ２２は、パーティクル生成手段２１から入力される初期化されたパーティクルか、後記する加重演算手段２３_１から入力されるフィルタ処理が進んだパーティクルかを選択し、加重演算手段２３_２に出力する。
スイッチ２２は、パーティクルの初期化のための切り替えスイッチであり、形状推定を開始した直後（例えば、対象物体を本装置の前に据え、準備が整った直後）や、使用者の必要に応じて、パーティクル生成手段２１側を、所定の時間だけ閉じて、パーティクル生成手段２１で生成された必要な個数のパーティクルを、加重演算手段２３_２に供給する。これ以外の期間は、加重演算手段２３_１側を閉じて、加重演算手段２３_１によって処理されたパーティクルを、加重演算手段２３_２にそのまま出力する。 Switch 22, either initialized particles supplied from the particle generating means 21, and select whether filtering processing proceeds particles inputted from the weighted calculation unit 23 ₁ below, and outputs the weighted calculation unit 23 _2.
The switch 22 is a change-over switch for initializing particles. Immediately after starting shape estimation (for example, immediately after the target object is placed in front of the apparatus and ready) or according to the user's needs , supplies the particle generation unit 21 side, it closes a predetermined time, the necessary number of particles generated by the particle generator 21, the weighted calculation unit 23 _2. Other periods, closes the weighted calculation unit 23 ₁ side, the particles that have been processed by the weighted arithmetic means 23 _1, directly outputs the weighted calculation unit 23 _2.

加重演算手段（重み係数変更手段）２３（２３_１〜２３_Ｎ）は、各パーティクルの状態ベクトルｓ_ｍがシルエットＳ_１〜Ｓ_Ｎと矛盾しないか評価し（尤度を評価し）、その評価結果に応じて重み係数ｗ_ｍを修正する。
具体的には、式（５）に示したように、パーティクルの３次元座標（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）の画像座標への投影変換位置におけるシルエットＳ_ｎの画素値Ｓ_ｎ（ｆ_ｎ（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ），ｇ_ｎ（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ））を、元の重み係数ｗ_ｍに乗じた値を、変更後の重み係数ｗ_ｍとする。
シルエットＳ_ｎの画素値は、対象物体像（映像オブジェクト）の存在する確からしさ（可能性）が高いほど大きな値（例えば“１”）を有し、低いほど小さな値（例えば“０”）を有するため、対象物体像の存在する確からしさが低い位置に配置されたパーティクルほど、その重み係数ｗ_ｍは小さな値に変換される。 Weighted arithmetic means (weighting coefficient changing means) 23 ₍₂₃ 1 ~ 23 _N), the state vector _{s m} of each particle is evaluated not in conflict with the silhouette _S 1 to S _N (evaluate the likelihood), the evaluation result The weighting factor w _m is modified according to
Specifically, as shown in Expression (5), the pixel value S _n (f _n ) of the silhouette S _n at the projection conversion position of the three-dimensional coordinates (X _m , Y _m , Z _m ) of the particles to the image coordinates. _{_{_{_{(X m, Y m, Z}}}} m), g n (X m, Y m, Z m) to), a value obtained by multiplying the original weighting factor _{w m,} the weighting factor _{w m} after the change.
Pixel values of the silhouette S _n has an existence probability of the target object image (image object) (possibility) is higher the larger the value (e.g. "1"), the lower the smaller value (e.g. "0") Therefore, the weight coefficient w _m is converted to a smaller value as the particle is located at a lower probability that the target object image exists.

ｗ_ｍ ← ｗ_ｍ・Ｓ_ｎ（ｆ_ｎ（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ），ｇ_ｎ（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）） …（５） w _m ← w _m · S _n (f _n (X _m , Y _m , Z _m ), g _n (X _m , Y _m , Z _m )) (5)

但し、（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）＝（状態ベクトルｓ_ｍの位置成分）であり、ｆ_ｎ（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）及びｇ_ｎ（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）は、カメラＣ_ｎによって撮影された入力画像Ｉｎの画像座標への投影変換を行う関数であり、ｆ_ｎ（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）は水平座標、ｇ_ｎ（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）は垂直座標に変換するための関数である。また、←（左矢印）は、右辺の計算結果を左辺の変数に代入することを示す記号である。 _However, a _{_{(X m, Y m, Z}} m) = ( position component of the state vector _{_{_{s m), f n (X}}} m, Y m, Z m) and _{_{_{g n (X m, Y m}}} , Z m) Is a function that performs projection conversion to the image coordinates of the input image In taken by the camera C _n , where f _n (X _m , Y _m , Z _m ) is the horizontal coordinate, g _n (X _m , Y _m , Z _m ) is a function for converting to vertical coordinates. Further, ← (left arrow) is a symbol indicating that the calculation result on the right side is assigned to the variable on the left side.

加重演算手段２３_１は、後記する状態遷移手段２６から入力されるパーティクルに対して、シルエット抽出手段２_１から入力されるシルエットＳ_１及び入力手段１_１を介して入力されるカメラパラメータＣＰ_１に基づいて、各パーティクルの重み係数ｗ_ｍを修正する。そして、重み係数ｗ_ｍを修正したパーティクルをスイッチ２２に出力する。 Weighted arithmetic means 23 _1, to the particles is input from the state transition means 26 to be described later, the camera parameters CP ₁ inputted via the silhouettes S ₁ and the input means 1 ₁ is input from the silhouette extraction unit 2 ₁ Based on this, the weight coefficient w _m of each particle is corrected. Then, the particles whose weight coefficient w _m is corrected are output to the switch 22.

加重演算手段２３_２は、スイッチ２２から入力されるパーティクルに対して、シルエット抽出手段２_２から入力されるシルエットＳ_２及び入力手段１_２を介して入力されるカメラパラメータＣＰ_２に基づいて、各パーティクルの重み係数ｗ_ｍを修正する。そして、重み係数ｗ_ｍを修正したパーティクルを加重演算手段２３_３（不図示）に出力する。 Weighting calculation means 23 _2, to the particles which are inputted from the switch 22, based on the camera parameters CP ₂ inputted via the silhouettes S ₂ and the input unit 1 ₂ is input from the silhouette extraction unit 2 _2, each The particle weight coefficient w _m is corrected. Then, the particles whose weight coefficient w _m is corrected are output to the weight calculation means 23 ₃ (not shown).

以下、同様に、ｎ番目の加重演算手段２３_ｎは、前段の加重演算手段２３_ｎ−１から入力されるパーティクルに対して、シルエット抽出手段２_ｎから入力されるシルエットＳ_ｎ及び入力手段１_ｎを介して入力されるカメラパラメータＣＰ_ｎに基づいて、各パーティクルの重み係数ｗ_ｍを修正し、重み係数ｗ_ｍを修正したパーティクルを、後段の加重演算手段２３_ｎ＋１に出力する。 Hereinafter, similarly, the weighted calculation unit 23 _n of the n-th, to the particles supplied from the weighted calculation unit _{23 n-1} of the preceding stage, silhouettes _{S n} and input means _{1 n} inputted from the silhouette extraction unit _{2 n} The weight coefficient w _m of each particle is corrected based on the camera parameter CP _n input via, and the particle with the corrected weight coefficient w _m is output to the subsequent weight calculation means 23 _{n + 1} .

最後段の加重演算手段２３_Ｎは、重み係数ｗ_ｍを修正したパーティクル（パーティクル群Ｐ_Ａ）を、再サンプリング手段２４に出力すると共に、メタボール生成手段４（図１参照）に出力する。 The last weight calculation means 23 _N outputs the particles (particle group P _A ) whose weight coefficient w _m has been corrected to the resampling means 24 and to the metaball generation means 4 (see FIG. 1).

なお、加重演算手段２３は、複数段を接続する必要はなく、１段のみとしてもよい。また、複数のシルエット抽出手段２_１〜２_Ｎ（図１参照）を備えると共に、複数段の加重演算手段２３_１〜２３_Ｎを接続することで、複数の視点から撮影した入力画像Ｉ_１〜Ｉ_ＮによるシルエットＳ_１〜Ｓ_Ｎに基づいて重み係数ｗ_ｍを変更することができ、後段の再サンプリング手段２４によって、パーティクルの存在範囲を効果的に絞り込んで再編成することができる。そのため、形状推定の精度及び推定値算出における収束速度を向上することができる。 Note that the weight calculation unit 23 does not need to connect a plurality of stages, and may have only one stage. In addition, a plurality of silhouette extraction means 2 ₁ to 2 _N (see FIG. 1) are provided, and input images I _{1 to} I photographed from a plurality of viewpoints are connected by connecting a plurality of stages of weight calculation means 23 ₁ to 23 _N. _The weighting factor w _m can be changed based on the silhouettes S _{1 to} S _N by _N, and the resampling means 24 in the subsequent stage can effectively narrow down and reorganize the existence range of particles. Therefore, the accuracy of shape estimation and the convergence speed in calculating the estimated value can be improved.

ここで、図５を参照して加重演算手段２３の機能について説明する。図５は、パーティクルの重み係数を更新する様子を模式的に示した図である。
図５においては、図４に示したパーティクルＰに対して、加重演算手段２３_２によって、カメラＣ_２から得られた入力画像Ｉ_２（図２参照）によるシルエットＳ_２及びカメラパラメータＣＰ_２に基づいて、重み係数ｗ_ｍの更新を行う様子を表している。 Here, the function of the weight calculation means 23 will be described with reference to FIG. FIG. 5 is a diagram schematically showing how the weighting coefficient of particles is updated.
In FIG. 5, based on the silhouette S ₂ and the camera parameter CP ₂ based on the input image I ₂ (see FIG. 2) obtained from the camera C ₂ by the weight calculation means 23 ₂ for the particle P shown in FIG. This represents how the weighting coefficient w _m is updated.

黒丸（Ｐ_１）及び白丸（Ｐ_２）で示したパーティクルＰは、図４に示した、前の処理までに得られているパーティクル群の３次元の位置を模式的に表している。式（５）に示したように、各パーティクルＰの３次元座標（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）を２次元画像面に投影した像の画像座標（ｆ_２（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ），ｇ_２（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）)におけるシルエットＳ_２の画素値Ｓ_２（ｆ_２（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ），ｇ_２（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）)を、パーティクルＰの重み係数ｗ_ｍに乗じて、新たな重み係数ｗ_ｍとする。 Particles P indicated by black circles (P ₁ ) and white circles (P ₂ ) schematically represent the three-dimensional positions of the particle groups obtained up to the previous processing shown in FIG. As shown in Expression (5), the image coordinates (f ₂ (X _m , Y _m , Z) of the image obtained by projecting the three-dimensional coordinates (X _m , Y _m , Z _m ) of each particle P onto the two-dimensional image plane. _m ), g ₂ (X _m , Y _m , Z _m )), the pixel value S ₂ (f ₂ (X _m , Y _m , Z _m ), g ₂ (X _m , Y _m , Z _m )) of the silhouette S ₂ )) Is multiplied by the weight coefficient w _m of the particle P to obtain a new weight coefficient w _m .

簡単のためにシルエットＳ_２が２値画像の場合について説明すると、パーティクルの３次元座標の画像座標への投影変換位置が、シルエットＳ_２の“０”でない画素上にあるか否かを判定し、シルエットＳ_２の“０”でない画素上にあると判定された黒丸のパーティクルＰ_１に関しては重みｗ_ｍをそのまま変化させずに保つ。一方、シルエットＳ_２の“０”である画素上にあると判定された白丸のパーティクルＰ_２に対しては、ｗ_ｍ×０＝０を新たなｗ_ｍとする。 For the sake of simplicity, the case where the silhouette S ₂ is a binary image will be described. It is determined whether or not the projection conversion position of the three-dimensional coordinates of the particles to the image coordinates is on a pixel other than “0” of the silhouette S _2. For the black circle particle P ₁ determined to be on a pixel that is not “0” in the silhouette S _2, the weight w _m is kept unchanged. On the other hand, for the particles _{P 2} white circles is determined to be on the pixel is a silhouette _{S 2} "0", the _{w m} × 0 = 0 as the new _{w m.}

図３に戻って、パーティクルフィルタ手段３の構成について説明を続ける。
再サンプリング手段２４は、Ｎ個の加重演算手段２３_１〜２３_Ｎを経て、最後段の加重演算手段２３_Ｎから出力されるＭ個のパーティクルを入力し、各パーティクルの重み係数ｗ_ｍに応じて再サンプリング（再編成）して、新たなＭ個のパーティクル（パーティクル群Ｐ_Ｂ）を生成し、遅延手段２５に出力すると共に、メタボール生成手段４（図１参照）に出力する。また、再サンプリング手段２４は、新たに生成するＭ個のパーティクルの重みｗ_ｍを、例えば、すべて１／Ｍとなるようにする。 Returning to FIG. 3, the description of the configuration of the particle filter means 3 will be continued.
The re-sampling means 24 inputs M particles output from the last weight calculation means 23 _N via _N weight calculation means 23 ₁ to 23 _N , and according to the weight coefficient w _m of each particle. Re-sampling (reorganization) generates new M particles (particle group P _B ) and outputs them to the delay means 25 and to the metaball generation means 4 (see FIG. 1). Further, the resampling unit 24 sets the weights w _m of M particles to be newly generated to 1 / M, for example.

ここで、図６を参照して、再サンプリング手段２４の機能について説明する。図６は、再サンプリングの様子を模式的に示した図であり、（ａ）は、再サンプリング前のパーティクルを示し、（ｂ）は、再サンプリング後のパーティクルを示す。 Here, the function of the resampling means 24 will be described with reference to FIG. 6A and 6B are diagrams schematically showing the state of resampling. FIG. 6A shows particles before resampling, and FIG. 6B shows particles after resampling.

図６（ａ）において、丸印は、再サンプリング手段２４に入力された番号ｍが１から１０までのパーティクルを表している。各丸印の座標（ここでは便宜上１次元で表している）が各パーティクルの状態ベクトルｓ_ｍを表し、その大きさが重み係数ｗ_ｍを表している。 In FIG. 6A, the circles indicate the particles with the number m input to the resampling means 24 from 1 to 10. Each circle of coordinates (represented here for convenience one dimension in) represents the state vector s _m of each particle, its magnitude represents a weighting factor w _m.

再サンプリング手段２４では、再サンプリング前のパーティクルの重み係数ｗ_ｍの大きさに応じた個数の当該パーティクルのコピーを生成する。すなわち、パーティクルの重み係数ｗ_ｍが大きいものほど数多くのパーティクルのコピーを生成し、ｗ_ｍが小さいものほどコピーの生成数が少なくするか、あるいは皆無となるようにパーティクルを再編成（再サンプリング）して、図６（ｂ）に示すように、重み係数ｗ_ｍが均一な、新たなパーティクル群を生成する。 The resampling means 24 generates a number of copies of the particles corresponding to the size of the weight coefficient w _m of the particles before resampling. That is, the larger the particle weighting factor w _m is, the more particles are generated, and the smaller the w _m is, the smaller the number of copies is generated, or the particles are reorganized (re-sampling) Then, as shown in FIG. 6B, a new particle group having a uniform weight coefficient w _m is generated.

図６に示した例では、再サンプリング前の１０個のパーティクルついて、ｍ＝５のパーティクルは４個のパーティクルに変換され、ｍ＝４，６のパーティクルは、それぞれ２個のパーティクルに変換され、ｍ＝７，９のパーティクルは、それぞれ１個のパーティクルに変換され、ｍ＝１，２，３，８，１０のパーティクルは消滅し、新たに１０個のパーティクルが再編成されている。 In the example shown in FIG. 6, for 10 particles before re-sampling, m = 5 particles are converted into 4 particles, m = 4, 6 particles are converted into 2 particles, The particles with m = 7 and 9 are each converted into one particle, the particles with m = 1, 2, 3, 8, and 10 disappear, and 10 particles are newly reorganized.

次に、図７を参照して、再サンプリングの手順について説明する。ここで、図７は、再サンプリング手段によるパーティクルの再サンプリングの様子を示した図である。
再サンプリング前のパーティクルの状態ベクトルをｓ_ｍ、重み係数をｗ_ｍとし、再サンプリング後のパーティクルの状態ベクトルをｓ_ｍ ^{（ｎｅｗ）}、重み係数をｗ_ｍ ^{（ｎｅｗ）}とする。ただし、ｍ＝１，２，…，Ｍとする。まず、重み係数ｗ_ｍをｍに関して累積したＷ_ｍを縦軸に、ｍを横軸にとる。原点を始点とし、点（ｍ，Ｗ_ｍ）を順次、線分で結んだ折線グラフを作る。
続いて、 Next, the resampling procedure will be described with reference to FIG. Here, FIG. 7 is a diagram showing how the particles are resampled by the resampling means.
The state vector of the particle before resampling is s _m , the weighting factor is w _m , the state vector of the particle after resampling is s _m ^(new) , and the weighting factor is w _m ^(new) . Here, m = 1, 2,..., M. First, W _{m obtained} by accumulating weighting factors w _m with respect to _m is taken on the vertical axis, and m is taken on the horizontal axis. A line graph is created by connecting the points (m, W _m ) sequentially with line segments, starting from the origin.
continue,

０＜ω_μ≦W_M …（６） 0 <ω _μ ≦ W _M (6)

なる一様乱数ω_μをＭ回発生する。以下、一様乱数をＭ回発生する中のμ番目の試行について説明する。まず、Ｗ_ｍ＝ω_μなる直線と、前記した折線グラフの交点Ω_μを求める。交点Ω_μのｍ座標の小数点以下を切り上げた結果をｍ_μ（上に＾（ハット））とする。
すなわち、 A uniform random number ω _μ is generated M times. Hereinafter, the μ-th trial in which uniform random numbers are generated M times will be described. First, the intersection Ω _μ between the straight line W _m = ω _μ and the above-mentioned line graph is obtained. The result of rounding up the decimal point of the m coordinate of the intersection Ω _μ is defined as m _μ (above (hat)).
That is,

を満たすｍ_μ（上に＾（ハット））を求める。このとき、再サンプリング後のμ番目のパーティクルの状態ベクトルを式（８）とし、 Find m _μ (upper (hat)) that satisfies. At this time, the state vector of the μ-th particle after re-sampling is expressed by Equation (8),

その重み係数を式（９）とする。 The weighting coefficient is represented by equation (9).

式（６）〜式（９）の操作をＭ回繰り返すことで、再サンプリング（再編成）されたＭ個のパーティクルが得られる。
このようにして得られた新たなパーティクル群Ｐ_Ｂを遅延手段２５及びメタボール生成手段４（図１参照）に出力する。 By repeating the operations of Expressions (6) to (9) M times, M particles that are resampled (reorganized) are obtained.
The new particle group P _B obtained in this way is output to the delay means 25 and the metaball generation means 4 (see FIG. 1).

なお、パーティクルフィルタ手段３からメタボール生成手段４（図１参照）に出力し得るパーティクル群としては、前記した加重演算手段２３_Ｎから出力するパーティクル群Ｐ_Ａと、再サンプリング手段２４から出力するパーティクル群Ｐ_Ｂとがある。パーティクルフィルタ手段３からは、何れか一方を出力するが、何れを出力するか選択できるように選択手段を設けるようにしてもよい。 As the particle group which may be output from the particle filter unit 3 to metaball generation unit 4 (see FIG. 1), and particle groups P _A to be output from the weighting calculation means 23 _N described above, particle groups to be output from the resampling unit 24 There is P _B. Either one is output from the particle filter means 3, but a selection means may be provided so that it can be selected which one is output.

ここで、パーティクル群としてＰ_Ａを用いる場合と、Ｐ_Ｂを用いる場合の特徴について説明する。
加重演算手段２３_Ｎから出力されるパーティクル群Ｐ_Ａは、各パーティクルは、対象物体が存在する確からしさに応じて、それぞれ異なる重み係数ｗ_ｍを有しているため、より正確に対象物体を推定した情報を有している。このため、パーティクル群Ｐ_Ａに基づいてメタボール表現することで、正確な形状を再現することができる。 Here it will be described the case of using the P _A as particles group, the characteristics in the case of using the P _B.
Particle Group P _A output from the weighted calculation unit 23 _N, each particle, depending on the likelihood that the target object exists, because it has a different weighting factor w _m respectively, more accurately estimate the target object Information. Therefore, by metaball expressed based on the particle group P _A, it is possible to reproduce a precise shape.

一方、再サンプリング手段２４から出力されるパーティクル群Ｐ_Ｂは、各パーティクルは重み係数ｗ_ｍに応じて、パーティクルの個数に変換されたものである。この変換は、あるパーティクルが有する変換前の重み係数ｗ_ｍと、その変換によって生成されるパーティクルの個数とが確率的にしか対応しないため、局所的には近似的な変換となる。そのため、変換後のパーティクル群Ｐ_Ｂによる形状情報には誤差が生じるため、形状の推定精度は変換前のパーティクル群Ｐ_Ａよりも劣化する。しかし、パーティクル群Ｐ_Ｂは、全てのパーティクルの重み係数ｗ_ｍが均一の値に変換されるため、メタボール表現への変換の際には、各パーティクルを、半径の等しいメタボールとすることができるため、メタボール変換の演算を簡略化し、演算量を低減することができる。 On the other hand, the particle group P _B output from the re-sampling means 24 is obtained by converting each particle into the number of particles according to the weighting factor w _m . This conversion is a local approximate conversion because the weight coefficient w _m before conversion of a certain particle and the number of particles generated by the conversion correspond only probabilistically. Therefore, the shape information using a particle group P _B after the conversion since the errors occur, the estimation accuracy of the shape is degraded than the particle group P _A before conversion. However, in the particle group P _B , since the weight coefficient w _m of all particles is converted to a uniform value, each particle can be made into a metaball having the same radius when converted into the metaball expression. The calculation of the metaball conversion can be simplified and the calculation amount can be reduced.

パーティクル群Ｐ_Ａとパーティクル群Ｐ_Ｂのこのような特徴を考慮して、用途に適したパーティクル群を用いるようにすることができる。 Considering such characteristics of the particle group P _A and the particle group P _B , it is possible to use a particle group suitable for the application.

図３に戻って、パーティクルフィルタ３の構成について説明を続ける。
遅延手段２５は、各カメラＣ_ｎ（例えば、図２のカメラＣ_１，Ｃ_２）で次の撮影が行われ、新たな入力画像Ｉ_ｎが入力されるまで処理を待機するタイミングを調整のための手段であり、例えば、処理途中のパーティクルを一時記憶するメモリ手段で構成することができる。このメモリ手段にパーティクルを記憶し、状態遷移手段２６による処理時間を考慮した上で、入力画像Ｉ_ｎの入力タイミングを見計らって、メモリ手段からパーティクルを読み出して状態遷移手段２６に出力するようにすればよい。 Returning to FIG. 3, the description of the configuration of the particle filter 3 will be continued.
Delay means 25, each of the cameras C _{n (e.g.,} camera C _1, C 2 in FIG. ₂₎ next photographing is performed in, for adjusting the timing of the processing waits until a new input image I _n is input For example, it can be constituted by a memory means for temporarily storing particles being processed. Storing particles in the memory means, in consideration of the processing time by a state transition unit 26, by and sure to allow input timing of the input image I _n, to output from the memory means to the state transition means 26 reads the particles That's fine.

なお、遅延手段２５は、再サンプリング手段２４の直後に限らず、入力画像Ｉ_１〜Ｉ_Ｎに基づいて処理を行う加重演算手段２３_１から加重演算手段２３_Ｎの間以外であれば、再サンプリング手段２４の直前又は状態遷移手段２６の直後に設けてもよい。 Note that the delay unit 25 is not limited to immediately after the re-sampling unit 24, but resampling is performed if it is not between the weight calculation unit 23 ₁ and the weight calculation unit 23 _N that performs processing based on the input images I _{1 to} I _N. It may be provided immediately before the means 24 or immediately after the state transition means 26.

状態遷移手段２６は、遅延手段２５から入力される再サンプリング後のパーティクルに対して、各パーティクルの状態ベクトルｓ_ｍ（ｔ）を遷移させ、遷移後のパーティクルを加重演算手段２３_１に出力する。 State transition means 26 with respect to the particles after re-sampling is input from the delay unit 25, for each particle state vector s _{m (t)} to cause a transition, and outputs the particles after the transition to the weighted calculation unit 23 _1.

状態遷移手段２６は、各パーティクルの状態ベクトルｓ_ｍ（ｔ）を、式（１０）に示すように、確率密度関数ｐ（ｓ（ｔ＋１）｜ｓ_ｍ（ｔ））に従うサンプリングを行うことにより新たな状態ベクトルｓ_ｍ（ｔ＋１）に遷移させる。ここにｔは時刻（あるいは逐次処理の現在の回数）を表す。
ここで、状態遷移は、後記するように、対象物体が変形や移動する場合において、所定時間経過後に撮影した画像から抽出したシルエットＳ_ｎを用いて加重演算手段２３_ｎで重み係数ｗ_ｍを変更する際に、パーティクルが、所定時間経過後の対象物体の位置に追随するように、パーティクルを遷移（移動）させるものである。また、状態遷移において、ガウス雑音等のノイズを付加して遷移先に揺らぎを与え、対象物体の予測外の移動や変形にも追随できるようにしたり、再サンプリングによって縮退したパーティクルを分散させたりすることもできる。 The state transition means 26 newly samples the state vector s _m (t) of each particle by sampling according to the probability density function p (s (t + 1) | s _m (t)) as shown in Expression (10). The state vector s _m (t + 1). Here, t represents the time (or the current number of sequential processes).
Here, as will be described later, in the state transition, when the target object is deformed or moved, the weighting coefficient w _m is changed by the weight calculation means 23 _n using the silhouette S _n extracted from the image taken after a predetermined time has elapsed. In this case, the particles are moved (moved) so that the particles follow the position of the target object after a predetermined time has elapsed. In addition, in the state transition, noise such as Gaussian noise is added to make the transition destination fluctuate so that it can follow unintended movement or deformation of the target object, or degenerate particles by resampling are dispersed You can also.

ｓ_ｍ（ｔ＋１）〜ｐ（ｓ（ｔ＋１）｜ｓ_ｍ（ｔ）） …（１０） s _m (t + 1) to p (s (t + 1) | s _m (t)) (10)

ここで、式（１０）における確率密度関数ｐ（ｓ（ｔ＋１）｜ｓ_ｍ（ｔ））は、対象物体の変形や移動の性質に応じて定める。
例えば、式（１１）に示すような、ｄ次元の状態ベクトルに対する行列Ａとベクトルｂとによる線形の状態遷移に、平均が０、共分散がΣのガウス雑音を付加した確率密度関数を用いることができる。 Here, the probability density function p (s (t + 1) | s _m (t)) in Expression (10) is determined according to the properties of deformation and movement of the target object.
For example, a probability density function in which Gaussian noise with an average of 0 and a covariance of Σ is added to a linear state transition between a matrix A and a vector b for a d-dimensional state vector as shown in Expression (11). Can do.

ここで、状態遷移に式（１１）の確率密度関数を仮定した場合、式（１０）の操作は、式（１２−１）及び式（１２−２）により実装することができる。 Here, when the probability density function of Expression (11) is assumed for the state transition, the operation of Expression (10) can be implemented by Expression (12-1) and Expression (12-2).

ここで、ｖ（ｔ）は平均が０、共分散がΣのガウス雑音であり、正規分布に従う乱数により生成することができる。 Here, v (t) is Gaussian noise having an average of 0 and a covariance of Σ, and can be generated by a random number according to a normal distribution.

式（１２−１）において、行列Ａ及びベクトルｂによって、対象物体の回転、並進などの線形の移動に対応してパーティクルを遷移（移動）させ、次の入力画像Ｉ_ｎの撮影時の対象物体の位置から外れないようにすることができる。 In formula (12-1), the matrix A and the vector b, the rotation of the object, in response to linear movement, such as translational shifts the particle (movement), the object at the time of imaging of the next input image I _n It is possible to prevent it from moving out of the position.

また、式（１２−１）において、式（１２−２）のようなガウス雑音ｖ（ｔ）を付加して、パーティクルの遷移先に揺らぎを与えることにより、対象物体の動作の把握が不十分な段階で想定外の動きをした場合にも、複数のパーティクルの内の何個かを、次の撮影時の対象物体の真の領域内に遷移させることができ、対象物体を捉え続けることができる。
さらに、ガウス雑音を付加することにより、再サンプリングによって複数のコピーが生成されたパーティクルの状態ベクトルに揺らぎを与え、縮退していたパーティクルの位置を分散させることができる。これによって、個数の限られたパーティクルを、対象物体の存在候補領域内に偏りなく分布させることができる。 In addition, in the equation (12-1), the Gaussian noise v (t) as in the equation (12-2) is added to give fluctuations to the particle transition destination, so that the operation of the target object is not sufficiently grasped. Even if an unexpected movement occurs at any stage, it is possible to transition some of the multiple particles into the true area of the target object at the next shooting, and to continue to capture the target object. it can.
Furthermore, by adding Gaussian noise, it is possible to fluctuate the state vector of particles for which a plurality of copies are generated by resampling, and to disperse the positions of degenerated particles. As a result, a limited number of particles can be distributed evenly within the target object existence candidate region.

次に、図８及び図９を参照して、パーティクルを状態遷移させて対象物体の形状推定をする様子について説明する。ここで、図８は、回転台に置かれた剛体の対象物体の形状推定の様子を示した図であり、図９は、ほぼ等速度運動する非剛体の対象物体の形状推定の様子を示した図である。 Next, with reference to FIG. 8 and FIG. 9, a state in which the shape of the target object is estimated by changing the state of particles will be described. Here, FIG. 8 is a diagram showing a shape estimation state of a rigid target object placed on a turntable, and FIG. 9 is a diagram showing a shape estimation state of a non-rigid target object that moves at a substantially constant speed. It is a figure.

（剛体の対象物体を回転台に載せて観測する応用例）
まず、図８に示すように回転台５１の上に剛体の対象物体５２を載せ、回転台５１を予め定めた角度ずつ（１回の撮影毎に、すなわち処理周期あたり角度θずつ）回転させながら形状推定を行う場合について説明する。状態ベクトルｓ_ｍは、式（１）に示した３次元のベクトルを用いることとする。図８に示すように回転軸をＺ軸とし、右手系のデカルト座標系を定義する。なお、この座標系は回転しないものとする。この場合、対象物体は回転台とともに回転し、かつ変形しないので、式（１２−１）における行列Ａ及びベクトルｂを、それぞれ式(１３−１)及び式（１３−２）のように定めることができる。 (Application example for observing a rigid target object on a rotating table)
First, as shown in FIG. 8, a rigid target object 52 is placed on a turntable 51, and the turntable 51 is rotated by a predetermined angle (every time an image is taken, that is, by an angle θ per processing cycle). A case where shape estimation is performed will be described. State vector s _m shall be possible to use a three-dimensional vector shown in equation (1). As shown in FIG. 8, the rotation axis is the Z axis and a right-handed Cartesian coordinate system is defined. It is assumed that this coordinate system does not rotate. In this case, since the target object rotates with the turntable and does not deform, the matrix A and the vector b in the equation (12-1) are determined as the equations (13-1) and (13-2), respectively. Can do.

共分散行列Σは、半正定値の実対称行列であれば、その値は任意に決めることができ、例えば、非負の値を対角要素にもつ対角行列を設定することができる。この対角要素の値も任意に選択することができ、例えば、対象物体の大きさに応じて定めることができる。例えば、対象物体の外接球の半径をｒとしたとき、ｒ^２より十分小さな定数（例えばｒ^２／１００００）を対角要素の値として採用する。さらに、対角要素の値は時々刻々変化させてもよい。例えば、本装置を起動したときに大きめの値（例えばｒ^２／１００）を設定し、撮影する毎に、対角要素の値を小さくする（例えば、撮影する毎に１／２を乗ずる）ようにしてもよい。 The covariance matrix Σ can be arbitrarily determined as long as it is a positive semi-definite real symmetric matrix. For example, a diagonal matrix having non-negative values as diagonal elements can be set. The value of this diagonal element can also be arbitrarily selected, and can be determined according to the size of the target object, for example. For example, when the radius of the circumscribed sphere of the object was r, employing sufficiently small constant than r ² (e.g. r ^2/10000) as the value of the diagonal elements. Further, the value of the diagonal element may be changed from moment to moment. For example, set the larger value (e.g., r ^2/100) when starting the device, each time the photographing, to reduce the value of the diagonal elements (for example, multiplied by 1/2 every time shooting) so It may be.

（変形する対象物体を回転台に載せて観測する応用例）
次に、図８において、対象物体が非剛体であり、変形する場合について説明する。回転台５１の上に変形する対象物体を載せた場合にも、行列Ａ及びベクトルｂとして、それぞれ式（１３−１）及び式（１３−２）を適用することができる。この場合、共分散行列Σは、対象物体が剛体である場合よりも、その固有値が大きくなるよう設定すればよい。すなわち、変形の度合いを織り込んだ、共分散行列Σを設定することで、形状推定を行うことができる。 (Application example of observing the target object to be deformed on a turntable)
Next, a case where the target object is a non-rigid body and deforms in FIG. Even when the target object to be deformed is placed on the turntable 51, the equations (13-1) and (13-2) can be applied as the matrix A and the vector b, respectively. In this case, the covariance matrix Σ may be set so that its eigenvalue is larger than when the target object is a rigid body. That is, shape estimation can be performed by setting a covariance matrix Σ that incorporates the degree of deformation.

（移動する対象物体を観測する応用例）
次に、図９に示すように、移動する対象物体を観測する場合について説明する。図９に示したように、コース５３上を走る非剛体（人物）５４を対象物体として、時々刻々の形状を推定するような場合を例にして説明する。以下の説明では状態ベクトルｓ_ｍとして式（２）に示した位置及び速度からなる６次元のベクトルを用いることとする。例えば、移動する対象物体の速度がほぼ一定（すなわち等速度運動からの誤差が共分散行列Σで吸収可能）であり、また撮影の時間間隔もΔＴで一定である場合は、行列Ａ及びベクトルｂを、それぞれ式（１４−１）及び式（１４−２）のようにすることができる。 (Application example for observing moving objects)
Next, a case where a moving target object is observed as shown in FIG. 9 will be described. As shown in FIG. 9, a case where the non-rigid body (person) 54 running on the course 53 is assumed as the target object and the shape of the moment is estimated will be described as an example. In the following description and the use of 6-dimensional vector composed of position and velocity are shown in equation (2) as a state vector s _m. For example, when the speed of the moving target object is substantially constant (that is, the error from the constant velocity motion can be absorbed by the covariance matrix Σ), and the imaging time interval is also constant at ΔT, the matrix A and the vector b Can be changed to equations (14-1) and (14-2), respectively.

なお、図８に示した例と同様に、共分散行列Σは半正定値の実対称行列であれば、その値は任意に決めることができる。 As in the example shown in FIG. 8, the value of the covariance matrix Σ can be arbitrarily determined as long as it is a semi-definite real symmetric matrix.

このようにして行列Ａ及びベクトルｂを定式化して撮影毎にパーティクルを状態遷移させることにより、移動する物体の形状推定を行うことができる。 In this way, the matrix A and the vector b are formulated, and the state of the moving object is estimated by performing the state transition of the particles for each photographing.

図１に戻って、形状推定装置１０の構成について説明を続ける。
メタボール生成手段４は、パーティクルフィルタ手段３から入力されたパーティクル群に基づき、メタボール表現の３次元モデルを生成し、生成した３次元モデルによる形状データをモルフォロジ処理手段５に出力する。 Returning to FIG. 1, the description of the configuration of the shape estimation apparatus 10 will be continued.
The metaball generation unit 4 generates a three-dimensional model of metaball expression based on the particle group input from the particle filter unit 3, and outputs shape data based on the generated three-dimensional model to the morphology processing unit 5.

メタボール生成手段４は、各パーティクルの状態ベクトルｓ_ｍと重み係数ｗ_ｍに基づいてメタボール表現の３次元モデルを生成する。メタボールとは、中心からの距離によって一定の法則で濃度が変化する仮想的な球体（メタボール）を、１個乃至複数個用いて３次元物体の形状を表現する手法である。複数個のメタボールが存在する場合、ある地点における濃度は全メタボールの濃度の加算値として定義される。そして、全メタボールからの濃度を加算した濃度が一定となる曲面を対象物体の表面として定義する。 The metaball generation unit 4 generates a three-dimensional model of metaball expression based on the state vector s _m and the weight coefficient w _m of each particle. The metaball is a technique for expressing the shape of a three-dimensional object using one or a plurality of virtual spheres (metaballs) whose density changes according to a certain rule depending on the distance from the center. When there are a plurality of metaballs, the concentration at a certain point is defined as the sum of the concentrations of all metaballs. Then, a curved surface having a constant density obtained by adding the densities from all metaballs is defined as the surface of the target object.

１つのメタボールの濃度を表す関数をｂ（Ｘ，Ｙ，Ｚ；Ｘ_０，Ｙ_０，Ｚ_０，ｈ）とおく。ここで、（Ｘ_０，Ｙ_０，Ｚ_０）はメタボールの中心座標、ｈは濃度の拡がりを決めるパラメータである。例えば、式（１５）が典型的なメタボールの濃度関数である。 Let b (X, Y, Z; X ₀ , Y ₀ , Z ₀ , h) be a function representing the concentration of one metaball. Here, (X ₀ , Y ₀ , Z ₀ ) is the center coordinate of the metaball, and h is a parameter that determines the density spread. For example, equation (15) is a typical metaball concentration function.

メタボール生成手段４は、例えば、各パーティクルの状態ベクトルｓ_ｍに含まれる３次元座標（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）を、対応する１つのメタボールの中心座標とすることができる。一方、濃度の拡がりのパラメータｈは、例えば、重み係数ｗ_ｍに基づきｈ＝ｈ（ｗ_ｍ）のように決定することができる。ここで、ｈは予め定める任意の関数であるが、ｈを広義の単調増加関数とすることが好ましく、例えば、ｈとして０以上の傾きを有する１次関数とすることができる。また、式（１６）のように、単に For example, the metaball generation unit 4 can use the three-dimensional coordinates (X _m , Y _m , Z _m ) included in the state vector s _m of each particle as the center coordinates of one corresponding meta ball. On the other hand, the density h parameter h can be determined as h = h (w _m ) based on the weighting factor w _m , for example. Here, h is an arbitrary function determined in advance, and h is preferably a monotonically increasing function in a broad sense. For example, h can be a linear function having a slope of 0 or more. Also, simply as in equation (16)

ｈ（ｗ）＝ｗ …（１６） h (w) = w (16)

としてもよい。さらに、例えば、ｈ（ｗ）＝１（定関数）とすることもできる。 It is good. Further, for example, h (w) = 1 (constant function) may be used.

Ｍ個のそれぞれのパーティクルに対応して、メタボールを生成する。その結果、ある３次元座標（Ｘ，Ｙ，Ｚ）における濃度Ｂ（Ｘ，Ｙ，Ｚ）は、式（１７）のように表すことができる。 A metaball is generated corresponding to each of the M particles. As a result, the density B (X, Y, Z) at a certain three-dimensional coordinate (X, Y, Z) can be expressed as in Expression (17).

特に、式（１５）の濃度関数と、式（１６）のｈとを採用した場合には、式（１８）のように表される。 In particular, when the concentration function of Expression (15) and h of Expression (16) are adopted, it is expressed as Expression (18).

次に、図１０を参照して、メタボール生成手段４によりパーティクルをメタボール表現に変換する様子について説明する。図１０は、パーティクル群からメタボール表現に変換する様子を示した図である。
図１０の左図において、各パーティクルの存在位置に近いほど、またパーティクルが密集しているほどメタボール表現での濃度が濃くなる。図１０に示した例では、メタボール表現を便宜上２次元的に図示しているが、実際には３次元空間上の濃度分布となる。 Next, with reference to FIG. 10, how the metaball generation unit 4 converts particles into a metaball representation will be described. FIG. 10 is a diagram illustrating a state in which the particle group is converted into a metaball representation.
In the left diagram of FIG. 10, the closer to the position where each particle exists, and the denser the particles, the higher the density in the metaball expression. In the example shown in FIG. 10, the metaball expression is shown two-dimensionally for convenience, but actually, the concentration distribution is in a three-dimensional space.

メタボール表現における等濃度曲面が対象物体の推定形状（表面形状）を表す。例えば、適宜な閾値βを設定し、式（１９）により対象物体の推定形状Ｖを得ることができる。 The isodensity curved surface in the metaball representation represents the estimated shape (surface shape) of the target object. For example, an appropriate threshold value β is set, and the estimated shape V of the target object can be obtained by Expression (19).

図１１を参照して、対象物体のメタボール表現から、対象物体の推定形状を得る様子について説明する。図１１は、メタボール表現から推定形状を得る様子を示した図である。
図１１はメタボール表現において設定した、ある等濃度曲面（左図の破線）の内部および境界を抽出することで、図１１の右図のような対象物体の推定形状を得ることができる。なお、式（１９）の操作は、通常、従来のＣＧ（Computer Graphics）ソフトウェアで実行することができる。 With reference to FIG. 11, how to obtain the estimated shape of the target object from the metaball representation of the target object will be described. FIG. 11 is a diagram showing how the estimated shape is obtained from the metaball representation.
FIG. 11 shows the estimated shape of the target object as shown in the right diagram of FIG. 11 by extracting the inside and boundary of a certain isoconcentration curved surface (broken line in the left diagram) set in the metaball representation. Note that the operation of equation (19) can usually be executed by conventional CG (Computer Graphics) software.

図１に戻って、形状推定装置１０の構成について説明を続ける。
モルフォロジ処理手段５は、メタボール生成手段４によって生成された対象物体の３次元形状データを調整し、出力手段６に出力する。 Returning to FIG. 1, the description of the configuration of the shape estimation apparatus 10 will be continued.
The morphology processing unit 5 adjusts the three-dimensional shape data of the target object generated by the metaball generation unit 4 and outputs it to the output unit 6.

メタボール表現の性質として、前記した閾値β、濃度を表す関数ｂ（Ｘ，Ｙ，Ｚ；Ｘ_０，Ｙ_０，Ｚ_０、ｈ）、および関数ｈの選び方によっては、実際の物体形状よりも拡大もしくは縮小された推定形状が得られてしまう場合がある。モルフォロジ処理手段５は、メタボール表現に対して、３次元の多値モルフォロジ処理を適用することで推定形状を収縮もしくは膨張し、対象物体の実形状に近づけるための修正手段である。 Depending on the property of the metaball expression, the threshold value β, the function b (X, Y, Z; X ₀ , Y ₀ , Z ₀ , h) representing the density, and the method of selecting the function h are larger than the actual object shape. Alternatively, a reduced estimated shape may be obtained. The morphology processing means 5 is a correction means for contracting or expanding the estimated shape by applying a three-dimensional multi-value morphology process to the metaball representation so that it approximates the actual shape of the target object.

ここで、モルフォロジ処理手段５への入力濃度関数をＢ（Ｘ，Ｙ，Ｚ）、モルフォロジ処理手段５からの出力濃度関数をＢ’（Ｘ，Ｙ，Ｚ）とする。収縮のためのモルフォロジ処理は、式（２０）に示すように、ある注目位置の周囲に設定される一定領域内の最小入力濃度を、当該注目位置の出力濃度とする。 Here, the input density function to the morphology processing means 5 is B (X, Y, Z) and the output density function from the morphology processing means 5 is B '(X, Y, Z). In the morphological processing for contraction, as shown in Expression (20), the minimum input density within a certain region set around a certain target position is set as the output density of the target position.

ここで、ＥはＳｔｒｕｃｔｕａｌＥｌｅｍｅｎｔと称される領域形状であり、例えば、式（２１）に示すように、半怪ρの球を用いることができる。 Here, E is a region shape called Structural Element, and for example, as shown in Equation (21), a half-phantom ρ sphere can be used.

一方、膨張のためのモルフォロジ処理は、式（２２）に示すように、ある注目位置の周囲に設定される一定領域内の最大入力濃度を、当該注目位置の出力濃度とする。 On the other hand, in the morphology processing for expansion, as shown in Expression (22), the maximum input density within a certain region set around a certain target position is set as the output density of the target position.

このように、モルフォロジ処理によって、メタボール生成手段４によって生成された対象物体の推定形状の拡大又は縮小を修正（調整）した３次元形状データを生成することができる。 As described above, three-dimensional shape data in which the expansion or reduction of the estimated shape of the target object generated by the metaball generation unit 4 is corrected (adjusted) can be generated by the morphology processing.

出力手段６は、モルフォロジ処理手段５によって修正された、対象物体の３次元形状データを出力するための出力インターフェースである。
これによって、例えば、図示しない画像表示装置などに出力して、推定された形状を表示して確認することができる。 The output unit 6 is an output interface for outputting the three-dimensional shape data of the target object corrected by the morphology processing unit 5.
Thus, for example, the estimated shape can be displayed and confirmed by outputting to an image display device (not shown).

なお、本実施形態では、形状推定装置１０のシルエット抽出手段２及びパーティクルフィルタ手段３は、専用のハードウェアによって構成するようにしたが、一般的なコンピュータを、シルエット抽出手段２及びパーティクルフィルタ手段３として機能するプログラムによって動作させることで実現することもできる。また、形状推定装置１０は、メタボール生成手段４及びモルフォロジ処理手段を含めた各手段として機能するプログラムを結合して動作させることで実現することもできる。 In this embodiment, the silhouette extraction unit 2 and the particle filter unit 3 of the shape estimation apparatus 10 are configured by dedicated hardware. However, a general computer is configured using the silhouette extraction unit 2 and the particle filter unit 3. It can also be realized by operating with a program that functions as: The shape estimation apparatus 10 can also be realized by combining and operating programs that function as each unit including the metaball generation unit 4 and the morphology processing unit.

[形状推定装置の動作]
次に、図１２を参照（適宜図１乃至図３参照）して、図１に示した形状推定装置１０の動作について説明する。図１２は、図１に示した形状推定装置の処理の流れを示したフローチャートである。 [Operation of shape estimation device]
Next, the operation of the shape estimation apparatus 10 shown in FIG. 1 will be described with reference to FIG. 12 (refer to FIGS. 1 to 3 as appropriate). FIG. 12 is a flowchart showing the flow of processing of the shape estimation apparatus shown in FIG.

まず、形状推定装置１０は、カメラＣ_ｎ（例えば、図２のＣ_１，Ｃ_２）等によって対象物体を撮影して得られた画像を入力画像Ｉ_１〜Ｉ_Ｎとして、撮影時のカメラパラメータＣＰ_１〜ＣＰ_Ｎと共に入力手段１_１〜１_Ｎを介して入力する（ステップＳ１１）。 First, the shape estimating device 10, a camera _{C n} (e.g., _C 1, _{C 2} in FIG. 2) an image obtained by photographing the object by such as the input image _I 1 ~I _N, camera parameters at the time of shooting through the input unit ₁ 1 to 1 _N inputs with CP ₁ ~ CP _N (step S11).

入力された入力画像Ｉ_１〜Ｉ_Ｎは、それぞれ対応するシルエット抽出手段２_１〜２_ＮによってシルエットＳ_１〜Ｓ_Ｎが抽出される（ステップＳ１２）。 From the input images I _{1 to} I _N , silhouettes S _{1 to} S _N are extracted by the corresponding silhouette extraction means 2 ₁ to 2 _N (step S 12).

次に、パーティクルフィルタ手段３によって、シルエット抽出手段２_１〜２_Ｎで抽出されたシルエットＳ_１〜Ｓ_Ｎ及び入力手段１_１〜１_Ｎを介して入力されたカメラパラメータＣＰ_１〜ＣＰ_Ｎに基づいて、複数のパーティクルを用いた対象物体の形状推定を行う（ステップＳ１３）。 Then, the particle filter unit 3, based on the camera parameters _CP 1 ~ CP _N input through the silhouette extraction unit ₂ 1 to 2 silhouette _S 1 were extracted with _N to S _N and the input means ₁ 1 to 1 _N Then, the shape of the target object is estimated using a plurality of particles (step S13).

そして、メタボール生成手段４によって、パーティクルフィルタ手段３で推定された対象物体の形状を表すパーティクル群を、メタボール表現に変換し、対象物体の形状データを生成する（ステップＳ１４）。 Then, the metaball generation unit 4 converts the particle group representing the shape of the target object estimated by the particle filter unit 3 into a metaball expression, and generates target object shape data (step S14).

メタボール生成手段４で生成された対象物体の形状データは、モルフォロジ処理手段５によって、モルフォロジ処理が施されて、その形状の修正が行われ（ステップＳ１５）、出力手段６を介して出力される（ステップＳ１６）。 The shape data of the target object generated by the metaball generation unit 4 is subjected to morphological processing by the morphology processing unit 5 to correct the shape (step S15), and is output via the output unit 6 ( Step S16).

形状推定装置１０は、処理すべき次の画像の入力があるかどうかを確認し（ステップＳ１７）、画像がある場合は（ステップＳ１７でＹｅｓ）、ステップＳ１１に戻って対象物体の形状推定を繰り返し実行し、パーティクル群の状態ベクトルｓ_ｍ及び重み係数ｗ_ｍの更新を続ける。一方、次の画像の入力がない場合は（ステップＳ１７でＮｏ）、処理を終了する。 The shape estimation apparatus 10 confirms whether or not there is an input of the next image to be processed (step S17). If there is an image (Yes in step S17), the shape estimation device 10 returns to step S11 and repeats the target object shape estimation. run continues to update the state vector s _m and the weighting factor w _m for the particle group. On the other hand, if there is no input for the next image (No in step S17), the process ends.

なお、図１２に示したフォローチャートでは、ステップＳ１３において、パーティクルフィルタ手段３が出力したパーティクル群に基づいて、ステップＳ１４〜ステップＳ１６の処理を行った後に、次の画像に基づくステップＳ１１〜ステップＳ１３の処理を行うようにしたが、ステップＳ１３において、パーティクルフィルタ手段３がパーティクル群を出力した後に、メタボール生成手段４によるステップＳ１４の処理を進めると共に、ステップＳ１１に戻り、並行して次の画像に基づく処理を行うようにしてもよい。 In the follow chart shown in FIG. 12, in step S13, after the processing of step S14 to step S16 is performed based on the particle group output by the particle filter unit 3, steps S11 to S13 based on the next image are performed. In step S13, after the particle filter unit 3 outputs the particle group in step S13, the process of step S14 by the metaball generation unit 4 is advanced, and the process returns to step S11 to simultaneously display the next image. You may make it perform the process based on.

［パーティクルフィルタ手段の動作］
次に、図１３を参照（適宜図３参照）して、図３に示したパーティクルフィルタ手段３の動作について説明する。図１３は、図３に示したパーティクルフィルタ手段の処理の流れを示したフローチャートである。
なお、図１３に示したフローチャートは、図１２に示したフローチャートのステップＳ１３に対応する。 [Operation of particle filter means]
Next, the operation of the particle filter means 3 shown in FIG. 3 will be described with reference to FIG. FIG. 13 is a flowchart showing the flow of processing of the particle filter means shown in FIG.
Note that the flowchart shown in FIG. 13 corresponds to step S13 of the flowchart shown in FIG.

まず、パーティクルフィルタ手段３は、パーティクル生成手段２１によって、対象物体の存在候補領域に所定数（Ｍ個）の、パラメータを初期化したパーティクルを生成し、スイッチ２２をパーティクル生成手段２１側に接続して、生成したパーティクルを加重演算手段２３_２に供給する（ステップＳ３１）。
なお、所定数のパーティクルの供給が完了すると、スイッチ２２の接続を加重演算手段２３_１側に切り替えておく。 First, the particle filter unit 3 uses the particle generation unit 21 to generate a predetermined number (M) of particles with initialized parameters in the target object existence candidate region, and connects the switch 22 to the particle generation unit 21 side. Te, and it supplies the generated particles to the weight calculating unit 23 ₂ (step S31).
Incidentally, when the supply of the predetermined number of particles is complete, keep switching the connection of the switch 22 to the weighted calculation unit 23 _1.

パーティクルフィルタ手段３は、加重演算手段２３_２によって、入力されたパーティクルの重み係数ｗ_ｍを更新（重み係数変更工程）し、更新したパーティクルを次段の加重演算手段２３_３（不図示）に出力する（ステップＳ３２_２）。そして、順次、ｎ番目の加重演算手段２３_ｎによって、前段のｎ−１番目の加重演算手段２３_ｎ−１から入力されたパーティクルの重み係数ｗ_ｍを更新し、更新したパーティクルを次段の加重演算手段２３_ｎ＋１に出力する。
最後段の加重演算手段２３_Ｎによって、パーティクルの重み係数ｗ_ｍを更新すると（ステップＳ３２_Ｎ）、Ｍ個のパーティクル群Ｐ_Ａを、メタボール生成手段４（図１参照）に出力する（ステップＳ３３）と共に、再サンプリング手段２４に出力する。
なお、ここでは、パーティクルフィルタ手段３から出力されるパーティクル群は、加重演算手段２３_Ｎによって出力されるパーティクル群Ｐ_Ａとしたが、再サンプリング手段２４によって出力されるパーティクル群Ｐ_Ｂとしてもよい。その場合には、パーティクル群の出力（ステップＳ３３）は、再サンプリング（ステップＳ３４）の後に行うようにすればよい。 Particle filter means 3, by a weighted calculation unit 23 _2, and updates the weighting factor w _m of the input particle (weight coefficient changing step), outputs the updated particle to the next stage of the weighting calculation means 23 3 _(not shown) (Step S32 ₂ ). Then, the n-th weight calculation means 23 _n sequentially updates the weight coefficient w _m of the particles input from the _n− 1th weight calculation means 23 _n−1 in the previous stage, and the updated particles are weighted in the next stage. It outputs to the arithmetic means 23 _{n + 1} .
By a weighted arithmetic means 23 _N of the last stage, updating the weighting factor _{w m} of particles (step _S32 N), the M particles group _{P A,} and outputs the metaball generation unit 4 (see FIG. 1) (step S33) At the same time, it is output to the resampling means 24.
Here, the particle group to be output from the particle filter unit 3 has been a particle group P _A output by the weighting calculation means 23 _N, may be a particle group P _B output by the resampling means 24. In that case, the output of the particle group (step S33) may be performed after resampling (step S34).

続いて、パーティクルフィルタ手段３は、再サンプリング手段２４によって、加重演算手段２３_Ｎで重み係数ｗ_ｍを更新されたパーティクル群を再サンプリングし（ステップＳ３４）、再サンプリングしたパーティクル群を遅延手段２５に出力する。 Subsequently, the particle filter means 3 resamples the particle group whose weight coefficient w _m has been updated by the weight calculation means 23 _N by the re-sampling means 24 (step S34), and sends the re-sampled particle group to the delay means 25. Output.

パーティクルフィルタ手段３は、遅延手段２５によって、次の画像入力があるまでパーティクルの処理を待機し、タイミングを調整して状態遷移手段２６にパーティクルを出力する（ステップＳ３５）。 The particle filter means 3 waits for the processing of the particles until the next image input by the delay means 25, adjusts the timing, and outputs the particles to the state transition means 26 (step S35).

そして、状態遷移手段２６によって、パーティクルの状態ベクトルを遷移させて（ステップＳ３６）、状態ベクトルを遷移させたパーティクルを加重演算手段２３_１に出力する。 Then, the state changing means 26, by transitioning the state vector of the particle (Step S36), and outputs the particles which transits the state vector to the weighted calculation unit 23 _1.

ステップＳ３１において、パーティクル生成手段２１で生成されたパーティクルは、ここまでで、処理が一巡したことになる。そして、次の入力画像に基づいて、二巡目の処理に進む。
ステップＳ３６において状態遷移手段２６で状態ベクトルを遷移されたパーティクルは、二巡目以降は、加重演算手段２３_１から加重演算手段２３_Ｎによって、順次パーティクルの重み係数ｗ_ｍが更新され（ステップＳ３２_１〜Ｓ３２_Ｎ）、パーティクル群Ｐ_Ａを出力する（ステップＳ３３）。そして、再サンプリング手段２４によって、パーティクル群を再サンプリングし（ステップＳ３４）、遅延手段２５によって、パーティクルの処理のタイミングを調整し（ステップＳ３５）、状態遷移手段２６によって、パーティクルの状態ベクトルを遷移する（ステップＳ３６）。
以下、新たな入力画像があると、ステップＳ３２_１からステップＳ３６を繰り返す。 In step S31, the particles generated by the particle generating means 21 have been processed once. Then, based on the next input image, the process proceeds to the second round.
For the particles whose state vectors have been changed by the state transition means 26 in step S36, the weight coefficient w _{m of the} particles is sequentially updated by the weight calculation means 23 ₁ to the weight calculation means 23 _N after the second round (step S32 ₁ ~S32 _N), and it outputs the particle group _{P a} (step S33). Then, the re-sampling unit 24 resamples the particle group (step S34), the delay unit 25 adjusts the particle processing timing (step S35), and the state transition unit 26 transitions the particle state vector. (Step S36).
Hereinafter, when there is a new input image, and repeats the step S36 from step S32 _1.

ここで、入力画像を時系列で入力し、パーティクルフィルタ手段３によって、パーティクルの状態ベクトルｓ_ｍ及び重み係数ｗ_ｍの更新を繰り返し行うことにより、対象物体が移動又は変形する場合であっても、限られた個数のパーティクルを用いて当該対象物体の形状を効率的に推定することができる。 Here, even when the target object moves or deforms by inputting the input image in time series and repeatedly updating the particle state vector s _m and the weighting factor w _m by the particle filter unit 3, The shape of the target object can be efficiently estimated using a limited number of particles.

なお、加重演算手段２３_１〜２３_Ｎは、パーティクルを個々に処理することができるため、Ｍ個のパーティクルのすべての処理を終えてから次段に出力するのではなく、処理が成された個々のパーティクルを順次出力し、直列に接続された加重演算手段２３_１〜２３_Ｎによってパイプライン処理を行うようにしてもよい。 Since the weight calculation means 23 ₁ to 23 _N can individually process the particles, each of the processed particles is not output to the next stage after all the M particles have been processed. The particles may be sequentially output, and the pipeline processing may be performed by the weight calculation units 23 ₁ to 23 _N connected in series.

［パーティクル生成手段の動作］
次に、図１４を参照（適宜図４参照）して、図１２に示したフローチャートのパーティクル生成（ステップＳ３１）の手順の例について説明する。図１４は、図１２に示したフローチャートのパーティクル生成の詳細な手順を示したフローチャートである。 [Operation of particle generation means]
Next, an example of the procedure of particle generation (step S31) in the flowchart shown in FIG. 12 will be described with reference to FIG. 14 (refer to FIG. 4 as appropriate). FIG. 14 is a flowchart showing a detailed procedure of particle generation in the flowchart shown in FIG.

式（３）及び式（４）に基づくパーティクルの生成は、図１４のフローチャートに示した手順により簡便に実行することもできる。
Ｍ個のパーティクルを生成するに際して、ｍ（ｍ＝１，２，…，Ｍ）をパーティクルの番号とすると、まず、ｍ＝１として（ステップＳ５１）、１番目のパーティクルから順次生成する。 The generation of particles based on the formula (3) and the formula (4) can be easily executed by the procedure shown in the flowchart of FIG.
When generating M particles, if m (m = 1, 2,..., M) is a particle number, first, m = 1 is set (step S51), and the particles are sequentially generated from the first particle.

１番目のパーティクルに対して、領域Ｄ内において一様で、かつ、領域Ｄ外において確率密度が０となる確率密度関数ｕ（Ｄ）に従う乱数を生成し、生成した乱数値から３次元の座標（Ｘ，Ｙ，Ｚ）を構成して、座標の初期値の候補とする（ステップＳ５２）。 For the first particle, a random number according to the probability density function u (D) that is uniform in the region D and has a probability density of 0 outside the region D is generated, and three-dimensional coordinates are generated from the generated random value. (X, Y, Z) are configured and set as candidates for initial values of coordinates (step S52).

次に、ステップＳ５２でサンプリングした候補座標のシルエットＳ_１に対応する画像面への投影変換（ｆ_１（Ｘ，Ｙ，Ｚ）、ｇ_１（Ｘ，Ｙ，Ｚ））を求め、シルエットＳ_１の画素値Ｓ_１（ｆ_１（Ｘ，Ｙ，Ｚ）、ｇ_１（Ｘ，Ｙ，Ｚ））を取得し、それをｑとする（ステップＳ５３）。 Next, the projection transformation (f ₁ (X, Y, Z), g ₁ (X, Y, Z)) onto the image plane corresponding to the silhouette S ₁ of the candidate coordinates sampled in step S52 is obtained, and the silhouette S ₁ Pixel value S ₁ (f ₁ (X, Y, Z), g ₁ (X, Y, Z)) is acquired and set to q (step S53).

続いて、０以上１未満の一様乱数ｒを生成し（ステップＳ５４）、ステップＳ５３で取得したｑとを比較する（ステップＳ５５）。
ｑがｒよりも大きい場合（ステップＳ５５でＹｅｓ）、ステップＳ５２で生成した座標（Ｘ，Ｙ，Ｚ）を、１番目（ｍ番目）のパーティクルの座標（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）の初期値として採用する（ステップＳ５６）。
一方、ｑがｒ以下の場合（ステップＳ５５でＮｏ）、ステップＳ５２に戻り、１番目（ｍ番目）のパーティクルの座標の候補を再度生成する。 Subsequently, a uniform random number r of 0 or more and less than 1 is generated (step S54), and compared with q acquired in step S53 (step S55).
If q is larger than r in (Yes in step S55), the coordinates generated in step S52 (X, Y, Z), and first (m-th) of particles coordinates _{_{_{(X m, Y m, Z}}} m) The initial value is adopted (step S56).
On the other hand, if q is equal to or smaller than r (No in step S55), the process returns to step S52, and the coordinate candidates of the first (mth) particle are generated again.

ステップＳ５６で１番目（ｍ番目）のパーティクルの座標の初期値が採用されると、そのパーティクルの番号ｍと生成すべきパーティクルの個数Ｍとを比較し（ステップＳ５７）、ｍがＭよりも小さい場合は（ステップＳ５７でＹｅｓ）、ｍをインクリメントし（ステップＳ５８）、ステップＳ５２に戻って、次のパーティクルに対する座標の初期値を決定する手順を順次実行する。そして、ｍがインクリメントされてＭになるまでステップＳ５２からステップＳ５８の処理を繰り返す。
一方、ｍがＭ以上の場合（ステップＳ５７でＮｏ）、必要個数のパーティクルの座標の初期値の決定が完了したため、処理を終了する。 When the initial value of the coordinates of the first (m-th) particle is adopted in step S56, the particle number m is compared with the number M of particles to be generated (step S57), and m is smaller than M. In this case (Yes in step S57), m is incremented (step S58), and the process returns to step S52 to sequentially execute the procedure for determining the initial value of coordinates for the next particle. The processes from step S52 to step S58 are repeated until m is incremented to M.
On the other hand, if m is equal to or greater than M (No in step S57), the process ends because the determination of the initial values of the coordinates of the required number of particles has been completed.

以上の手順によって、Ｍ個のパーティクルの状態ベクトルｓ_ｍの座標の初期値（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）が設定される。また、状態ベクトルｓ_ｍが、座標（位置成分）以外の成分を含む場合は、それらの成分の一部又は全部の初期値を、所定の定数（例えば０）や乱数によって設定することができる。また、パーティクルの重み係数ｗ_ｍの初期値は、すべて均一の値（例えば、１／Ｍ）とする。 By the above procedure, the coordinate initial value of M particles in the state vector _{_{_{s m (X m, Y m}}} , Z m) is set. The state vector s _m is if it contains components other than the coordinate (position component), the initial values of some or all of the components, can be set by a predetermined constant (e.g., 0) and a random number. The initial values of the particle weighting coefficient w _m are all uniform values (for example, 1 / M).

以上、説明したように、本発明による形状推定装置１０は、入力画像Ｉ_ｎ及び対応するカメラパラメータＣＰ_ｎに基づいて、対象物体の形状を推定することができる。 As described above, the shape estimation apparatus 10 according to the present invention, based on the input image I _n and the corresponding camera parameters CP _n, it is possible to estimate the shape of the target object.

なお、対象物体が移動又は変形しない場合は、時系列で画像を入力する必要はなく、同じ画像を用いてパーティクルの状態ベクトルｓ_ｍ及び重み係数ｗ_ｍが収束するまでパーティクルフィルタ手段３による処理を繰り返すようにすればよい。 In the case where the target object does not move or deform, the time series is not necessary to input an image, the processing by the particle filter unit 3 to the state vector s _m and the weighting factor w _m for the particles to converge with the same image Repeat it.

また、状態ベクトルｓ_ｍに適宜速度成分や加速度成分を導入し、時系列で入力される画像を用いてパーティクルフィルタ手段３による処理を繰り返すことにより、時々刻々移動又は変形する対象物体の、動きを含めた３次元形状の推定を行うことができる。
更に、パーティクルフィルタによって推定した３次元形状に関する情報をメタボール表現へ簡単に変換することができ、推定した３次元形状の外観を確認することができる。 Furthermore, by introducing appropriate velocity component and the acceleration component of the state vector s _m, by repeating the processing by the particle filtering unit 3 by using the image input in time series, of the object to be momentarily moved or modified, the movement The included three-dimensional shape can be estimated.
Furthermore, information regarding the three-dimensional shape estimated by the particle filter can be easily converted into a metaball expression, and the appearance of the estimated three-dimensional shape can be confirmed.

本発明にかかる実施形態の形状推定装置の構成を示したブロック図である。It is the block diagram which showed the structure of the shape estimation apparatus of embodiment concerning this invention. （ａ）は、対象物体をカメラで撮影する様子を示した図、（ｂ）はカメラによって撮影された入力画像、（ｃ）はシルエットである。(A) is the figure which showed a mode that a target object is image | photographed with a camera, (b) is the input image image | photographed with the camera, (c) is a silhouette. パーティクルフィルタ手段の構成を示したブロック図である。It is the block diagram which showed the structure of the particle filter means. パーティクルを生成する様子を模式的に示した図である。It is the figure which showed a mode that the particle | grains were produced | generated typically. パーティクルの重み係数を更新する様子を模式的に示した図である。It is the figure which showed typically a mode that the weighting coefficient of a particle was updated. 再サンプリングの様子を模式的に示した図であり、（ａ）は、再サンプリング前のパーティクルを示し、（ｂ）は、再サンプリング後のパーティクルを示す。It is the figure which showed the mode of resampling typically, (a) shows the particle before resampling, (b) shows the particle after resampling. 再サンプリング手段によるパーティクルの再サンプリングの様子を示した図である。It is the figure which showed the mode of the resampling of the particle | grains by the resampling means. 回転台に置かれた剛体の対象物体の形状推定の様子を示した図である。It is the figure which showed the mode of the shape estimation of the target object of the rigid body placed on the turntable. ほぼ等速度運動する非剛体の対象物体の形状推定の様子を示した図である。It is the figure which showed the mode of the shape estimation of the non-rigid target object which carries out a substantially equal speed motion. パーティクル群からメタボール表現に変換する様子を示した図である。It is the figure which showed a mode that it converted into a metaball expression from a particle group. メタボール表現から推定形状を得る様子を示した図である。It is the figure which showed a mode that an estimated shape was acquired from metaball expression. 図１に示した形状推定装置の処理の流れを示したフローチャートである。It is the flowchart which showed the flow of the process of the shape estimation apparatus shown in FIG. 図３に示したパーティクルフィルタ手段の処理の流れ示したフローチャートである。It is the flowchart which showed the flow of the process of the particle filter means shown in FIG. 図１２に示したフローチャートのパーティクル生成の詳細な手順を示したフローチャートである。It is the flowchart which showed the detailed procedure of the particle | grain generation of the flowchart shown in FIG.

符号の説明Explanation of symbols

２（２_１，２_２，…，２_N）シルエット抽出手段
３パーティクルフィルタ手段
４メタボール生成手段
５モルフォロジ処理手段
１０形状推定装置
２１パーティクル生成手段
２３（２３_１，２３_２，…，２３_Ｎ）加重演算手段（重み係数変更手段）
２４再サンプリング手段
２６状態遷移手段
ＣＰ_１，ＣＰ_２，…，ＣＰ_Ｎカメラパラメータ
Ｄ領域（探索領域）
Ｉ_１，Ｉ_２，…，Ｉ_Ｎ入力画像
Ｏ対象物体（オブジェクト）
Ｏ’ 対象物体像（映像オブジェクト）
Ｐ（Ｐ_１，Ｐ_２）パーティクル
Ｐ_Ａ，Ｐ_Ｂパーティクル群
Ｓ_１，Ｓ_２，…，Ｓ_Ｎシルエット 2 (2 ₁ , 2 ₂ ,..., 2 _N ) Silhouette extraction means 3 Particle filter means 4 Metaball generation means 5 Morphology processing means 10 Shape estimation device 21 Particle generation means 23 (23 ₁ , 23 ₂ ,..., 23 _N ) Weighting Calculation means (weight coefficient changing means)
24 resampling means 26 state transition means _{_{CP 1, CP 2, ...,}} CP N camera parameters D region (search area)
I ₁ , I ₂ ,..., I _N input image O Target object (object)
O 'Target object image (video object)
_P _(P _1, P 2) particles P _A, _{P B} particle group _{_{S 1, S 2, ...,}} S N silhouettes

Claims

オブジェクトが撮影された画像中の映像オブジェクトに基づき、３次元座標を含む状態ベクトルと重み係数とを有した情報であるパーティクルを用いたパーティクルフィルタによって、前記オブジェクトの３次元形状を推定する形状推定装置であって、
前記画像に含まれる前記映像オブジェクトの領域をシルエットとして抽出するシルエット抽出手段と、
２以上である所定数の前記パーティクルに対してパーティクルフィルタによる処理を行うパーティクルフィルタ手段と、
前記パーティクルフィルタ手段によって処理された前記所定数のパーティクルに基づいてメタボール表現による３次元形状モデルを生成するメタボール生成手段と、を備え、
前記パーティクルフィルタ手段は、前記オブジェクトを探索する予め定められた探索領域内を示す３次元座標を有する、前記所定数のパーティクルを生成するパーティクル生成手段と、
前記シルエット抽出手段によって抽出されたシルエットに基づいて、前記パーティクルの重み係数を変更する重み係数変更手段と、
前記重み係数変更手段によって重み係数を変更されたパーティクルを３次元座標について再サンプリングして前記所定数のパーティクルを再生成すると共に、前記再生成した前記所定数のパーティクルの重み係数を均一にする再サンプリング手段と、
前記再サンプリング手段によって再サンプリングされたパーティクルの状態ベクトルを遷移させる状態遷移手段と、を有し、
前記重み係数変更手段と前記再サンプリング手段と前記状態遷移手段とによる前記パーティクルに対する処理を繰り返すことを特徴とする形状推定装置。 A shape estimation device that estimates a three-dimensional shape of an object by a particle filter using particles that are information having a state vector including three-dimensional coordinates and a weighting factor based on a video object in an image in which the object is photographed Because
Silhouette extraction means for extracting a region of the video object included in the image as a silhouette;
Particle filter means for performing processing by a particle filter on a predetermined number of particles equal to or greater than 2,
Metaball generation means for generating a three-dimensional shape model by metaball expression based on the predetermined number of particles processed by the particle filter means,
The particle filter means has three-dimensional coordinates indicating a predetermined search area for searching for the object, and generates a predetermined number of particles;
Weight coefficient changing means for changing the weight coefficient of the particles based on the silhouette extracted by the silhouette extracting means;
The particles whose weighting factor has been changed by the weighting factor changing means are resampled with respect to the three-dimensional coordinates to regenerate the predetermined number of particles, and regenerate the weighting factor of the predetermined number of particles regenerated. Sampling means;
State transition means for transitioning the state vector of the particles resampled by the resampling means,
A shape estimation apparatus characterized by repeating the processing for the particles by the weight coefficient changing means, the resampling means, and the state transition means.

前記メタボール生成手段は、前記重み係数変更手段によって重み係数が変更された前記所定数のパーティクルに基づいて、メタボール表現による３次元形状モデルを生成することを特徴とする請求項１に記載の形状推定装置。 2. The shape estimation according to claim 1, wherein the metaball generation unit generates a three-dimensional shape model based on a metaball representation based on the predetermined number of particles whose weighting factors have been changed by the weighting factor changing unit. apparatus.

前記メタボール生成手段は、前記再サンプリング手段によって再サンプリングされた前記所定数のパーティクルに基づいて、メタボール表現による３次元形状モデルを生成することを特徴とする請求項１に記載の形状推定装置。 The shape estimation apparatus according to claim 1, wherein the metaball generation unit generates a three-dimensional shape model based on a metaball expression based on the predetermined number of particles resampled by the resampling unit.

前記メタボール生成手段によって生成された前記メタボール表現による３次元形状モデルに対して、収縮又は膨張を行う３次元多値モルフォロジ処理によって３次元形状モデルを修正するモルフォロジ処理手段をさらに備えることを特徴とする請求項１乃至請求項３の何れか一項に記載の形状推定装置。 Morphology processing means for correcting the three-dimensional shape model by three-dimensional multi-value morphology processing for contraction or expansion with respect to the three-dimensional shape model by the metaball expression generated by the metaball generation means. The shape estimation apparatus according to any one of claims 1 to 3.

さらに１以上のシルエット抽出手段と、当該シルエット抽出手段に対応する重み係数変更手段とを備え、前記複数の重み係数変更手段は、それぞれ対応するシルエット抽出手段によって抽出されるシルエットに基づいて、前記パーティクルの重み係数を順次変更することを特徴とする請求項１乃至請求項４の何れか一項に記載の形状推定装置。 Furthermore, one or more silhouette extracting means and a weight coefficient changing means corresponding to the silhouette extracting means are provided, and the plurality of weight coefficient changing means are arranged on the basis of the silhouette extracted by the corresponding silhouette extracting means. The shape estimation apparatus according to claim 1, wherein the weight coefficients are sequentially changed.

オブジェクトが撮影された画像中の映像オブジェクトに基づき、３次元座標を含む状態ベクトルと重み係数とを有した情報であるパーティクルを用いたパーティクルフィルタによって、前記オブジェクトの３次元形状を推定するために、
コンピュータを、
前記画像に含まれる前記映像オブジェクトの領域をシルエットとして抽出するシルエット抽出手段、
２以上である所定数の前記パーティクルに対してパーティクルフィルタによる処理を行うパーティクルフィルタ手段、
前記パーティクルフィルタ手段によって処理された前記所定数のパーティクルに基づいてメタボール表現による３次元形状モデルを生成するメタボール生成手段、
として機能させる形状推定プログラムであって、
前記パーティクルフィルタ手段は、前記オブジェクトを探索する予め定められた探索領域内を示す３次元座標を有する、前記所定数のパーティクルを生成するパーティクル生成手段、
前記シルエット抽出手段によって抽出されたシルエットに基づいて、前記パーティクルの重み係数を変更する重み係数変更手段、
前記重み係数変更手段によって重み係数を変更されたパーティクルを３次元座標について再サンプリングして前記所定数のパーティクルを再生成すると共に、前記再生成した前記所定数のパーティクルの重み係数を均一にする再サンプリング手段、
前記再サンプリング手段によって再サンプリングされたパーティクルの状態ベクトルを遷移させる状態遷移手段、を含み、
前記重み係数変更手段と前記再サンプリング手段と前記状態遷移手段とによる前記パーティクルに対する処理を繰り返すことを特徴とする形状推定プログラム。 In order to estimate the three-dimensional shape of the object by a particle filter using particles, which is information having a state vector including three-dimensional coordinates and a weighting coefficient, based on a video object in an image in which the object is photographed,
Computer
Silhouette extraction means for extracting a region of the video object included in the image as a silhouette;
Particle filter means for performing processing by a particle filter on a predetermined number of particles equal to or greater than 2,
Metaball generating means for generating a three-dimensional shape model by metaball expression based on the predetermined number of particles processed by the particle filter means;
A shape estimation program that functions as
The particle filter means has a three-dimensional coordinate indicating a predetermined search area for searching for the object, a particle generation means for generating the predetermined number of particles,
Weight coefficient changing means for changing the weight coefficient of the particles based on the silhouette extracted by the silhouette extracting means;
The particles whose weighting factor has been changed by the weighting factor changing means are resampled with respect to the three-dimensional coordinates to regenerate the predetermined number of particles, and regenerate the weighting factor of the predetermined number of particles regenerated. Sampling means,
State transition means for transitioning a state vector of particles resampled by the resampling means,
A shape estimation program that repeats processing on the particles by the weight coefficient changing means, the resampling means, and the state transition means.