JP6685943B2

JP6685943B2 - Separation matrix design device, filter coefficient calculation device, method thereof, and program

Info

Publication number: JP6685943B2
Application number: JP2017009296A
Authority: JP
Inventors: 健太丹羽; 小林　和則; 和則小林
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2017-01-23
Filing date: 2017-01-23
Publication date: 2020-04-22
Anticipated expiration: 2037-01-23
Also published as: JP2018121108A

Description

本発明は、複数のマイクロホンで構成された音響アレイ装置を用いて、任意の音源信号や位置といった情報を推定、抽出するビームフォーミング技術に関する。 The present invention relates to a beamforming technique for estimating and extracting information such as an arbitrary sound source signal and position using an acoustic array device including a plurality of microphones.

ビームフォーミングとウィーナーフィルタリングを組み合わせた音源強調方式について研究を進めている。ウィーナーフィルタを推定するための方式として、局所PSD(Power Spectral Density、パワースペクトル密度)推定法がある。局所PSD推定法では、線形性の逆写像により、複数のビームフォーミングの出力信号のPSD群から音源のPSD群を推定し、音源のPSD群からウィーナーフィルタを構成する(非特許文献１参照)。 We are researching a sound source enhancement method that combines beamforming and Wiener filtering. As a method for estimating the Wiener filter, there is a local PSD (Power Spectral Density) estimation method. In the local PSD estimation method, the PSD group of the sound source is estimated from the PSD groups of the output signals of the plurality of beamformings by the inverse mapping of the linearity, and the Wiener filter is constructed from the PSD group of the sound source (see Non-Patent Document 1).

K. Niwa, T. Kawase, K. Kobayashi and Y. Hioka, "PSD estimation in beamspace using property of M-matrix", IWAENC2016, 2016.K. Niwa, T. Kawase, K. Kobayashi and Y. Hioka, "PSD estimation in beamspace using property of M-matrix", IWAENC2016, 2016.

従来技術では、ヒューリスティックにビームフォーミングを設計している。そのため、ウィーナーフィルタを構成する過程において、設計したビームフォーミングの特性によって、音源のPSD群の推定精度が劣化してしまうことが分かっている。 In the prior art, beamforming is heuristically designed. Therefore, it is known that the estimation accuracy of the PSD group of the sound source deteriorates due to the characteristics of the designed beamforming in the process of constructing the Wiener filter.

本発明では、ビームフォーミングのフィルタ群を導出する際に用いるPSD分離行列を解析的に設計する分離行列設計装置、解析的に設計されたPSD分離行列を用いて解析的にフィルタ係数を算出するフィルタ係数算出装置、その方法、及びプログラムを提供することを目的とする。 In the present invention, a separation matrix design device for analytically designing a PSD separation matrix used in deriving a beamforming filter group, a filter for analytically calculating filter coefficients using an analytically designed PSD separation matrix An object is to provide a coefficient calculation device, a method thereof, and a program.

上記の課題を解決するために、本発明の一態様によれば、分離行列設計装置は、Mを2以上の整数とし、M個の収音信号を入力とし、L個のビームフォーミング出力値を得るビームフォーミングにおいて、l=1,2,…,Lとし、i=1,2,…,Lとし、PSD分離行列G_L,ωが、l番目のビームフォーマの方向Θ_iに対する感度を要素とするL×Lの非負行列であるPSD混合行列D_L,ωの逆行列となり、かつ、PSD分離行列G_L,ωがM行列となるように、PSD分離行列G_L,ωを設計する分離行列設計部を含む。 In order to solve the above problems, according to one aspect of the present invention, a separation matrix design device, M is an integer of 2 or more, M sound pickup signals as input, L beamforming output values. In the obtained beamforming, l = 1,2, ..., L and i = 1,2, ..., L, and the PSD separation matrix G _{L, ω} has the sensitivity to the direction Θ _i of the l-th beamformer as an element. A separation matrix that designs the PSD separation matrix G _{L, ω} such that it is the inverse of the PSD mixing matrix D _{L, ω} that is a nonnegative L × L matrix, and the PSD separation matrix G _{L, ω} is an M matrix. Including design department.

上記の課題を解決するために、本発明の他の態様によれば、分離行列設計方法は、Mを2以上の整数とし、M個の収音信号を入力とし、L個のビームフォーミング出力値を得るビームフォーミングにおいて、l=1,2,…,Lとし、i=1,2,…,Lとし、PSD分離行列G_L,ωが、l番目のビームフォーマの方向Θ_iに対する感度を要素とするL×Lの非負行列であるPSD混合行列D_L,ωの逆行列となり、かつ、PSD分離行列G_L,ωがM行列となるように、PSD分離行列G_L,ωを設計する分離行列設計ステップを含む。 In order to solve the above problems, according to another aspect of the present invention, a separation matrix design method, M is an integer of 2 or more, M sound pickup signals are input, L beamforming output values. In the beamforming, we obtain l = 1,2, ..., L, i = 1,2, ..., L, and the PSD separation matrix G _{L, ω} is the sensitivity of the l-th beamformer to the direction Θ _i . Design the PSD separation matrix G _{L, ω} such that it is the inverse of the PSD mixing matrix D _{L, ω} that is a non-negative L × L matrix, and the PSD separation matrix G _{L, ω} is an M matrix. Includes matrix design step.

本発明によれば、音源のPSD群の推定精度が高まるようなビームフォーミングのフィルタ群等を解析的に導出することができるという効果を奏する。 Advantageous Effects of Invention According to the present invention, it is possible to analytically derive a beamforming filter group or the like that increases the estimation accuracy of a PSD group of a sound source.

ビームフォーミングとウィーナーポストフィルターを使った音源強調系の機能ブロック図。Functional block diagram of sound source enhancement system using beamforming and Wiener post filter. ビームフォーマと到来方向との関係を示す図。The figure which shows the relationship between a beam former and an arrival direction. 音源PSDを推定方法を説明するための図。The figure for demonstrating the estimation method of sound source PSD. 従来のビームフォーミングフィルタ係数群、PSD混合行列、PSD分離行列を推定するフローを示す図。The figure which shows the flow which estimates the conventional beam forming filter coefficient group, PSD mixing matrix, and PSD separation matrix. 等間隔に離散化されたM方向から平面波が到来することを仮定したとき、H_M,ωを説明するための図。FIG. 6 is a diagram for explaining H _{M, ω} when it is assumed that plane waves arrive from the M direction, which is discretized at equal intervals. Δ_M,ωを多重極展開したときの具体的な例を示す図。The figure which shows the specific example at the time of carrying out multipole expansion of (DELTA) _{M, (omega} ). 第一実施形態に係るフィルタ係数算出装置の機能ブロック図。The functional block diagram of the filter coefficient calculation device concerning a first embodiment. 第一実施形態に係るフィルタ係数算出装置の処理フローの例を示す図。The figure which shows the example of the processing flow of the filter coefficient calculation apparatus which concerns on 1st embodiment. 第二実施形態に係る音声強調装置の機能ブロック図。The functional block diagram of the speech enhancement apparatus which concerns on 2nd embodiment. 第二実施形態に係る音声強調装置の処理フローの例を示す図。The figure which shows the example of the process flow of the speech enhancement apparatus which concerns on 2nd embodiment. 第三実施形態に係る音声強調装置の機能ブロック図。The functional block diagram of the speech enhancement apparatus which concerns on 3rd embodiment. 第三実施形態に係る音声強調装置の処理フローの例を示す図。The figure which shows the example of the process flow of the speech enhancement apparatus which concerns on 3rd embodiment. Ψ_L,ωを変えながら作成したフィルタの指向特性を示す図。The figure which shows the directional characteristic of the filter produced, changing Ψ _{L, ω} . Ψ_L,ωを変えながら作成したフィルタの指向特性を示す図。The figure which shows the directional characteristic of the filter produced, changing Ψ _{L, ω} . Ψ_L,ωを変えながら作成したフィルタの指向特性を示す図。The figure which shows the directional characteristic of the filter produced, changing Ψ _{L, ω} . Ψ_L,ωを変えながら作成したフィルタの指向特性を示す図。The figure which shows the directional characteristic of the filter produced, changing Ψ _{L, ω} .

以下、本発明の実施形態について、説明する。なお、以下の説明に用いる図面では、同じ機能を持つ構成部や同じ処理を行うステップには同一の符号を記し、重複説明を省略する。以下の説明において、ベクトルや行列の各要素単位で行われる処理は、特に断りが無い限り、そのベクトルやその行列の全ての要素に対して適用されるものとする。 Hereinafter, embodiments of the present invention will be described. In the drawings used in the following description, components having the same function and steps for performing the same process are denoted by the same reference numerals, and duplicate description will be omitted. In the following description, the processing performed for each element of a vector or matrix shall be applied to all the elements of the vector or matrix unless otherwise specified.

＜局所PSD推定に基づく音源強調＞
非特許文献１を従来研究とし、局所PSD(Power Spectral Density: パワースペクトル密度)推定に基づく音源強調についての原理とその課題についてより詳細に説明する。 <Sound source enhancement based on local PSD estimation>
With Non-Patent Document 1 as a conventional study, the principle and problem of sound source enhancement based on local PSD (Power Spectral Density) estimation will be described in more detail.

＜ビームフォーミングとウィーナーポストフィルターを使った音源強調＞
図１は、ビームフォーミングとウィーナーポストフィルターを使った音源強調系の機能ブロック図を示す。ビームフォーミング部９１は、M個の観測信号X_m,ω,τを用いて、ビームフォーミングによりビームフォーミング出力信号Y_l,ω,τを求める。ここで、ωとτはそれぞれ周波数、フレーム時間のインデックスを表す。ウィーナーフィルタ推定部９２は、M個のビームフォーミング出力信号Y_l,ω,τを用いて、PSD推定に基づきウィーナーフィルタを推定する。ウィーナーフィルタリング部９３は、ビームフォーミング出力信号Y_l,ω,τに対してウィーナーフィルタを用いてフィルタリングを行い、雑音成分を抑圧した出力信号Z_l,ω,τを得る。 <Sound source enhancement using beamforming and Wiener post filter>
FIG. 1 shows a functional block diagram of a sound source enhancement system using beamforming and a Wiener post filter. The beamforming unit 91 obtains a beamforming output signal Y _{l, ω, τ} by beamforming using the M observation signals X _{m, ω, τ} . Here, ω and τ represent the index of frequency and frame time, respectively. The Wiener filter estimation unit 92 estimates the Wiener filter based on PSD estimation using the M beamforming output signals Y _{l, ω, τ} . The Wiener filtering unit 93 filters the beamforming output signal Y _{l, ω, τ} using a Wiener filter to obtain an output signal Z _{l, ω, τ in} which a noise component is suppressed.

まず、観測信号とビームフォーミング出力信号のモデルについて説明する。次に、局所PSD推定に基づくウィーナーフィルタの推定について説明する。 First, models of the observation signal and the beamforming output signal will be described. Next, the Wiener filter estimation based on the local PSD estimation will be described.

(ビームフォーミング出力信号のモデル化)
M(≧2)本のマイクロホンを用いて観測することを想定する。音場には、K個の音源がある。k番目の音源信号(時間周波数成分)をS_k,ω,τと表す。k番目の音源とm番目のマイクロホンの間の伝達特性をA_m,k,ωと表すとき、m番目の観測信号X_m,ω,τは以下でモデル化される。 (Modeling of beamforming output signal)
It is assumed that observation is performed using M (≧ 2) microphones. There are K sound sources in the sound field. The kth sound source signal (time frequency component) is represented as S _{k, ω, τ} . When the transfer characteristic between the k-th sound source and the m-th microphone is represented by A _{m, k, ω} , the m-th observation signal X _{m, ω, τ} is modeled as follows.

ビームフォーミングは、指向性を制御するためによく使われる技術である。l番目のビームフォーマのm番目のマイクロホンに対する係数をW_l,m,ωと書くとき、l番目のビームフォーマの出力信号Y_l,ω,τは以下で計算される。 Beamforming is a technique often used to control directivity. When the coefficient of the l-th beamformer for the m-th microphone is written as W _{l, m, ω} , the output signal Y _{l, ω, τ} of the l-th beamformer is calculated as follows.

ここで、D_l,k,ωと＊は、l番目のビームフォーマのk番目の音源に対する感度と複素共役演算をそれぞれ表す。 Here, D _{l, k, ω} and * represent the sensitivity and the complex conjugate calculation of the l-th beamformer with respect to the k-th sound source, respectively.

(局所PSD推定に基づくウィーナーフィルタ推定)
音源信号が互いに無相関であることを仮定すると、l番目のビームフォーマの出力信号のPSDは以下でモデル化される。 (Wiener filter estimation based on local PSD estimation)
Assuming that the source signals are uncorrelated with each other, the PSD of the output signal of the l-th beamformer is modeled below.

ここで、<・>とφ_{S_k,ω}は、それぞれ期待値演算子(例えば、時間平均化処理で置き換えることができる)、k番目の音源信号のPSDを表す。なお、下付き添え字S_k,ωはS_k,ωを意味する。式(3)の関係が、L個の方向(ある程度の角度幅をもった領域){Θ₁,…,Θ_L}において音源群をグループ化した際にも成立するのであれば、φ_{Y_l,ω}(下付き添え字Y_l,ωはY_l,ωを意味する）は以下で近似して表される。 Here, <·> and φ _{S_k, ω} represent an expected value operator (for example, can be replaced by a time averaging process) and a PSD of the kth sound source signal. The subscript S_k, ω means S _k, ω. If the relation of equation (3) holds even when the sound source groups are grouped in _L directions (regions having a certain angular width) {Θ ₁ , ..., Θ _L }, then φ _{Y_l, ω} (subscript Y_l, ω means Y _l, ω) is approximated as follows.

ここで、D_{l,Θ_i,ω}(下付き添え字l,Θ_i,ωはl,Θ_i,ωを意味する）はl番目のビームフォーマの方向Θ_iに対する感度であり、φ_{S_Θ_i,ω}(下付き添え字S_Θ_i,ωはS_{Θ_i},ωを意味し、この中の下付き添え字Θ_iはΘ_iを意味する）は方向Θ_iを中心としたある程度の角度幅における音源群の加算値のPSD(音源PSDと以後、呼ぶ)である。φ_{S_Θ_i,ω}、φ_{Y_l,ω}の関係は以下で表される。 Where D _{l, Θ_i, ω} (subscript l, Θ_i, ω means l, Θ _i, ω) is the sensitivity of the l-th beamformer with respect to direction Θ _i , and φ _{S_Θ_i, ω} ( The subscript S_Θ_i, ω means S _{Θ_i} , ω, where the subscript Θ_i means Θ _i ) is the sum of the sound source groups in a certain angle width around the direction Θ _i . PSD (hereinafter referred to as sound source PSD). The relationship between φ _{S_Θ_i, ω} and φ _{Y_l, ω} is expressed as follows.

行列D_L,ωのことを以後、PSD混合行列と呼ぶことにする。PSD混合行列D_L,ωは、対角成分が0より大きく、非対角成分が0以上のL×Lの非負行列である。 The matrix D _{L, ω} will be hereinafter referred to as a PSD mixing matrix. The PSD mixing matrix D _{L, ω} is an L × L non-negative matrix whose diagonal components are larger than 0 and whose off-diagonal components are 0 or more.

音源PSD(Φ_{S_ω}(下付き添え字S_ωはS_ωを意味する）)を求めるために、式(5)の逆問題を解く必要がある。局所PSD推定法では、Φ_{S_ω}は以下で推定される。 In order to obtain the sound source PSD (Φ _{S_ω} (subscript S_ω means S _ω )), it is necessary to solve the inverse problem of equation (5). In the local PSD estimation method, Φ _{S — ω} is estimated as

以後、G_L,ωをPSD分離行列と呼ぶことにする。PSD分離行列の構造は以下でモデル化される。 Hereinafter, G _{L, ω} will be referred to as the PSD separation matrix. The structure of the PSD separation matrix is modeled below.

つまり、G_L,ω・D_L,ω＝I_Lであり、PSD分離行列G_L,ωは、対角成分が0より大きく、非対角成分が0以下のL×Lの行列である。実用上、効果のある方法として、音源PSDを時間フレームごとに求めることもできる。これは、音源の時間スパース性が高いことを想定しており、以下で実行される。 That is, G _{L, ω} · D _{L, ω} = I _L , and the PSD separation matrix G _{L, ω} is an L × L matrix in which the diagonal component is larger than 0 and the non-diagonal component is 0 or less. As a practically effective method, the sound source PSD can be obtained for each time frame. This is assumed to be high in time sparseness of the sound source, and will be performed below.

l番目の音源群を強調するためのウィーナーポストフィルタは以下で計算される。 The Wiener post filter for enhancing the l-th sound source group is calculated below.

これをビームフォーミング出力信号Y_l,ω,τに畳み込むことにより、出力信号Z_l,ω,τを得る。 The output signal Z _{l, ω, τ} is obtained by convolving this with the beamforming output signal Y _{l, ω, τ} .

(音源PSDを高精度に推定するためにPSD分離行列に求められる条件)
音源PSDを高精度に推定するためにPSD分離行列G_L,ωに求められる条件について、まず仮説を立てる。次に、その仮説を満たすためにPSD分離行列G_L,ωに求められる条件について明確化する。 (Conditions required for PSD separation matrix to estimate source PSD with high accuracy)
First, a hypothesis is made about the condition required for the PSD separation matrix G _{L, ω} to estimate the sound source PSD with high accuracy. Next, the conditions required for the PSD separation matrix G _{L, ω} to satisfy the hypothesis are clarified.

(仮説)
図２に示すように、l番目のビームフォーマは、方向Θ_lから到来する音源を強調すると想定する。式(6)と(7)より、l番目の音源PSDを計算する過程は以下で書き下すことができる。 (hypothesis)
As shown in FIG. 2, it is assumed that the l-th beamformer emphasizes the sound source coming from the direction Θ _l . From equations (6) and (7), the process of calculating the l-th sound source PSD can be described below.

ビームフォーミング出力信号Y_l,ω,τには、l番目の音源群を強調した信号と抑圧はされているものの周囲の雑音とが混在する。そのとき、φ_{Y_l,ω,τ}(下付添え字Y_l,ω,τはY_l,ω,τを意味する)を構成する要素のうちl番目の音源PSDが支配的であり、式(11)の第二項目の演算は、周囲の雑音を減算する役割を担う。図３に示すように、φ_{Y_l,ω,τ}から周囲雑音を減算するためには、PSD分離行列の対角項λ_l,ωが正の実数で(強調する役割)、非対角項γ_l,i,ωは0以下の実数(減算する役割)になるべきである。 In the beamforming output signal Y _{l, ω, τ} , a signal in which the l-th sound source group is emphasized and ambient noise that has been suppressed are mixed. At that time, the l-th sound source PSD is dominant among the elements forming φ _{Y_l, ω, τ} (subscript Y_l, ω, τ means Y _l, ω, τ), and the equation (11 The operation of the second item of) plays a role of subtracting ambient noise. As shown in FIG. 3, in order to subtract the ambient noise from φ _{Y_l, ω, τ} , the diagonal term λ _{l, ω} of the PSD separation matrix is a positive real number (the role of emphasis) and the off-diagonal term γ. _{l, i, ω} should be a real number less than 0 (the role of subtraction).

だから、高精度に音源PSDを推定するための仮説は、式(12)を満たすPSD分離行列G_L,ωを設計することで表される。 Therefore, the hypothesis for estimating the sound source PSD with high accuracy is represented by designing the PSD separation matrix G _{L, ω} that satisfies Expression (12).

PSD分離行列G_L,ωの対角項は正、非対角項は負であり、その逆行列(つまり、PSD混合行列D_L,ω)が定義通りに非負の行列となるとき、PSD分離行列G_L,ωは、M行列と呼ばれる。故に、上述した高精度に音源PSDを推定するための仮説は、G_L,ωをM行列になるように設計するという言葉で言い換え可能である。 When the diagonal term of the PSD separation matrix G _{L, ω} is positive and the off-diagonal term is negative, and its inverse (that is, the PSD mixing matrix D _{L, ω} ) is a nonnegative matrix as defined, the PSD separation is The matrix G _{L, ω} is called the M matrix. Therefore, the above-mentioned hypothesis for estimating the sound source PSD with high accuracy can be paraphrased by the term of designing G _{L, ω} to be an M matrix.

(PSD分離行列がM行列となるための条件の導出)
PSD分離行列G_L,ωがM行列となるための条件を導出する。PSD分離行列G_L,ωの逆行列を解析的にモデル化するために、以下のように行列を分解する。 (Derivation of conditions for PSD separation matrix to be M matrix)
The condition for the PSD separation matrix G _{L, ω} to become an M matrix is derived. In order to analytically model the inverse matrix of the PSD separation matrix G _{L, ω} , the matrix is decomposed as follows.

ここで、Λ_L,ω正の対角行列、B_L,ωは非負の非対角行列、I_Lは単位行列である。 Here, Λ _{L, ω is a} positive diagonal matrix, B _{L, ω} is a non-negative non-diagonal matrix, and I _L is an identity matrix.

PSD分離行列G_L,ωの逆行列は以下で計算される。 The inverse matrix of the PSD separation matrix G _{L, ω} is calculated as follows.

Λ^-1 _L,ωは非負行列であるから、PSD混合行列D_L,ωが定義通りに非負行列になるための条件は、(I_L-B_L,ω)^-1が非負行列になることである。 Since Λ ^-1 _{L, ω} is a non-negative matrix, the condition for the PSD mixing matrix D _{L, ω} to be a non-negative matrix as defined is that (I _L -B _{L, ω} ) ^-1 is a non-negative matrix. Is.

(I_L-B_L,ω)^-1が非負行列になるための条件を導出する。ノイマン級数展開することで以下のように展開される。 We derive the condition for (I _L -B _{L, ω} ) ^{-1 to} be a non-negative matrix. It is expanded as follows by expanding the Neumann series.

なお、式(18)が成り立つ条件は、B_L,ωのスペクトルノルム(行列の特異値の絶対値の最大値)が1未満になることである。このことは、以下の式で表される。 The condition that Eq. (18) holds is that the spectral norm of B _{L, ω} (maximum absolute value of singular value of matrix) is less than 1. This is expressed by the following equation.

ここで、σ_i,ωは、B_L,ωのi番目の特異値を表す。まとめると、PSD分離行列G_L,ωがM行列になるための条件は、式(19)で表される。つまり、B_L,ωのスペクトルノルムが1未満になればいい。 Here, σ _{i, ω} represents the i-th singular value of B _{L, ω} . In summary, the condition for the PSD separation matrix G _{L, ω} to become an M matrix is expressed by equation (19). That is, the spectral norm of B _{L, ω} should be less than 1.

(従来法の処理フロー)
非特許文献１では、B_L,ωの解析的なモデル化が難しいことから、式(19)の関係が成り立つと仮定して、PSD混合行列D_L,ωとPSD分離行列G_L,ωの算出方法について提案した。簡略的ではあるが、ビームフォーミングフィルタ係数群W_L,ω、PSD混合行列D_L,ω、PSD分離行列G_L,ωを推定するフローについて図４に載せる。 (Processing flow of conventional method)
In Non-Patent Document 1, since it is difficult to analytically model B _{L, ω} , it is assumed that the relationship of Expression (19) holds and the PSD mixing matrix D _{L, ω} and the PSD separation matrix G _{L, ω} are The calculation method was proposed. Although simple, the flow for estimating the beamforming filter coefficient group W _{L, ω} , the PSD mixing matrix D _{L, ω} , and the PSD separation matrix G _{L, ω} is shown in FIG.

まず、最小分散無歪応答法(参考文献１)により、ビームフォーミングフィルタを設計する（Ｓ１００）。
(参考文献１)浅野太，「音のアレイ信号処理」,コロナ社,2011, pp.86-90.
なお、L個のビームフォーマの焦点はそれぞれ異なる方向{Θ₁,…,Θ_L}であるとし、それらの係数は行列W_L,ωに格納されるとする。 First, a beamforming filter is designed by the minimum variance distortion-free response method (reference document 1) (S100).
(Reference 1) Futoshi Asano, "Sound Array Signal Processing," Corona Publishing Co., Ltd., 2011, pp.86-90.
Note that the focal points of the L beam formers are in different directions {Θ ₁ , ..., Θ _L }, and their coefficients are stored in the matrix W _{L, ω} .

次に、行列W_L,ωを用いてPSD混合行列を計算する(Ｓ１０１)。 Next, a PSD mixing matrix is calculated using the matrix W _{L, ω} (S101).

ここで、H_L,ωはアレイマニフォールドベクトル(自由音場にアレイを設置したときに得られるであろう伝達特性の模擬値)である。 Here, H _{L, ω} is an array manifold vector (a simulated value of a transfer characteristic that would be obtained when the array is installed in a free sound field).

なお、H_{m,Θ_l,ω}(下付き添え字m,Θ_l,ωはm,Θ_l,ωを意味する)は、m番目のマイクロホンに対して方向Θ_lから音が到来したときの自由音場における伝達特性の模擬値を表し、事前に計算可能である。計算されたΔ_L,ωのパワーを式(5)のように計算することで、PSD混合行列D_L,ωを求め、出力する。 H _{m, Θ_l, ω} (subscript m, Θ_l, ω means m, Θ _l , ω) is the free sound when the sound arrives from the direction Θ _l to the m-th microphone. It represents the simulated value of the transfer characteristic in the field and can be calculated in advance. The PSD mixing matrix D _{L, ω} is obtained and calculated by calculating the calculated power of Δ _{L, ω} as in Expression (5).

最後に、PSD分離行列G_L,ωを近似計算する(Ｓ１０２)。式(19)の関係が成り立つと仮定し、Λ_L,ω=I_Lとするのであれば、PSD混合行列D_L,ωを以下のように近似的にモデル化することができる。 Finally, the PSD separation matrix G _{L, ω} is approximately calculated (S102). Assuming that the relationship of Expression (19) holds, and assuming Λ _{L, ω} = I _L , the PSD mixing matrix D _{L, ω} can be approximately modeled as follows.

PSD混合行列D_L,ωは既に計算されているのだから、Γ_L,ωを近似的に求めることはできる。その近似したΓ_L,ωを使えば、PSD分離行列G_L,ωもまた近似的に求めることができる。 Since the PSD mixing matrix D _{L, ω} has already been calculated, Γ _{L, ω} can be approximately calculated. By using the approximated Γ _{L, ω} , the PSD separation matrix G _{L, ω} can also be approximately _calculated .

一連の操作により、目的の量(W_L,ω、D_L,ω、G_L,ω)を求めることはできたが、近似操作を伴うために、PSDの推定誤差を増大させてしまうという欠点があった。これが、従来技術の課題の詳細である。 Although it was possible to obtain the target quantities (W _{L, ω} , D _{L, ω} , G _{L, ω} ) by a series of operations, the drawback is that the PSD estimation error increases because of the approximation operation. was there. This is the detail of the problems of the prior art.

(本実施形態の原理)
まず、音源PSDを高精度に推定するためのPSD混合/分離行列の設計方式に関する原理を述べる。次に、設計したビームパターンに対応したフィルタ係数を解析的に導出するための方法について述べる。 (Principle of this embodiment)
First, we describe the principle of the PSD mixing / separation matrix design method for estimating the source PSD with high accuracy. Next, a method for analytically deriving the filter coefficient corresponding to the designed beam pattern will be described.

(音源PSDを高精度に推定するためのPSD混合/分離行列の設計)
B_L,ωの特異値を解析的にモデル化するために、PSD混合/分離行列の構造を巡回性を持つ行列（以下、巡回行列ともいう）になるように拘束をかける。この物理的意味合いについて、まず説明する。次に、PSD分離行列G_L,ωが巡回M行列となるような条件を導出する。 (Design of PSD mixture / separation matrix for estimating source PSD with high accuracy)
In order to analytically model the singular value of B _{L, ω} , the structure of the PSD mixing / separation matrix is constrained to be a cyclic matrix (hereinafter also referred to as a cyclic matrix). This physical meaning will be described first. Next, the condition that the PSD separation matrix G _{L, ω} becomes a cyclic M matrix is derived.

(対称性ビームフォーマを設計した場合のPSD混合/分離行列のモデル化)
B_L,ωの特異値を解析的にモデル化するための一つの方法として、PSD混合行列D_L,ωが巡回行列となるように制約を課すこととした。 (Modeling PSD mixing / separation matrix when designing a symmetric beamformer)
As one method for analytically modeling the singular value of B _{L, ω} , a constraint is imposed so that the PSD mixing matrix D _{L, ω} becomes a cyclic matrix.

式(25)を構成するL個のビームフォーマの特性は基本的に同じで、回転させているだけであることが分かる。式(25)のように、PSD混合行列D_L,ωが巡回行列になるためには、M個のマイクロホンの空間的配置の中心を通るある回転軸に対して1回転させたときに、M個のマイクロホンの空間的配置と同一の空間的配置が複数回現れるようにM個のマイクロホンが配置される必要がある。言い換えると、回転対称性を満たすように、M個のマイクロホンが配置される必要がある。例えば、等間隔の円形アレイを用いて音場を観測し、対称性を持たせてビームフォーマを設計することで達成されるだろう。他にも、正多面体の頂点にマイクロホンを配置する方法や、それらを多重化して配置する(例えば、二つの同心円を用いて、大きい円形アレイと、小さい円形アレイとを用いて二重化して配置する)方法が考えられる。このとき、マイクロホンは、バッフルに取り付けてもよいし、バッフルを設けずに、例えば、糸で吊るしたり、細い棒で固定して、空中に配置してもよい。 It can be seen that the characteristics of the L beamformers that make up equation (25) are basically the same, only rotating. As shown in Eq. (25), in order for the PSD mixing matrix D _{L, ω} to become a cyclic matrix, when one rotation is performed with respect to a rotation axis passing through the center of the spatial arrangement of M microphones, M It is necessary to arrange M microphones so that the same spatial arrangement as that of the individual microphones appears multiple times. In other words, M microphones need to be arranged so as to satisfy the rotational symmetry. For example, it may be achieved by observing the sound field using an evenly spaced circular array and designing the beamformer with symmetry. Alternatively, a method of arranging the microphones at the vertices of the regular polyhedron or arranging them in a multiplexed manner (for example, using two concentric circles, a large circular array and a small circular array are arranged in duplicate) ) A method is possible. At this time, the microphone may be attached to the baffle, or may be placed in the air without being provided with the baffle, for example, hung with a thread or fixed with a thin rod.

式(19)の条件が成立することをここでは仮定すると、式(17)、(18)より、D_L,ωは以下のようにモデル化される。 Assuming that the condition of Expression (19) is satisfied here, from Expressions (17) and (18), D _{L, ω} is modeled as follows.

PSD混合行列D_L,ωが巡回行列であるとき、PSD混合行列D_L,ωを構成するB_L,ωとΛ_L,ωもまた巡回行列になることが知られている。 When PSD mixing matrix D _{L, omega} is circulant matrix, PSD mixing matrix D _L, B constitutes the _omega _{L, omega} and lambda _{L, omega} is also known to be a circulant matrix.

ここで、α_ωは任意の正の実数である。 Here, α _ω is an arbitrary positive real number.

巡回行列の基本的な性質より、解析対象の行列B_L,ωの固有ベクトルは、離散フーリエ変換の基底行列(DFT基底行列)で構成されることが知られている。 It is known that the eigenvector of the matrix B _{L, ω to be} analyzed is composed of the basis matrix (DFT basis matrix) of the discrete Fourier transform from the basic property of the cyclic matrix.

ここで、^Hは共役転置を表す。F_LはDFT基底行列であり、ωとは独立である。 Here, ^H represents the conjugate transpose. F _L is a DFT basis matrix and is independent of ω.

固有値行列Ψ_L,ωはDFTの出力値で構成されることになる。 The eigenvalue matrix Ψ _{L, ω} will be composed of the output values of the DFT.

式(19)の条件を満たすとき、B_L,ωとΛ_L,ωは解析的にモデル化できることが分かった。その時、PSD混合行列D_L,ωやPSD分離行列G_L,ωもまた解析的にモデル化できる。式(29)より、PSD混合行列D_L,ωの構成要素である(I_L-B_L,ω)-¹は、以下でモデル化できる。 It was found that B _{L, ω} and Λ _{L, ω} can be analytically modeled when the condition of Eq. (19) is satisfied. At that time, the PSD mixing matrix D _{L, ω} and the PSD separation matrix G _{L, ω} can also be analytically modeled. From Equation (29), (I _L- B _{L, ω} ) ⁻¹ _, which is a component of the PSD mixing matrix D _{L, ω} , can be modeled as follows.

ここで、 here,

である。式(13)と(26)より、PSD混合行列D_L,ωとPSD分離行列G_L,ωは以下でモデル化される。 Is. From the equations (13) and (26), the PSD mixing matrix D _{L, ω} and the PSD separating matrix G _{L, ω} are modeled as follows.

式(36)より、PSD分離行列G_L,ωは巡回性を持つM行列(巡回M行列)になることが分かる。 From Equation (36), it can be seen that the PSD separation matrix G _{L, ω} becomes an M matrix having cyclicity (cyclic M matrix).

(PSD分離行列G_L,ωが巡回M行列となるための条件の導出)
PSD分離行列G_L,ωが巡回M行列となるための条件を導出する。B_L,ωは非負非対角行列であるから、DFT出力成分の一つである直流成分ψ_1,ωが{ψ_1,ω,…,ψ_L,ω}の中で最大値をであることは明らかである。 (Derivation of conditions for the PSD separation matrix G _{L, ω to} become a cyclic M matrix)
The condition for the PSD separation matrix G _{L, ω to} become a cyclic M matrix is derived. Since B _{L, ω} is a non-negative non-diagonal matrix, the DC component ψ _{1, ω} that is one of the DFT output components has the maximum value in {ψ _{1, ω} , ..., ψ _{L, ω} }. That is clear.

f⁰ _L=1であるので、ψ_1,ωは解析的に計算できる。 Since f ⁰ _L = 1, ψ _{1, ω} can be calculated analytically.

よって、式(19)にあるB_L,ωのスペクトルノルムは以下で書ける。 Therefore, the spectral norm of B _{L, ω} in equation (19) can be written as

故に、PSD分離行列G_L,ωを巡回M行列にするという条件は、ψ_1,ωを1未満にすることで達成される。 Therefore, the condition that the PSD separation matrix G _{L, ω} is a cyclic M matrix is achieved by setting ψ _{1, ω} to be less than 1.

ψ_1,ωを1未満にするための条件を導出する。B_L,ωの対角項は全て0であることと、F_LB_L,ωF^H _L=Ψ_L,ωであることから、１つの条件は以下で書き表すことができる。 We derive the conditions for making ψ _{1, ω} less than 1. Since the diagonal terms of B _{L, ω} are all 0 and F _L B _{L, ω} F ^H _L = Ψ _{L, ω} , one condition can be expressed as follows.

B_L,ωのスペクトルノルムは1未満であるべきなので、式(40)の条件は以下のように書き直すことができる。 Since the spectral norm of B _{L, ω} should be less than 1, the condition in Eq. (40) can be rewritten as:

その上で、{ψ_1,ω,…,ψ_L,ω}は、実数系列{0,B_2,ω,…,B_L,ω}を入力した際にのDFT出力群であることから、式(42)の関係は成り立つ。 Moreover, since {ψ _{1, ω} , ..., ψ _{L, ω} } is the DFT output group when the real number sequence {0, B _{2, ω} , ..., B _{L, ω} } is input, The relationship of equation (42) holds.

まとめると、G_L,ωが巡回M行列となるための条件は、式(41)と(42)で表される。式(42)の複素共役の関係より、Ψ_L,ωを決めるための自由度は In summary, the conditions for G _{L, ω to} become a cyclic M matrix are expressed by equations (41) and (42). From the complex conjugate relation of equation (42), the degree of freedom for determining Ψ _{L, ω} is

に限定される。 Limited to

数値例を示すことで理解の促進を図りたい。L=6として、PSD混合行列D_L,ωやPSD分離行列G_L,ωを具体的に計算する。式(41)の条件を満たす中で、 We would like to promote understanding by showing numerical examples. When L = 6, the PSD mixing matrix D _{L, ω} and the PSD separation matrix G _{L, ω} are specifically calculated. While satisfying the condition of Expression (41),

個の要素を任意に設計する。 Design each element arbitrarily.

ここで、ψ_2,ω,ψ_3,ω,ψ_4,ωの選択肢として複素数でも構わないのであるが、ここでは実数を使用した。式(42)より、残りの成分のいくつかは自動的に決まる。 Here, a complex number may be used as an option of ψ _{2, ω} , ψ _{3, ω} , ψ _{4, ω} , but here, a real number is used. From equation (42), some of the remaining components are automatically determined.

最後に、式(41)によりψ_1,ωを計算する。 Finally, ψ _{1, ω} is calculated by the equation (41).

ψ_1,ωが1未満であるという条件を満たすので、式(35)や(36)を使って、D_L,ωやG_L,ωを解析的に設計することが可能である。Λ_L,ω=I_Lとして、D_L,ω、G_L,ωはそれぞれ以下で計算される。 Since the condition that ψ _{1, ω} is less than 1 is satisfied, it is possible to analytically design D _{L, ω} and G _{L, ω} using equations (35) and (36). Assuming Λ _{L, ω} = I _L , D _{L, ω} and G _{L, ω} are calculated as follows.

ここで、diag(・)は対角行列化する演算子である。PSD分離行列G_L,ωが巡回M行列であることが確認できる。 Here, diag (·) is a diagonal matrix operator. It can be confirmed that the PSD separation matrix G _{L, ω} is a cyclic M matrix.

(解析的なビームフォーミングのフィルタ係数の導出方法)
まず、式(35)に従うようなビームパターンを得るためのフィルタ係数の導出方法について、説明する。次に設計したビームフォーミングの物理的な性質について調査を行う。 (Analytical beamforming filter coefficient derivation method)
First, a method of deriving a filter coefficient for obtaining a beam pattern according to Expression (35) will be described. Next, we investigate the physical properties of the designed beamforming.

(ビームフォーミングのフィルタ係数の導出)
ビームフォーミングのフィルタ係数を解析的に導出するために、MがL以上であること(M≧L)を前提とする。ビームパターン(ビームフォーマの感度)Δ_L,ω、フィルタ係数行列W_L,ω、アレイマニフォールドベクトル(自遊空間における伝達特性の模擬)H_L,ωの関係は以下で定義される。 (Derivation of filter coefficients for beamforming)
In order to analytically derive the beamforming filter coefficient, it is assumed that M is L or more (M ≧ L). The relationship between the beam pattern (sensitivity of the beam former) Δ _{L, ω} , the filter coefficient matrix W _{L, ω} , and the array manifold vector (simulation of the transfer characteristic in the free-play space) H _{L, ω} is defined below.

ここで、 here,

である。Δ_L,ωは、PSD混合行列D_L,ωの各要素の平方根を求めることで求められる。また、PSD混合行列D_L,ωが巡回行列であれば、Δ_L,ωもまた巡回行列であるので、以下のように表現できる。 Is. Δ _{L, ω} is obtained by obtaining the square root of each element of the PSD mixing matrix D _{L, ω} . Further, if the PSD mixing matrix D _{L, ω} is a cyclic matrix, Δ _{L, ω} is also a cyclic matrix, and thus can be expressed as follows.

ここで、 here,

である。L=M、かつH_L,ωがフルランクの行列であるとき、フィルタ係数行列W_L,ωは以下で計算される。 Is. When L = M and H _{L, ω} is a full rank matrix, the filter coefficient matrix W _{L, ω} is calculated as follows.

しかしながら、L<Mであるとき、少し工夫が必要である。それは、H_L,ωの疑似逆行列は正確に求まらないためである。この問題を解決するための基本的なアイデアは、L個のビームパターンの特性を保持したままで、Δ_L,ωの次元をLからMへ上げることである。このアップコンバージョン処理を通じて、W_L,ω、Δ_L,ω、H_L,ωはM次元の正方行列W_M,ω、Δ_M,ω、H_M,ωになる。アップコンバージョンされたΔ_M,ωは以下でモデル化される。 However, when L <M, some work needs to be done. This is because the pseudo inverse matrix of H _{L, ω} cannot be obtained accurately. The basic idea to solve this problem is to increase the dimension of Δ _{L, ω} from L to M while maintaining the characteristics of L beam patterns. Through this up-conversion process, W _{L, ω} , Δ _{L, ω} , and H _{L, ω become} M-dimensional square matrices W _{M, ω} , Δ _{M, ω} , and H _{M, ω} . The up-converted Δ _{M, ω} is modeled below.

ここで、 here,

である。Υ_Δ,L,ωやΥ_Δ,M,ωは、Δ_L,ωやΔ_M,ωのDFT出力値により構成されるのだから、L個のビームパターンの特性を維持したままに、それらを変換することは容易に達成できる。Lが偶数であるとき、Δ_M,ωは以下でモデル化される。 Is. Υ _{Δ, L, ω} and Υ _{Δ, M, ω} are composed of the DFT output values of Δ _{L, ω} and Δ _{M, ω} , so they can be changed while maintaining the characteristics of the L beam patterns. Converting can be easily accomplished. When L is even, Δ _{M, ω} is modeled below.

Lが奇数であるとき、以下の変換が成り立つ。 When L is odd, the following transformations hold.

上述の(対称性ビームフォーマを設計した場合のPSD混合/分離行列のモデル化)で述べたように、ここでは、Δ_M,ωを巡回行列とするために、等間隔の円形アレイで観測することを前提とする。図５に示すように、等間隔に離散化されたM方向から平面波が到来することを仮定したとき、H_M,ωは以下でモデル化される。 As described above (modeling of PSD mixing / separation matrix when designing a symmetric beamformer), here we observe with a circular array of equal intervals in order to make Δ _{M, ω} a cyclic matrix. It is assumed that. As shown in FIG. 5, assuming that a plane wave comes from the M direction, which is discretized at equal intervals, H _{M, ω} is modeled as follows.

ここで、τ_l,mはl番目の方向から平面波が到来したときのm番目のマイクロホンと円形アレイの中心の間の遅延時間差であり、以下で計算可能である。 Here, τ _{l, m} is the delay time difference between the m-th microphone and the center of the circular array when the plane wave arrives from the l-th direction, and can be calculated as follows.

ここで、dとcはそれぞれ、アレイ半径と音速を表す。τ_l,mはインデックスの差(l-m)にのみ依存するので、H_M,ωは巡回行列になる。故に以下のように分解することが可能である。 Here, d and c represent the array radius and the speed of sound, respectively. Since τ _{l, m} depends only on the index difference (lm), H _{M, ω} becomes a cyclic matrix. Therefore, it can be decomposed as follows.

ここで、 here,

である。式(57)、(58)、(64)より、L<Mであるときのビームフォーミングフィルタは以下により解析的に計算することができる。 Is. From equations (57), (58), and (64), the beamforming filter when L <M can be analytically calculated as follows.

(設計したビームフォーマの物理特性の調査)
設計したビームフォーマの特性を調査するために、Δ_M,ωを以下のように、多重極展開した。 (Investigation of physical properties of designed beam former)
In order to investigate the characteristics of the designed beam former, Δ _{M, ω} was multipole expanded as follows.

ここで、Mは偶数であることを想定した。Mが奇数であるときは、以下で展開される。 Here, it is assumed that M is an even number. When M is odd, it is expanded by

Ω_M,l,ωに含まれるビームパターンは、2(l-1)番目の多重極に対応する。l=1であるとき、Ω_M,1,ωはモノポールの指向性で構成される。 The beam pattern contained in Ω _{M, l, ω} corresponds to the 2 (l-1) th multipole. When l = 1, Ω _{M, 1, ω} is composed of monopole directivity.

ここで、 here,

である。Ω_M,2,ωはダイポールで構成される。 Is. Ω _{M, 2, ω} is composed of a dipole.

ここで、 here,

である。Ω_M,l,ωの一般形は以下で書き表すことができる。 Is. The general form of Ω _{M, l, ω} can be written as

ここで、ν_l,ωは、Mが偶数の場合、奇数の場合のそれぞれにおいて、式(74)、または式(75)で計算することができる。 Here, ν _{l, ω} can be calculated by the formula (74) or the formula (75) in the case where M is even and the case where M is odd.

図６に、Δ_M,ωを多重極展開したときの具体的な例を示す。なお、ここで使用したPSD混合行列D_L,ωの数値は、式(49)と同じである。この図より、多重極の重みづけ加算により、設計したフィルタの指向特性が書き表されることが分かる。また、離散ではなく、連続的な指向特性が予測可能であることも分かる。多重極に対応した指向特性を広い周波数バンドに渡って得るために、マイクロホン間の距離を小さくしたほうがいい。 FIG. 6 shows a specific example of multipole expansion of Δ _{M, ω} . The numerical values of the PSD mixing matrix D _{L, ω} used here are the same as in Expression (49). From this figure, it can be seen that the directional characteristics of the designed filter are written by weighted addition of multiple poles. It can also be seen that continuous directional characteristics can be predicted instead of being discrete. In order to obtain directional characteristics corresponding to multiple poles over a wide frequency band, it is better to reduce the distance between microphones.

＜第一実施形態＞
提案法の原理をまとめると、式(7)のようにD_L,ωG_L,ω=I_Lという拘束下で、PSD分離行列G_L,ωがM行列となるようなPSD混合/分離行列の設計法とそれに対応したビームフォーミングフィルタの設計法であった。なお、従来方式では、D_L,ωG_L,ω=I_Lという関係は近似されているので成立しないし、ビームフォーミングフィルタの導出も解析的ではなかった。 <First embodiment>
To summarize the principle of the proposed method, the PSD mixture / separation matrix such that the PSD separation matrix G _{L, ω} becomes the M matrix under the constraint of D _{L, ω} G _{L, ω} = I _L as shown in Eq. (7). And the corresponding beamforming filter design method. Note that in the conventional method, the relationship of D _{L, ω} G _{L, ω} = I _L is approximated and does not hold, and the derivation of the beamforming filter was not analytical.

図７は第一実施形態に係るフィルタ係数算出装置２００の機能ブロック図を、図８はその処理フローを示す。 FIG. 7 shows a functional block diagram of the filter coefficient calculation device 200 according to the first embodiment, and FIG. 8 shows its processing flow.

フィルタ係数算出装置２００は、CPUと、RAMと、以下の処理を実行するためのプログラムを記録したROMを備えたコンピュータで構成され、機能的には次に示すように構成されている。 The filter coefficient calculation device 200 is configured by a computer including a CPU, a RAM, and a ROM recording a program for executing the following processing, and is functionally configured as shown below.

フィルタ係数算出装置２００は、パラメータ設計部２０１と、分離行列設計部２０２と、フィルタ係数算出部２０３とを含む。 The filter coefficient calculation device 200 includes a parameter design unit 201, a separation matrix design unit 202, and a filter coefficient calculation unit 203.

＜パラメータ設計部２０１＞
パラメータ設計部２０１は、PSD分離行列GのパラメータΨ_L,ωが以下の条件を満たすようにパラメータΨ_L,ωを設計し（Ｓ２０１）、出力する。 <Parameter design unit 201>
The parameter designing unit 201 designs the parameter Ψ _{L, ω} so that the parameter Ψ _{L, ω} of the PSD separation matrix G satisfies the following condition (S201) and outputs it.

例えば、人手により、 For example, manually

を、パラメータ設計部２０１が設計する。 Is designed by the parameter design unit 201.

＜分離行列設計部２０２＞
分離行列設計部２０２は、パラメータΨ_L,ωを受け取り、PSD分離行列G_L,ωがPSD混合行列D_L,ωの逆行列となり、かつ、PSD分離行列G_L,ωがM行列となるように、PSD分離行列G_L,ωを設計し（Ｓ２０２）、出力する。例えば、式(35)、(36)により、D_L,ω、G_L,ωを解析的に設計する。 <Separation matrix design unit 202>
The separation matrix designing unit 202 receives the parameter Ψ _{L, ω} , and the PSD separation matrix G _{L, ω} is an inverse matrix of the PSD mixing matrix D _{L, ω} and the PSD separation matrix G _{L, ω} is an M matrix. Then, the PSD separation matrix G _{L, ω} is designed (S202) and output. For example, D _{L, ω} and G _{L, ω} are analytically designed by equations (35) and (36).

＜フィルタ係数算出部２０３＞
フィルタ係数算出部２０３は、PSD混合行列D_L,ωを受け取り、ビームフォーミングのフィルタ係数を解析的に
M=Lのとき、 <Filter coefficient calculation unit 203>
The filter coefficient calculation unit 203 receives the PSD mixing matrix D _{L, ω} and analytically calculates the beamforming filter coefficient.
When M = L,

として算出し、M>Lのとき、 And when M> L,

として算出し（Ｓ２０３）、出力する。なお、式(57)のΔ_L,ωはPSD混合行列D_L,ωの各要素の平方根として求められる行列、H_L,ωはアレイマニフォールドベクトル(自由音場にアレイを設置したときに得られるであろう伝達特性の模擬値)である。式(65)のF_MはDFT基底行列、Υ_Δ,M,ωはΔ_M,ωのDFT出力値により構成される行列、Δ_M,ω,H_M,ωはΔ_L,ω,H_L,ωに対してそれぞれアップコンバージョンを行った行列である。 Is calculated (S203) and output. Note that Δ _{L, ω} in Equation (57) is a matrix obtained as the square root of each element of the PSD mixing matrix D _{L, ω} , and H _{L, ω} is an array manifold vector (obtained when the array is installed in the free sound field). It is a simulated value of the transfer characteristic. F _M is DFT basis matrix of the formula _{(65), Υ Δ, M} , ω is delta _M, the matrix composed of DFT output values of _{_{ω, Δ M, ω, H}} M, ω is Δ _{_L, ω,} H _L _{and ω} are up-converted matrices.

＜効果＞
以上の構成により、回転対称性を満たすように、M個のマイクロホンが配置される（例えば、等間隔円形アレイを用いて音場を観測する）ことを前提として、高精度に音源PSDを推定するために適した性質を持つPSD混合/分離行列、及びビームフォーミングのフィルタ係数を解析的に導出する方法について提案した。これまで、ヒューリスティックに決めてきた変数(PSD混合/分離行列、ビームフォーミングのフィルタ係数)を解析的に最適化することで、音源PSDの推定誤差を軽減し、クリアな目的音強調が可能になる。 <Effect>
With the above configuration, it is assumed that M microphones are arranged so as to satisfy the rotational symmetry (for example, the sound field is observed using a circular array with equal intervals), and the sound source PSD is estimated with high accuracy. We have proposed a method for analytically deriving the PSD mixing / separation matrix with suitable properties and the beamforming filter coefficient. Up to now, heuristically determined variables (PSD mixing / separation matrix, beamforming filter coefficients) are analytically optimized to reduce the estimation error of the sound source PSD and enable clear target sound enhancement. .

＜変形例＞
フィルタ係数算出部２０３のフィルタ係数を算出する式は、本実施形態の式に限定されない。本実施形態で求めたPSD混合/分離行列を用いて、他の式等によりフィルタ係数を算出してもよい。また、パラメータ設計部２０１と分離行列設計部２０２とを別装置とし、分離行列設計装置としてもよい。さらに、パラメータ設計部２０１を備えず、分離行列設計部２０２のみからなる離行列設計装置としてもよい。このときパラメータΨは人手により入力してもよいし、別装置から与えられてもよい。 <Modification>
The formula for calculating the filter coefficient of the filter coefficient calculation unit 203 is not limited to the formula of this embodiment. The PSD mixing / separation matrix obtained in this embodiment may be used to calculate the filter coefficient by another equation or the like. Further, the parameter designing unit 201 and the separation matrix designing unit 202 may be separate devices, and may be a separation matrix designing device. Furthermore, the separation matrix design device may be configured to include only the separation matrix design unit 202 without the parameter design unit 201. At this time, the parameter Ψ may be manually input or may be given from another device.

＜第二実施形態＞
第一実施形態のフィルタ係数算出装置２００で求めたフィルタ係数W_L,ω及びPSD分離行列G_L,ωを用いた音声強調装置３００について説明する。 <Second embodiment>
A speech enhancement device 300 using the filter coefficient W _{L, ω} and the PSD separation matrix G _{L, ω} obtained by the filter coefficient calculation device 200 of the first embodiment will be described.

図９は第二実施形態に係る音声強調装置３００の機能ブロック図、図１０はその処理フローの例を示す図である。 FIG. 9 is a functional block diagram of the voice emphasizing device 300 according to the second embodiment, and FIG. 10 is a diagram showing an example of its processing flow.

音声強調装置３００は、ビームフォーミング部３０１、ウィーナーフィルタ推定部３０２、ウィーナーフィルタリング部３０３、記憶部３０４を含む。 The speech enhancement device 300 includes a beamforming unit 301, a Wiener filter estimation unit 302, a Wiener filtering unit 303, and a storage unit 304.

音声強調装置３００は、M個のマイクロホンで収音した観測信号X_m,ω,τを入力とし、方向Θ_lから到来する音を強調した出力信号Z_l,ω,τを求め、出力する。なお、M個のマイクロホンは、回転対称性を満たすように配置されるものとする。 The speech emphasizing device 300 receives the observation signals X _{m, ω, τ} collected by M microphones as input, and obtains and outputs an output signal Z _{l, ω, τ in} which the sound coming from the direction Θ _l is emphasized. Note that the M microphones are arranged so as to satisfy rotational symmetry.

記憶部３０４には、第一実施形態で求めたフィルタ係数W_L,ω及びPSD分離行列G_L,ωを格納しておく。 The storage unit 304 stores the filter coefficient W _{L, ω} and the PSD separation matrix G _{L, ω} obtained in the first embodiment.

ビームフォーミング部３０１は、観測信号X_m,ω,τとフィルタ係数W_l,ω,τとを受け取り、次式により、ビームフォーミング出力信号Y_l,ω,τを求め（Ｓ３０１）、出力する。 The beamforming unit 301 receives the observation signal X _{m, ω, τ} and the filter coefficient W _{l, ω, τ, obtains a} beamforming output signal Y _{l, ω, τ} from the following equation (S301), and outputs it.

ウィーナーフィルタ推定部３０２は、ビームフォーミング出力信号Y_l,ω,τとPSD分離行列G_L,ωとを受け取り、次式により、ウィーナーフィルタU_l,ω,τを算出し（Ｓ３０２）、出力する。 The Wiener filter estimation unit 302 receives the beamforming output signal Y _{l, ω, τ} and the PSD separation matrix G _{L, ω} , calculates the Wiener filter U _{l, ω, τ} by the following equation (S302) and outputs it. .

ウィーナーフィルタリング部３０３は、ビームフォーミング出力信号Y_l,ω,τとウィーナーフィルタU_l,ω,τとを受け取り、ビームフォーミング出力信号Y_l,ω,τをウィーナーフィルタU_l,ω,τでフィルタリングし（Ｓ３０３）、出力信号Z_l,ω,τを求め、出力する。 The Wiener filtering unit 303 receives the beamforming output signal Y _{l, ω, τ} and the Wiener filter U _{l, ω, τ,} and filters the beamforming output signal Y _{l, ω, τ} with the Wiener filter U _{l, ω, τ} . (S303), the output signals Z _{l, ω, τ} are obtained and output.

＜第三実施形態＞
第二実施形態と異なる部分を中心に説明する。 <Third embodiment>
The description will focus on the parts that are different from the second embodiment.

第一実施形態の技術を応用すると、ユーザの操作に応じて強調したい音源の収音領域の角度幅を連続的に変えることができる。例えば、雑音となる成分が近傍にある場合には、収音領域を狭めることで、出力SN比を向上する。 By applying the technique of the first embodiment, it is possible to continuously change the angular width of the sound collection region of the sound source to be emphasized according to the user's operation. For example, when there is a noise component in the vicinity, the output SN ratio is improved by narrowing the sound collection area.

図１１は第三実施形態に係る音声強調装置４００の機能ブロック図、図１２はその処理フローの例を示す図である。 FIG. 11 is a functional block diagram of the voice emphasizing device 400 according to the third embodiment, and FIG. 12 is a diagram showing an example of the processing flow thereof.

音声強調装置３００は、収音領域指定部４０１、パラメータ設計部４０２、分離行列設計部２０２、フィルタ係数算出部２０３、ビームフォーミング部３０１、ウィーナーフィルタ推定部３０２、ウィーナーフィルタリング部３０３を含む。 The speech enhancement device 300 includes a sound collection area designating unit 401, a parameter designing unit 402, a separation matrix designing unit 202, a filter coefficient computing unit 203, a beamforming unit 301, a Wiener filter estimating unit 302, and a Wiener filtering unit 303.

第一実施形態及び第二実施形態と異なる収音領域指定部４０１、パラメータ設計部４０２について説明する。 A sound collection area designating unit 401 and a parameter designing unit 402 different from those in the first and second embodiments will be described.

収音領域指定部４０１は、図示しない入力手段(キーボート、マウス、タッチパネル等)を介して、ユーザによって指定された収音領域Aを受け取り、広さを表すためのパラメータμを求め(Ｓ４０１)、出力する。例えば、0≦μ＜1とし、μが小さいほど収音領域を狭め、μが1に近づくほど収音領域を広げることに対応する。収音領域Aを0≦μ＜1とし、収音領域Aを広さを表すためのパラメータμとしてそのまま出力してもよい。この値は音源強調の処理中に変えることが可能で、連続的に変えることで収音領域をスムーズに広げたり狭めたりすることができる。 The sound collection area designating section 401 receives the sound collection area A designated by the user via an input means (not shown) (keyboard, mouse, touch panel, etc.), and obtains a parameter μ for expressing the width (S401), Output. For example, 0 ≦ μ <1, which corresponds to narrowing the sound collection area as μ decreases, and expanding the sound collection area as μ approaches 1. The sound pickup area A may be set to 0 ≦ μ <1, and the sound pickup area A may be output as it is as a parameter μ for expressing the width. This value can be changed during the sound source enhancement process, and by continuously changing it, the sound collection area can be smoothly widened or narrowed.

パラメータ設計部４０２は、パラメータμを入力とし、パラメータμに対応するパラメータΨ_L,ωを設計し（Ｓ４０２）、出力する。収音領域のユーザ要求にマッチして、Ψ_L,ωを決める方法には任意性がある。大きな方針としては、μが大きくなるほど、ψ_1,ωが大きな値になるようにすればよい。上述の＜設計したビームフォーマの物理特性の調査＞で説明したように、ψ_1,ωは無指向性で収音した信号に掛かる係数に対応する。詳細は述べないが、ψ_1,ωが大きくなるほど、フィルタの特性における無指向性収音信号の重みが大きくなるため、収音領域幅が大きくなる。例えば、ψ_1,ω=μとし、他の値は式(41)と(42)を満たすようにランダムに値を割り振ったり同じ値を代入してもよい。 The parameter designing unit 402 inputs the parameter μ, designs the parameter Ψ _{L, ω} corresponding to the parameter μ (S402), and outputs it. There is arbitrariness in the method of determining Ψ _{L, ω} that matches the user demand in the sound collection area. As a big policy, ψ _{1, ω} may have a larger value as μ increases. As described above in <Investigation of Physical Properties of Designed Beamformer>, ψ _{1, ω} corresponds to a coefficient applied to a non-directionally picked up signal. Although not described in detail, the larger ψ _{1, ω} , the larger the weight of the omnidirectional sound pickup signal in the filter characteristics, and thus the larger the sound pickup region width. For example, ψ _{1, ω} = μ and other values may be randomly assigned or the same value may be substituted so as to satisfy the equations (41) and (42).

例えば、１つ以上の閾値とパラメータμとの大小関係に基づき、予め用意しておいたパラメータΨ_L,ωを選択する構成としてもよい。例えば、2個の閾値T₁,T₂(T₁＜T₂)を用意し、 For example, the parameter Ψ _{L, ω} prepared in advance may be selected based on the magnitude relationship between one or more threshold values and the parameter μ. For example, prepare _two thresholds T ₁ , T ₂ (T ₁ <T ₂ ),

とする。ただし、a₁＜a₂＜a₃とする。また、所定の計算式に基づき、μを用いて、パラメータΨ_L,ωを計算により求める構成としてもよい。前述の通り、μが大きくなるほど、ψ_1,ωが大きな値になるように、これらの閾値とパラメータΨ_L,ω、または所定の計算式を用意すればよい。 And However, a ₁ <a ₂ <a ₃ . Further, the parameter Ψ _{L, ω} may be calculated by using μ based on a predetermined calculation formula. As described above, these thresholds and parameters Ψ _{L, ω} or a predetermined calculation formula may be prepared so that ψ _{1, ω} has a larger value as μ increases.

図１３−図１６に、ψ_1,ωを変えながら作成したフィルタの指向特性を載せる。ψ_1,ωを大きくすることで、ビーム幅が広がることが分かる。なお、あまり、感度差がつかないようなビームフォーマを作成しても、正確に音源PSDを推定できないことから、ψ_1,ωの最大値は0.8程度までにしたほうがいいことも数値シミュレーションより分かった。 13 to 16 show the directional characteristics of the filters created while changing ψ _{1, ω} . It can be seen that increasing ψ _{1, ω} widens the beam width. It should be noted from the numerical simulation that the maximum value of ψ _{1, ω} should be set to about 0.8 because it is not possible to accurately estimate the PSD of the sound source even if a beamformer with a small difference in sensitivity is not created. It was

＜効果＞
このような構成とすることで、第二実施形態と同様の効果を得ることができる。さらに、柔軟に収音領域を変更できる。 <Effect>
With such a configuration, the same effect as that of the second embodiment can be obtained. Furthermore, the sound collection area can be changed flexibly.

＜その他の変形例＞
本発明は上記の実施形態及び変形例に限定されるものではない。例えば、上述の各種の処理は、記載に従って時系列に実行されるのみならず、処理を実行する装置の処理能力あるいは必要に応じて並列的にあるいは個別に実行されてもよい。その他、本発明の趣旨を逸脱しない範囲で適宜変更が可能である。 <Other modifications>
The present invention is not limited to the above embodiments and modifications. For example, the various processes described above may be executed not only in time series according to the description, but also in parallel or individually according to the processing capability of the device that executes the process or the need. Other changes can be made as appropriate without departing from the spirit of the present invention.

＜プログラム及び記録媒体＞
また、上記の実施形態及び変形例で説明した各装置における各種の処理機能をコンピュータによって実現してもよい。その場合、各装置が有すべき機能の処理内容はプログラムによって記述される。そして、このプログラムをコンピュータで実行することにより、上記各装置における各種の処理機能がコンピュータ上で実現される。 <Program and recording medium>
Further, various processing functions in each device described in the above-described embodiment and modification may be realized by a computer. In that case, the processing content of the function that each device should have is described by the program. By executing this program on a computer, various processing functions of the above-described devices are realized on the computer.

この処理内容を記述したプログラムは、コンピュータで読み取り可能な記録媒体に記録しておくことができる。コンピュータで読み取り可能な記録媒体としては、例えば、磁気記録装置、光ディスク、光磁気記録媒体、半導体メモリ等どのようなものでもよい。 The program describing the processing contents can be recorded in a computer-readable recording medium. The computer-readable recording medium may be any recording medium such as a magnetic recording device, an optical disc, a magneto-optical recording medium, or a semiconductor memory.

また、このプログラムの流通は、例えば、そのプログラムを記録したＤＶＤ、ＣＤ−ＲＯＭ等の可搬型記録媒体を販売、譲渡、貸与等することによって行う。さらに、このプログラムをサーバコンピュータの記憶装置に格納しておき、ネットワークを介して、サーバコンピュータから他のコンピュータにそのプログラムを転送することにより、このプログラムを流通させてもよい。 The distribution of this program is performed by selling, transferring, or lending a portable recording medium such as a DVD or a CD-ROM in which the program is recorded. Further, the program may be stored in a storage device of a server computer and transferred from the server computer to another computer via a network to distribute the program.

このようなプログラムを実行するコンピュータは、例えば、まず、可搬型記録媒体に記録されたプログラムもしくはサーバコンピュータから転送されたプログラムを、一旦、自己の記憶部に格納する。そして、処理の実行時、このコンピュータは、自己の記憶部に格納されたプログラムを読み取り、読み取ったプログラムに従った処理を実行する。また、このプログラムの別の実施形態として、コンピュータが可搬型記録媒体から直接プログラムを読み取り、そのプログラムに従った処理を実行することとしてもよい。さらに、このコンピュータにサーバコンピュータからプログラムが転送されるたびに、逐次、受け取ったプログラムに従った処理を実行することとしてもよい。また、サーバコンピュータから、このコンピュータへのプログラムの転送は行わず、その実行指示と結果取得のみによって処理機能を実現する、いわゆるＡＳＰ（Application Service Provider）型のサービスによって、上述の処理を実行する構成としてもよい。なお、プログラムには、電子計算機による処理の用に供する情報であってプログラムに準ずるもの（コンピュータに対する直接の指令ではないがコンピュータの処理を規定する性質を有するデータ等）を含むものとする。 A computer that executes such a program first stores, for example, the program recorded on a portable recording medium or the program transferred from the server computer in its own storage unit. Then, when executing the process, this computer reads the program stored in its own storage unit and executes the process according to the read program. Further, as another embodiment of this program, the computer may directly read the program from the portable recording medium and execute processing according to the program. Further, each time the program is transferred from the server computer to this computer, the processing according to the received program may be executed successively. Further, the above-described processing is executed by a so-called ASP (Application Service Provider) type service that realizes a processing function only by executing the execution instruction and acquiring the result without transferring the program from the server computer to the computer. May be Note that the program includes information that is used for processing by an electronic computer and that conforms to the program (such as data that is not a direct command to a computer but has the property of defining computer processing).

また、コンピュータ上で所定のプログラムを実行させることにより、各装置を構成することとしたが、これらの処理内容の少なくとも一部をハードウェア的に実現することとしてもよい。 Further, although each device is configured by executing a predetermined program on a computer, at least a part of the processing contents may be realized by hardware.

Claims

Mを2以上の整数とし、M個の収音信号を入力とし、L個のビームフォーミング出力値を得るビームフォーミングにおいて、l=1,2,…,Lとし、i=1,2,…,Lとし、PSD分離行列G_L,ωが、l番目のビームフォーマの方向Θ_iに対する感度を要素とするL×Lの非負行列であるPSD混合行列D_L,ωの逆行列となり、かつ、前記PSD分離行列G_L,ωがM行列となるように、前記PSD分離行列G_L,ωを設計する分離行列設計部を含み、
前記M個の収音信号は、回転対称性を満たす制約のもと配置されたM個のマイクロホンで収音された信号であり、
前記G _L,ω は必ずD _L,ω の逆行列となる制約を課せられている、
分離行列設計装置。 In beamforming in which M is an integer of 2 or more, M sound pickup signals are input, and L beamforming output values are obtained, l = 1,2, ..., L and i = 1,2, ..., L, the PSD separation matrix G _{L, ω} is an inverse matrix of a PSD mixing matrix D _{L, ω} that is an L × L non-negative matrix having the sensitivity of the l-th beamformer in the direction Θ _{i as} an element, and as PSD separating matrix G _{L, omega} is M matrix, see contains the separation matrix design unit for designing the PSD separating matrix G _L, the _omega,
The M picked-up signals are signals picked up by M microphones arranged under a constraint satisfying rotational symmetry,
The G _{L, ω} is always constrained to be an inverse matrix of D _{L, ω} ,
Separation matrix design device.

Mを2以上の整数とし、M個の収音信号を入力とし、L個のビームフォーミング出力値を得るビームフォーミングにおいて、l=1,2,…,Lとし、i=1,2,…,Lとし、PSD分離行列G _L,ω が、l番目のビームフォーマの方向Θ _i に対する感度を要素とするL×Lの非負行列であるPSD混合行列D _L,ω の逆行列となり、かつ、前記PSD分離行列G _L,ω がM行列となるように、前記PSD分離行列G _L,ω を設計する分離行列設計部を含み、
M個のマイクロホンの空間的配置の中心を通るある回転軸に対して1回転させたときに、M個の前記マイクロホンの空間的配置と同一の空間的配置が複数回現れるようにM個の前記マイクロホンが配置されるものとし、前記分離行列設計部は、前記PSD分離行列G_L,ωが巡回性を持つM行列となるように前記PSD分離行列G_L,ωを設計する、
分離行列設計装置。 In beamforming in which M is an integer of 2 or more, M sound pickup signals are input, and L beamforming output values are obtained, l = 1,2, ..., L and i = 1,2, ..., L, the PSD separation matrix G _{L, ω} is an inverse matrix of a PSD mixing matrix D _{L, ω} that is an L × L non-negative matrix having the sensitivity of the l-th beamformer in the direction Θ _{i as} an element , and A PSD separation matrix G _{L, ω} includes a separation matrix design unit for designing the PSD separation matrix G _{L, ω} so that it becomes an M matrix ,
When one rotation is made about a rotation axis passing through the center of the spatial arrangement of the M microphones, the M spatial arrangements that are the same as the spatial arrangements of the M microphones appear multiple times. shall microphone is disposed, the separation matrix design unit is designed the PSD separating matrix G _{L, omega} is the PSD separation matrix G _L such that M matrix with _cyclicity, the _omega,
Separation matrix design device.

請求項２の分離行列設計装置であって、
A^Hは行列Aの共役転置、Λ_L,ωは正の対角行列、F_LはDFT基底行列、I_Lは単位行列、前記PSD分離行列G_L,ωは、
であり、
A^*はAの複素共役であり、前記PSD分離行列G_L,ωのパラメータΨ_L,ωが以下の条件を満たすようにパラメータΨ_L,ωを設計するパラメータ設計部を含む、
分離行列設計装置。 A separation matrix design device 請 Motomeko 2,
A ^H is a conjugate transpose of the matrix A, Λ _{L, ω} is a positive diagonal matrix, F _L is a DFT basis matrix, I _L is an identity matrix, and the PSD separation matrix G _{L, ω} is
And
A ^* is the complex conjugate of A, including the parameter design portion to which the PSD separating matrix G _L, parameters _omega [psi _{L, omega} is the design parameters [psi _L, the _omega so as to satisfy the following conditions,
Separation matrix design device.

請求項３の分離行列設計装置であって、
ユーザによって指定された収音領域Aを受け取り、広さを表すためのパラメータμを求める収音領域指定部を含み、
前記パラメータ設計部は、前記パラメータμが大きいほどパラメータψ_1,ωの値が大きくなるように設計する、
分離行列設計装置。 The separation matrix design device according to claim 3, wherein
The sound collecting area A specified by the user is received, and the sound collecting area designating unit for obtaining the parameter μ for expressing the width is included.
The parameter design unit designs such that the larger the parameter μ, the larger the value of the parameter ψ _{1, ω} .
Separation matrix design device.

請求項２から請求項４の何れかの分離行列設計装置が設計した前記PSD分離行列G_L,ωの逆行列である前記PSD混合行列D_L,ωを用いてビームフォーミングのフィルタ係数W_L,ωを算出するフィルタ係数算出部を含み、
Δ_L,ωは前記PSD混合行列D_L,ωの各要素の平方根として求められる行列、H_L,ωはアレイマニフォールドベクトル、Υ_Δ,M,ωはΔ_M,ωのDFT出力値により構成される行列であり、F_MはDFT基底行列、Υ_Δ,M,ωはΔ_M,ωのDFT出力値により構成される行列、Δ_M,ω,H_M,ωはΔ_L,ω,H_L,ωに対してそれぞれアップコンバージョンを行った行列であり、前記フィルタ係数算出部は、前記フィルタ係数Wを、M=Lのとき、
として算出し、M>Lのとき、
として算出する、
フィルタ係数算出装置。 A filter coefficient W _{L, for} beamforming using the PSD mixing matrix D _{L, ω} which is an inverse matrix of the PSD separation matrix G _{L, ω} designed by the separation matrix designing device according to any one of claims 2 to 4 _. includes a filter coefficient calculation unit that calculates _ω ,
Δ _{L, ω} is a matrix obtained as the square root of each element of the PSD mixing matrix D _{L, ω} , H _{L, ω} is an array manifold vector, and Υ _{Δ, M, ω} is composed of DFT output values of Δ _{M, ω} that is a matrix, F _M is DFT basis _matrix, Υ Δ, _M, ω is delta _M, the matrix composed of DFT output values of _{_{ω, Δ M, ω, H}} M, ω is Δ _{_L, ω,} H _L _, is a matrix respectively up-converted for _ω , the filter coefficient calculation unit, the filter coefficient W, when M = L,
And when M> L,
Is calculated as
Filter coefficient calculation device.

Mを2以上の整数とし、M個の収音信号を入力とし、L個のビームフォーミング出力値を得るビームフォーミングにおいて、l=1,2,…,Lとし、i=1,2,…,Lとし、PSD分離行列G_L,ωが、l番目のビームフォーマの方向Θ_iに対する感度を要素とするL×Lの非負行列であるPSD混合行列D_L,ωの逆行列となり、かつ、前記PSD分離行列G_L,ωがM行列となるように、前記PSD分離行列G_L,ωを設計する分離行列設計ステップを含み、
前記M個の収音信号は、回転対称性を満たす制約のもと配置されたM個のマイクロホンで収音された信号であり、
前記G _L,ω は必ずD _L,ω の逆行列となる制約を課せられている、
分離行列設計装置が実行する分離行列設計方法。 In beamforming in which M is an integer of 2 or more, M sound pickup signals are input, and L beamforming output values are obtained, l = 1,2, ..., L and i = 1,2, ..., L, the PSD separation matrix G _{L, ω} is an inverse matrix of a PSD mixing matrix D _{L, ω} that is an L × L non-negative matrix having the sensitivity of the l-th beamformer in the direction Θ _{i as} an element, and as PSD separating matrix G _{L, omega} is M matrix, see contains the separation matrix design step of designing the PSD separating matrix G _L, the _omega,
The M picked-up signals are signals picked up by M microphones arranged under a constraint satisfying rotational symmetry,
The G _{L, ω} is always constrained to be an inverse matrix of D _{L, ω} ,
A separation matrix design method executed by a separation matrix design device.

Mを2以上の整数とし、M個の収音信号を入力とし、L個のビームフォーミング出力値を得るビームフォーミングにおいて、l=1,2,…,Lとし、i=1,2,…,Lとし、PSD分離行列G _L,ω が、l番目のビームフォーマの方向Θ _i に対する感度を要素とするL×Lの非負行列であるPSD混合行列D _L,ω の逆行列となり、かつ、前記PSD分離行列G _L,ω がM行列となるように設計した前記PSD分離行列G_L,ωの逆行列である前記PSD混合行列D_L,ωを用いてビームフォーミングのフィルタ係数W_L,ωを算出するフィルタ係数算出ステップを含み、
Δ_L,ωは前記PSD混合行列D_L,ωの各要素の平方根として求められる行列、H_L,ωはアレイマニフォールドベクトル、Υ_Δ,M,ωはΔ_M,ωのDFT出力値により構成される行列であり、F_MはDFT基底行列、Υ_Δ,M,ωはΔ_M,ωのDFT出力値により構成される行列、Δ_M,ω,H_M,ωはΔ_L,ω,H_L,ωに対してそれぞれアップコンバージョンを行った行列であり、前記フィルタ係数算出ステップは、前記フィルタ係数Wを、M=Lのとき、
として算出し、M>Lのとき、
として算出する、
フィルタ係数算出装置が実行するフィルタ係数算出方法。 In beamforming in which M is an integer of 2 or more, M sound pickup signals are input, and L beamforming output values are obtained, l = 1,2, ..., L and i = 1,2, ..., L, the PSD separation matrix G _{L, ω} is an inverse matrix of a PSD mixing matrix D _{L, ω} that is an L × L non-negative matrix having the sensitivity of the l-th beamformer in the direction Θ _{i as} an element , and Using the PSD mixing matrix D _{L, ω} that is the inverse of the PSD separation matrix G _{L, ω} designed so that the PSD separation matrix G _{L, ω} becomes an M matrix, the beamforming filter coefficient W _{L, ω} Including a filter coefficient calculation step of calculating,
Δ _{L, ω} is a matrix obtained as the square root of each element of the PSD mixing matrix D _{L, ω} , H _{L, ω} is an array manifold vector, and Υ _{Δ, M, ω} is composed of DFT output values of Δ _{M, ω.} that is a matrix, F _M is DFT basis _matrix, Υ Δ, _M, ω is delta _M, the matrix composed of DFT output values of _{_{ω, Δ M, ω, H}} M, ω is Δ _{_L, ω,} H _L _, is a matrix each up-converted for _ω , the filter coefficient calculation step, the filter coefficient W, when M = L,
And when M> L,
Is calculated as
A filter coefficient calculation method executed by a filter coefficient calculation device.

請求項２から請求項４の何れかの分離行列設計装置、または、請求項５のフィルタ係数算出装置としてコンピュータを機能させるためのプログラム。 A program for causing a computer to function as the separation matrix design device according to any one of claims 2 to 4, or the filter coefficient calculation device according to claim 5.