JP5986966B2

JP5986966B2 - Sound field recording / reproducing apparatus, method, and program

Info

Publication number: JP5986966B2
Application number: JP2013167502A
Authority: JP
Inventors: 翔一小山; 祐介日和▲崎▼
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2013-08-12
Filing date: 2013-08-12
Publication date: 2016-09-06
Anticipated expiration: 2033-08-12
Also published as: JP2015037207A

Description

本発明、ある音場に設置されたマイクアレイで音信号を収音し、その音信号を用いてスピーカアレイでその音場を再生する波面合成法（Wave Field Synthesis）の技術に関する。 The present invention relates to a wave field synthesis technique for collecting a sound signal with a microphone array installed in a certain sound field and reproducing the sound field with a speaker array using the sound signal.

波面合成法は、複数のマイクロホンとスピーカを用いて、遠隔地の音場を仮想的に再現する技術である。遠隔コミュニケーションシステムなどの応用では、リアルタイムの収音・再現が必要になるため、一般的なマイクロホンアレイで収音した音圧を、一般的なスピーカアレイで出力するための音場再現信号へと一意に変換可能であることが必要となる。 The wavefront synthesis method is a technique for virtually reproducing a sound field in a remote place using a plurality of microphones and speakers. In applications such as remote communication systems, real-time sound collection / reproduction is required, so the sound pressure collected by a general microphone array is unique to a sound field reproduction signal for output by a general speaker array. It must be convertible to

波面合成法の従来技術として非特許文献１が知られている。 Non-Patent Document 1 is known as a prior art of the wavefront synthesis method.

小山、古家、日和崎、羽田、「音場収音・再現のための時空間周波数領域信号変換法」、音響学会秋季研究発表会講演論文集、2011年、 pp. 635-636Koyama, Furuya, Hiwazaki, Haneda, "Spatio-temporal frequency domain signal conversion method for sound field collection and reproduction", Proc. Of the Acoustical Society of Japan Autumn Meeting, 2011, pp. 635-636

しかしながら、従来技術では、マイクロホンアレイで取得した音場をスピーカアレイを用いて再現する場合に、マイクロホンやスピーカの間隔に依存する、空間エイリアシングによる誤差が生じる。これはマイクロホンの数がスピーカの数よりも少ない場合に特に顕著であり、再現された音場において、音質や定位感の劣化が生じるという課題があった。 However, in the prior art, when a sound field acquired by a microphone array is reproduced using a speaker array, an error due to spatial aliasing that depends on the distance between the microphone and the speaker occurs. This is particularly noticeable when the number of microphones is smaller than the number of speakers, and there is a problem that the sound quality and the feeling of localization occur in the reproduced sound field.

本発明は、一般的なマイクロホン・スピーカを用いて、高精度に音場を収音・再現するための音場収音再生装置、方法及びプログラムを提供することを目的とする。 It is an object of the present invention to provide a sound field sound collecting / reproducing apparatus, method, and program for collecting and reproducing a sound field with high accuracy using a general microphone / speaker.

上記の課題を解決するために、本発明の第一の態様によれば、音場収音再生装置は、複数のマイクロホンが配置されている空間には音源が一つ以上存在し、複数のスピーカが直線状または平面状に波面の進行方向に対し制御点よりも後方に配置されるものとし、ωを周波数とし、複数のスピーカにより再現される仮想音源の位置と複数のスピーカの位置とに基づいて求まるスピーカの駆動信号の振幅の予測分布と、複数のスピーカの各位置から複数の制御点の各位置までの伝達特性とに基づいて求まり、仮想音源の位置からの波面を制御点で実現するフィルタF(ω)を、複数のマイクロホンで収音された信号それぞれの周波数領域信号P（ω）に対して適用してフィルタ処理後信号D(ω)を生成する変換フィルタ部を含む。 In order to solve the above-described problem, according to the first aspect of the present invention, the sound field collecting and reproducing device includes one or more sound sources in a space where a plurality of microphones are arranged, and a plurality of speakers. Is arranged linearly or planarly behind the control point with respect to the traveling direction of the wavefront, with ω as the frequency, based on the position of the virtual sound source reproduced by the multiple speakers and the position of the multiple speakers To obtain the wavefront from the position of the virtual sound source at the control point, based on the predicted distribution of the amplitude of the driving signal of the speaker and the transfer characteristics from each position of the plurality of speakers to each position of the plurality of control points. The filter F (ω) is applied to the frequency domain signal P (ω) of each of the signals picked up by the plurality of microphones, and includes a conversion filter unit that generates a filtered signal D (ω).

上記の課題を解決するために、本発明の第二の態様によれば、音場収音再生方法は、複数のマイクロホンが配置されている空間には音を発する音源が一つ以上存在し、複数のスピーカが直線状または平面状に波面の進行方向に対し制御点よりも後方に配置されるものとし、ωを周波数とし、複数のスピーカにより再現される仮想音源の位置と複数のスピーカの位置とに基づいて求まるスピーカの駆動信号の振幅の予測分布と、複数のスピーカの各位置から複数の制御点の各位置までの伝達特性とに基づいて求まり、仮想音源の位置からの波面を制御点で実現するフィルタF(ω)を、複数のマイクロホンで収音された信号それぞれの周波数領域信号P（ω）に対して適用してフィルタ処理後信号D(ω)を生成する変換フィルタステップを含む。 In order to solve the above-described problem, according to the second aspect of the present invention, the sound field collecting and reproducing method includes at least one sound source that emits sound in a space where a plurality of microphones are arranged, A plurality of speakers are arranged linearly or planarly behind the control point with respect to the traveling direction of the wavefront, the position of the virtual sound source reproduced by the plurality of speakers and the position of the plurality of speakers, where ω is the frequency The wavefront from the position of the virtual sound source is determined based on the predicted distribution of the amplitude of the driving signal of the speaker obtained based on and the transfer characteristics from each position of the plurality of speakers to each position of the plurality of control points. Including a transform filter step of applying the filter F (ω) realized in step S1 to the frequency domain signal P (ω) of each of the signals picked up by a plurality of microphones to generate a filtered signal D (ω) .

本発明によれば、空間エイリアシングによる誤差を軽減することができ、再現された音場における、音質や定位感の劣化の改善を可能にするという効果を奏する。 According to the present invention, an error due to spatial aliasing can be reduced, and an effect is obtained that it is possible to improve deterioration in sound quality and localization in a reproduced sound field.

マイクロホン及びスピーカの配置の例を説明するための図。The figure for demonstrating the example of arrangement | positioning of a microphone and a speaker. 音場収音再生装置の例を示す機能ブロック図。The functional block diagram which shows the example of a sound field sound collection reproducing | regenerating apparatus. 音場収音再生方法の例を示す流れ図。The flowchart which shows the example of the sound field sound collection reproduction | regeneration method. マイクロホンアレイとスピーカアレイと消音領域の例を示す図。The figure which shows the example of a microphone array, a speaker array, and a silence area. マイクロホン及びスピーカの配置の例を説明するための図。The figure for demonstrating the example of arrangement | positioning of a microphone and a speaker.

以下、本発明の実施形態について説明する。なお、以下の説明に用いる図面では、同じ機能を持つ構成部や同じ処理を行うステップには同一の符号を記し、重複説明を省略する。以下の説明において、テキスト中で使用する記号「⁻」「＾」等は、本来直前の文字の真上に記載されるべきものであるが、テキスト記法の制限により、当該文字の直後に記載する。式中においてはこれらの記号は本来の位置に記述している。また、ベクトルや行列の各要素単位で行われる処理は、特に断りが無い限り、そのベクトルやその行列の全ての要素に対して適用されるものとする。 Hereinafter, embodiments of the present invention will be described. In the drawings used for the following description, constituent parts having the same function and steps for performing the same process are denoted by the same reference numerals, and redundant description is omitted. In the following description, the symbols “ ⁻ ”, “^”, etc. used in the text should be described immediately above the immediately preceding character, but are described immediately after the character due to restrictions on text notation. . In the formula, these symbols are written in their original positions. Further, the processing performed for each element of a vector or matrix is applied to all elements of the vector or matrix unless otherwise specified.

＜第一実施形態＞
本発明では、マイクロホンアレイで収音した音圧と、収音側のおおまかな音源の位置とを併用した信号変換を行い、空間エイリアシングによる誤差を軽減する。空間フーリエ変換を行う代わりに、音源の位置を利用することで、空間フーリエ変換によるエイリアシングの影響を低減することができ、目的音源の再生音の音色をよくすることができる。 <First embodiment>
In the present invention, signal conversion using both the sound pressure collected by the microphone array and the position of the rough sound source on the sound collection side is performed to reduce errors due to spatial aliasing. By using the position of the sound source instead of performing the spatial Fourier transform, the influence of aliasing due to the spatial Fourier transform can be reduced, and the tone of the reproduced sound of the target sound source can be improved.

第一実施形態では、平面状にスピーカアレイを配置した場合について説明する。
第一実施形態の音場収音再生装置及び方法は、図１に示すように、収音側の空間に配置されているM個のマイクロホン１−ｉで構成されるマイクロホンアレイと、再生側の空間に配置されているL個のスピーカ２−ｐで構成されるスピーカアレイとを用いて、収音側の空間の音源Ｓで発生した音によって形成された収音側の空間の音場を再生側の空間で再現する。なお、収音側の空間及び再生側の空間は互いに異なる空間であってもよいし、同じ空間であってもよい。図１では、再生側の空間においてL個のスピーカ２−ｐで再現された音源を音源Ｓ'（以下、「仮想音源Ｓ’」とも表現する）と表現している。 In the first embodiment, a case where the speaker array is arranged in a planar shape will be described.
As shown in FIG. 1, the sound field sound collection / reproduction device and method of the first embodiment includes a microphone array composed of M microphones 1-i arranged in a sound collection side space, and a reproduction side Using a speaker array composed of L speakers 2-p arranged in space, the sound field in the sound collection side formed by the sound generated by the sound source S in the sound collection side space is reproduced. Reproduce in the side space. Note that the sound collection side space and the reproduction side space may be different from each other or the same space. In FIG. 1, the sound source reproduced by the L speakers 2-p in the reproduction-side space is expressed as a sound source S ′ (hereinafter also referred to as “virtual sound source S ′”).

図２は音場収音再生装置１００の機能ブロック図を、図３はその処理フローを示す。 FIG. 2 is a functional block diagram of the sound field sound collecting / reproducing apparatus 100, and FIG. 3 shows a processing flow thereof.

音場収音再生装置１００は、音源位置検出部１０１、フィルタ計算部１０３、周波数変換部１１１、変換フィルタ部１１３及び周波数逆変換部１１５を含む。 The sound field sound collecting / reproducing apparatus 100 includes a sound source position detecting unit 101, a filter calculating unit 103, a frequency converting unit 111, a converting filter unit 113, and a frequency inverse converting unit 115.

本実施形態では、任意の形状で配置したマイクロホンアレイ（チャネル数M）で収音した音場を、平面状に配置したスピーカアレイ（チャネル数L）を用いて再現するとする。スピーカの素子間隔は任意の配置で構わない。各マイクロホン１−ｉの位置をr^- _m,i=(x_m,i,y_m,i,z_m,i)とし、各スピーカ２−ｐの位置をr^- _s,p=(x_s,p,0,z_s,p)とする。つまり、各スピーカ２−ｐは、xz平面上に、平面状に配置される。ただし、M>2、L>2、i=1,2,…,M、p=1,2,…,Lである。収音側の空間に配置されたマイクロホンの数Mと再生側の空間に配置されたスピーカの数Lとは異なっていてもよいし、同一であってもよい。 In the present embodiment, it is assumed that a sound field picked up by a microphone array (number of channels M) arranged in an arbitrary shape is reproduced using a speaker array (number of channels L) arranged in a plane. The speaker spacing may be arbitrarily arranged. The position of each microphone 1-i is r ^- _{m, i} = (xm _{, i} , ym _{, i} , zm _{, i} ), and the position of each speaker 2-p is r ^- _{s, p} = ( _{xs, p} , 0, z _{s, p} ). That is, each speaker 2-p is arranged in a plane on the xz plane. However, M> 2, L> 2, i = 1, 2,..., M, p = 1, 2,. The number M of microphones arranged in the sound collection side space and the number L of speakers arranged in the reproduction side space may be different or the same.

また、スピーカアレイの前面に、マイクロホンアレイのマイクロホン１−ｉと同じ配置の制御点４−ｉを設定する。各制御点４−ｉの位置は、r^- _c,i=(x_c,i,y_c,i,z_c,i)とする。また、各スピーカ２−ｐの位置r^- _s,pから制御点４−ｉの位置r^- _c,iまでの伝達関数をG(ω,r^- _c,i-r^- _s,p)と置き、既知であるとする。伝達関数をG(ω,r^- _c,i-r^- _s,p)はあらかじめ測定やシミュレーションにより取得するか、以下のようにモノポール特性を仮定して計算する。 Further, a control point 4-i having the same arrangement as the microphone 1-i of the microphone array is set on the front surface of the speaker array. The position of each control point 4-i is r ^- _{c, i} = ( _{xc, i} , yc _{, i} , zc _{, i} ). The position r of the speaker 2-p ^- Position ^{_{(s, p ω, r -}} - c, i -r) and ^- _s, the position of the control point 4-i from _p r _c, the transfer function up to _i G Suppose that it is already known. The transfer function ^{_{G (ω, r - c,}} i -r - s, p) is either obtained by previously measured or simulation, calculated assuming the monopole characteristics as follows.

ただし、jは虚数単位、kは波数である。波数kは、周波数ωを音速cで割った値k=ω/cとして定義される。 However, j is an imaginary unit and k is a wave number. The wave number k is defined as a value k = ω / c obtained by dividing the frequency ω by the speed of sound c.

スピーカ２−ｐは、波面の進行方向に対し、制御点よりも後方に配置する。なお、スピーカ２−ｐの位置は、仮想音源Ｓ’の位置にはよらず、制御点４−ｉの位置のみに拘束される。 The speaker 2-p is disposed behind the control point with respect to the traveling direction of the wavefront. Note that the position of the speaker 2-p is not limited to the position of the virtual sound source S ', but is restricted only to the position of the control point 4-i.

収音側の空間に配置されたマイクロホン１−１，１−２，…，１−Ｍは、収音側の空間の音源Ｓで発せられた音を収音して時間領域の信号を生成する。生成された信号は、周波数変換部１１１に送られる。r^- _m,i=(x_m,i,y_m,i,z_m,i)に位置するマイクロホン１−ｉで収音された時間領域の時刻ｔの信号をp_i(t)と表記する。 The microphones 1-1, 1-2,..., 1-M arranged in the sound collection side space collect sound generated by the sound source S in the sound collection side space and generate a time domain signal. . The generated signal is sent to the frequency conversion unit 111. r ^- _{m, i} = denoted _{_{(x m, i, y m}} , i, z m, i) a signal at time t of the sound pickup time domain by the microphone 1-i located in the p _i (t) .

＜音源位置検出部１０１＞
音源位置検出部１０１は、収音側の空間で、話者や演奏者などの音源Ｓの位置r^- _ps=(x_ps,y_ps,z_ps)を検出し、出力する。位置検出においては従来技術を用いればよい。例えば、カメラ３等の画像を入力として、画像から位置r^- _psを検出してもよいし、マイクロホンアレイで収音した信号p_i(t)を入力として、信号p_i(t)から音源定位を行って位置r^- _psを検出してもよい。 <Sound source position detection unit 101>
The sound source position detection unit 101 detects and outputs the position r ⁻ _ps = (x _ps , y _ps , z _ps ) of the sound source S such as a speaker or a performer in the sound collection side space. Conventional technology may be used for position detection. For example, an image from the camera 3 or the like may be used as an input to detect the position r ^- _ps from the image, or a signal p _i (t) picked up by a microphone array may be used as an input to determine sound source localization from the signal p _i (t). _{May be used} to detect the position r ^- _ps .

＜フィルタ計算部１０３＞
フィルタ計算部１０３は、音源の位置r^- _ps=(x_ps,y_ps,z_ps)を入力とし、フィルタF(ω)を計算し、出力する。 <Filter calculation unit 103>
The filter calculation unit 103 receives the sound source position r ⁻ _ps = (x _ps , y _ps , z _ps ) as input, calculates the filter F (ω), and outputs it.

例えば、フィルタ計算部１０３は、音源の位置r^- _psに加え、Ｍ個のマイクロホン１−ｉの位置r^- _m,iと、Ｍ個の制御点４−ｉの位置r^- _c,iまたは仮想音源の位置r^- _ps'=(x_ps',y_ps',z_ps')とを入力とする。フィルタ計算部１０３は、音源の位置r^- _psとＭ個のマイクロホン１−ｉの位置r^- _m,iと用いて、その位置関係を求める。音源の位置r^- _psとＭ個のマイクロホン１−ｉの位置r^- _m,iとの位置関係と、仮想音源の位置r^- _ps'とＭ個の制御点４−ｉの位置r^- _c,iとの位置関係とが同じになるように、Ｍ個の制御点４−ｉの位置r^- _c,iから仮想音源の位置r^- _ps'を、または、仮想音源の位置r^- _ps'からＭ個の制御点４−ｉの位置r^- _c,iを求める。 For example, the filter calculation unit 103, the position of the sound source r ^- in addition to _ps, the position of the M microphones 1-i r ^- _{m, i} and the position of the M control point 4-i r ^- _{c, i} or virtual The sound source position r ⁻ _{ps ′} = (x _{ps ′} , y _{ps ′} , z _{ps ′} ) is used as an input. Filter calculation unit 103, the position of the sound source r ^- position of _ps and M microphones 1-i r ^- _m, using a _i, determining the positional relation. Position of the sound source r ^- _ps and M microphones 1-i of the position r ^- _m, the positional relationship between the _i, of the virtual sound source position r ^- _ps' and the position of the M control point 4-i r ^- _c, as the positional relationship between the _i are the same, the position r of the M control point 4-i ^- _c, the position r of the virtual sound source from the _i ^- _', or, a position r of the virtual sound source ^- _ps' _ps from The positions r ^- _{c, i} of M control points 4-i are obtained.

さらに、フィルタ計算部１０３は、スピーカ２−ｐの位置r^- _s,pを入力とする。フィルタ計算部１０３は、音源Ｓ、仮想音源Ｓ’の位置（点音源）から発せられる音は球面波で伝わると仮定し、仮想音源の位置r^- _ps'とスピーカ２−ｐの位置r^- _s,pとを用いて、スピーカ２−ｐの駆動信号の振幅の予測分布（以下「スピーカ駆動信号振幅予測分布」ともいう）w_p(ω)を以下のように計算する。 Further, the filter calculation unit 103 receives the position r ⁻ _{s, p} of the speaker 2- _p as an input. The filter calculation unit 103 assumes that sound emitted from the positions of the sound source S and the virtual sound source S ′ (point sound source) is transmitted by a spherical wave, and the position r ⁻ _{ps ′} of the virtual sound source and the position r ⁻ _{s of the} speaker 2-p. _{, p} , the predicted distribution of the amplitude of the driving signal of the speaker 2-p (hereinafter also referred to as “speaker driving signal amplitude predicted distribution”) w _p (ω) is calculated as follows.

ここで、

here,

である。γ_pは窓関数である。例えば、通常は打ち切り誤差の影響を軽減するために、Tukey窓関数を用いる。ただし、γ_pはどのような窓関数を用いても構わないし、なくてもよい。 It is. γ _p is a window function. For example, a Tukey window function is usually used to reduce the effect of truncation error. However, γ _p may or may not use any window function.

音源が複数存在する場合でも、単一の音源のみを仮定して、式（３）を利用して構わないが、各音源信号の相対的な振幅の比がわかれば、複数音源を仮定してw_p(ω)を計算することもできる。各仮想音源ｑの位置をr^- _ps',q、音源信号の振幅をa_qとすれば、 Even if there are multiple sound sources, it is possible to use Equation (3) assuming only a single sound source. However, if the relative amplitude ratio of each sound source signal is known, multiple sound sources are assumed. w _p (ω) can also be calculated. If the position of each virtual sound source q is r ^- _{ps', q} and the amplitude of the sound source signal is a _q ,

とすればよい。仮想音源ｑの位置をr^- _ps',qは、単一の仮想音源r^- _ps'と同様の方法で求めればよい。ただし、各ｑ＝１，２，…，Ｑであり、Ｑは音源の個数である。音源信号の振幅a_qの取得方法は、従来技術を用いればよい。例えば、各話者の口元に取り付けたピンマイクの信号から、各処理フレームでの信号の2乗平均の平方根を取ることで推定できる。または、マイクロホンアレイで指向性制御を行い、各音源位置の方向からの到来音波のパワーを用いることもできる。複数の音源が存在し、各音源の振幅またはその比がわかる場合には、それらの値を入力として、式（４）により、w^_p(ω)を求め、求めたw^_p(ω)を用いて、式（２）によりスピーカ駆動信号振幅予測分布w_p(ω)を求めたほうが、単一の音源のみを仮定した場合よりも、再現された音場における音質や定位感を向上させることができる。 And it is sufficient. The position r ^- _{ps ', q} of the virtual sound source _q may be obtained by the same method as that for the single virtual sound source r ^- _ps' . Where q = 1, 2,..., Q, where Q is the number of sound sources. A conventional technique may be used as a method for obtaining the amplitude a _q of the sound source signal. For example, it can be estimated by taking the square root of the root mean square of the signal in each processing frame from the signal of the pin microphone attached to the mouth of each speaker. Alternatively, directivity control can be performed with a microphone array, and the power of incoming sound waves from the direction of each sound source position can be used. If there are multiple sound sources and the amplitude or ratio of each sound source is known, use these values as input to find w ^ _p (ω) using equation (4), and find the calculated w ^ _p (ω) Using the formula (2), the speaker drive signal amplitude prediction distribution w _p (ω) is obtained, which improves the sound quality and the sense of localization in the reproduced sound field, rather than assuming only a single sound source. be able to.

次に、フィルタ計算部１０３は、M個の制御点４−ｉとL個のスピーカ２−ｐとの間のM×L個の伝達特性G(ω,r^- _c,i-r^- _s,p)を用いて、空間相関行列R^-(ω)∈C^M×Mを以下のように計算する。ただし、Cは複素数全体の集合を表す。 Next, the filter calculation unit 103, M × L number of transfer characteristic G between the M control point 4-i and the L loudspeakers ^{2-p (ω, r -} c, i -r - s, _{Using p} ), the spatial correlation matrix R ⁻ (ω) ∈C ^{M × M} is calculated as follows. Where C represents the set of all complex numbers.

ここで、ｎ＝１，２，…，Ｍであり、上付き添え字＊は複素共役を表し、R_in(ω)は空間相関行列R^-(ω)の(i,n)番目の要素である。なお、M×L個の伝達特性G(ω,r^- _c,i-r^- _s,p)は前述の通り、予め測定やシミュレーションにより取得するか、モノポール特性を仮定して計算して、記憶部等に記憶しておく。フィルタ計算部１０３は、空間相関行列R^-(ω)の固有値展開を計算する。 Here, n = 1, 2,..., M, the superscript * represents the complex conjugate, and R _in (ω) is the (i, n) -th element of the spatial correlation matrix R ⁻ (ω). is there. The M × L transfer characteristics G (ω, r ^- _{c, i-} r ^- _{s, p} ) are obtained in advance by measurement or simulation as described above, or calculated assuming the monopole characteristics, Store it in a storage unit or the like. The filter calculation unit 103 calculates the eigenvalue expansion of the spatial correlation matrix R ⁻ (ω).

ここで、Λ(ω)を空間相関行列R^-(ω)の固有値成分の対角行列、U(ω)を対応する固有ベクトルを列とする行列、上付き添え字Hを複素共役転置とする。なお、固有値展開にはどのような数値計算アルゴリズムを用いても構わない。 Here, Λ (ω) is a diagonal matrix of the eigenvalue components of the spatial correlation matrix R ⁻ (ω), U (ω) is a matrix with the corresponding eigenvectors as columns, and the superscript H is a complex conjugate transpose. Any numerical calculation algorithm may be used for eigenvalue expansion.

フィルタ計算部１０３は、信号変換用のフィルタF(ω)∈C^L×Mの第p行目を以下のように計算する。 The filter calculation unit 103 calculates the p-th row of the signal conversion filter F (ω) εC ^{L × M} as follows.

ここで、αは正則化パラメータ、Ｉは単位行列を表しG_p(ω)∈C^M×1は伝達特性G(ω,r^- _c,i-r^- _s,p)を各要素にもつベクトルである。1≦p≦Lの各行f^- _p(ω)を計算し、フィルタF(ω)を得る。 Here, alpha is a regularization parameter, I is _{^{G p (ω) ∈C M ×}} 1 represents a unit matrix transfer characteristic G with ^{_{^{(ω, r - - c,}}} i -r s, p) to each element vector It is. Each row f ⁻ _p (ω) with 1 ≦ p ≦ L is calculated to obtain a filter F (ω).

フィルタF(ω)は、入力である音源の位置r^- _psに応じてオンラインで計算しても構わないし、予め複数の音源の位置に対して計算しておいた複数のフィルタF(ω)の中から、収音時の音源の位置と、最も近い音源の位置に対応するフィルタF(ω)を選択して出力する構成でも構わない。 The filter F (ω) may be calculated online in accordance with the input sound source position r ^- _ps , or a plurality of filters F (ω) previously calculated for a plurality of sound source positions. A configuration may be adopted in which a filter F (ω) corresponding to the position of the sound source at the time of sound collection and the closest sound source position is selected and output.

＜周波数変換部１１１＞
周波数変換部１１１は、マイクロホン１−ｉで収音された信号p_i(t)をフーリエ変換により周波数領域信号P_i(ω)に変換する（Ｓ１）。生成された周波数領域信号P_i(ω)は、変換フィルタ部１１３に送られる。ωは周波数である。なお、ωを音速cで割った値k=ω/cを波数として定義する。波数とは、いわゆる空間周波数又は角度スペクトルのことである。例えば、短時間離散フーリエ変換により周波数領域信号P_i(ω)が生成される。もちろん、他の既存の方法により周波数領域信号P_i(ω)を生成してもよい。また、オーバーラップアド等の方法を用いて周波数領域信号P_i(ω)を生成してもよい。入力信号が長い場合や、リアルタイム処理のように連続して信号が入力される場合には、例えば10msごとといったフレームごとに処理を行う。周波数領域信号P_i(ω)は、例えば次式のように定義される。 <Frequency converter 111>
The frequency converter 111 converts the signal p _i (t) collected by the microphone 1-i into a frequency domain signal P _i (ω) by Fourier transform (S1). The generated frequency domain signal P _i (ω) is sent to the transform filter unit 113. ω is a frequency. A value k = ω / c obtained by dividing ω by the speed of sound c is defined as a wave number. The wave number is a so-called spatial frequency or angular spectrum. For example, the frequency domain signal P _i (ω) is generated by short-time discrete Fourier transform. Of course, the frequency domain signal P _i (ω) may be generated by other existing methods. Further, the frequency domain signal P _i (ω) may be generated using a method such as overlap add. When the input signal is long or when the signal is continuously input as in real time processing, the processing is performed for each frame such as every 10 ms. The frequency domain signal P _i (ω) is defined as follows, for example.

ただし、exp関数の引数の中のjは虚数単位である。 However, j in the argument of the exp function is an imaginary unit.

＜変換フィルタ部１１３＞
変換フィルタ部１１３は、周波数領域信号P_i(ω)とフィルタF(ω)とを入力とし、周波数領域信号P_i(ω)に対してフィルタF(ω)を適用してフィルタ処理後信号D_p(ω)を生成する（Ｓ４）。フィルタ処理後信号D(ω)は、周波数逆変換部１１５に送信される。 <Conversion filter unit 113>
The transformation filter unit 113 receives the frequency domain signal P _i (ω) and the filter F (ω) as inputs, applies the filter F (ω) to the frequency domain signal P _i (ω), and performs a filtered signal D _p (ω) is generated (S4). The filtered signal D (ω) is transmitted to the frequency inverse transform unit 115.

周波数領域信号P_i(ω)とフィルタ処理後信号D_p(ω)とをそれぞれベクトル形式でP(ω)∈C^M×1、D(ω)∈C^L×1と表記するとき、 When the frequency domain signal P _i (ω) and the filtered signal D _p (ω) are respectively expressed in vector form as P (ω) ∈C ^{M × 1} and D (ω) ∈C ^{L × 1} ,

として計算する。 Calculate as

＜周波数逆変換部１１５＞
周波数逆変換部１１５は、周波数領域信号D_p(ω)を逆フーリエ変換により時間領域信号d_p(t)に変換する（Ｓ６）。逆フーリエ変換によりフレーム毎に得られた時間領域信号d_p(t)は適宜シフトされて線形和が取られて、連続した時間領域信号となる。逆フーリエ変換は短時間離散逆フーリエ変換等の既存の方法を用いればよい。例えば、離散逆フーリエ変換（IDFT）は例えば次式で定義される。 <Frequency inverse transform unit 115>
The frequency inverse transform unit 115 transforms the frequency domain signal D _p (ω) into a time domain signal d _p (t) by inverse Fourier transform (S6). The time domain signal d _p (t) obtained for each frame by the inverse Fourier transform is appropriately shifted to obtain a linear sum, and becomes a continuous time domain signal. For the inverse Fourier transform, an existing method such as a short-time discrete inverse Fourier transform may be used. For example, the discrete inverse Fourier transform (IDFT) is defined by the following equation, for example.

時間領域信号d_p(t)は、スピーカ２−ｐに送られる。 The time domain signal d _p (t) is sent to the speaker 2-p.

スピーカ２−ｐは、時間領域信号d_p(t)に基づいて音を再生する。例えば、r^- _s,p=(x_s,p,0,z_s,p)に位置するスピーカ２−ｐが時間領域信号d_p(t)に基づいて音を再生する。これにより、第一の空間の音場を第二の空間に再現することができる。 The speaker 2-p reproduces sound based on the time domain signal d _p (t). For example, the speaker 2-p located at r ⁻ _{s, p} = (x _{s, p} , 0, z _{s, p} ) reproduces sound based on the time domain signal d _p (t). Thereby, the sound field of the first space can be reproduced in the second space.

＜効果＞
このような構成により、マイクロホンアレイで収音した音圧と、収音側のおおまかな音源位置とを併用した信号変換を行い、空間エイリアシングによる誤差を軽減することができる。再現された音場における、音質や定位感の劣化の改善を可能にする。特に従来技術では、マイクロホンの数がスピーカの数よりも少ない場合に顕著に音質や定位感の劣化が生じるが、本実施形態であれば、マイクロホンの数がスピーカの数よりも少ない場合にも劣化を改善できる。 <Effect>
With such a configuration, signal conversion using both the sound pressure collected by the microphone array and the approximate sound source position on the sound collection side can be performed, and errors due to spatial aliasing can be reduced. Improves the degradation of sound quality and localization in the reproduced sound field. In particular, in the conventional technology, when the number of microphones is smaller than the number of speakers, the sound quality and the localization feeling are remarkably deteriorated. However, in this embodiment, the deterioration is also caused when the number of microphones is smaller than the number of speakers. Can be improved.

＜変形例＞
なお、制御点４−ｉは、スピーカアレイの前面に、マイクロホンアレイのマイクロホン１−ｉに対応するように配置する。マイクロホン１−ｉと制御点４−ｉとは、配置が対応していればよく、その配置の形状は、平面状である必要はなく、自由に配置してよい。 <Modification>
The control point 4-i is disposed on the front surface of the speaker array so as to correspond to the microphone 1-i of the microphone array. The microphones 1-i and the control points 4-i only need to correspond to each other, and the shape of the arrangement does not need to be a flat shape, and may be freely arranged.

また、周波数ωごとに異なる音源位置に対するフィルタF(ω)を計算したり、適用しても構わない。換言すると、全ての周波数ωに対して、同じ音源の位置を仮定したフィルタF(ω)を計算したり、適用しなくともよい。周波数ωによって音源位置検出の精度が異なる場合がある。例えば、高い周波数においては精度が高いが、低い周波数においては精度が低くなる場合等がある。その場合、フィルタ計算部１０３は、精度の高い周波数では、検出される音源の位置の全てを使って、式（４）により、w^_p(ω)を求め、精度の低い周波数では、検出される音源の位置の一部を使って、式（３）（検出される音源の位置の一部が、単一の音源からなる場合）または式（４）（検出される音源の位置の一部が、検出される音源の位置の全てではないが、２個以上の音源からなる場合）により、w^_p(ω)を求める。 Also, the filter F (ω) for different sound source positions for each frequency ω may be calculated or applied. In other words, it is not necessary to calculate or apply the filter F (ω) assuming the same sound source position for all frequencies ω. The accuracy of sound source position detection may differ depending on the frequency ω. For example, the accuracy may be high at a high frequency, but the accuracy may be low at a low frequency. In that case, the filter calculation unit 103 obtains w ^ _p (ω) using Equation (4) using all the positions of the detected sound source at a high-precision frequency, and is detected at a low-precision frequency. (3) (when a part of the detected sound source position consists of a single sound source) or (4) (a part of the detected sound source position) However, not all of the detected sound source positions, but when two or more sound sources are used), w ^ _p (ω) is obtained.

変換フィルタ部１１３は、精度の高い周波数では、検出される音源の位置の全てを使って、求めたフィルタF(ω)を適用し、精度の低い周波数では、検出される音源の位置の一部を使って、求めたフィルタF(ω)を適用する。このような構成とすることで、周波数ωによって音源位置検出の精度が異なる場合（特に精度が低い周波数ωを含む場合）には、全ての周波数ωに対して、同じ音源の位置を仮定したフィルタF(ω)を計算するよりも、再現された音場における音質や定位感を向上させることができる。 The conversion filter unit 113 applies the obtained filter F (ω) using all of the detected sound source positions at high-accuracy frequencies, and a part of the detected sound source positions at low-accuracy frequencies. Is used to apply the obtained filter F (ω). By adopting such a configuration, when the accuracy of sound source position detection differs depending on the frequency ω (particularly when the accuracy includes a frequency ω with low accuracy), the filter assumes the same sound source position for all frequencies ω. Rather than calculating F (ω), it is possible to improve sound quality and localization in the reproduced sound field.

本実施形態では、収音側の空間の音場を再生側の空間で再現しているが、一つの部屋にマイクロホンアレイとスピーカアレイを設置し、音源からの音波を消音するために利用してもよい。この場合、マイクロホンアレイのマイクロホン１−ｉと同じ配置の制御点４−ｉを設定し、フィルタ係数を逆相とすることによって、実現可能である。この場合、マイクロホンアレイとスピーカアレイとは同一空間に向かい合うように配置される。音源Ｓがマイクロホンアレイの前方に位置する場合には、制御点４−ｉよりマイクロホンアレイ側が消音領域となる（図４Ａ及び図４Ｂ参照）。一方、音源Ｓがマイクロホンアレイの後方に位置する場合には、制御点４−ｉの位置に関係なく、マイクロホンアレイ周辺のみが消音領域となる（図４Ｃ参照）。このような構成により、再現された音場における、音質や定位感の劣化の改善を可能とし、音源からの音波との類似性を高めることで、消音性能を高めている。 In this embodiment, the sound field of the sound collection side space is reproduced in the reproduction side space, but a microphone array and a speaker array are installed in one room and used to mute sound waves from the sound source. Also good. In this case, it can be realized by setting the control points 4-i having the same arrangement as the microphones 1-i of the microphone array and setting the filter coefficients in reverse phase. In this case, the microphone array and the speaker array are arranged to face the same space. When the sound source S is located in front of the microphone array, the microphone array side is a muffler region from the control point 4-i (see FIGS. 4A and 4B). On the other hand, when the sound source S is located behind the microphone array, only the periphery of the microphone array is the mute area regardless of the position of the control point 4-i (see FIG. 4C). With such a configuration, it is possible to improve the deterioration of sound quality and localization in the reproduced sound field, and the sound deadening performance is enhanced by increasing the similarity with the sound wave from the sound source.

＜第二実施形態＞
第一実施形態と異なる部分を中心に説明する。
第二実施形態では、直線状にスピーカアレイを配置した場合について説明する。 <Second embodiment>
A description will be given centering on differences from the first embodiment.
In the second embodiment, a case will be described in which speaker arrays are arranged linearly.

各スピーカ２−ｐの位置をr^- _s,p=(x_s,p,0,0)とする。つまり、各スピーカ２−ｐは、x軸上に直線状に配置される（図５参照）。また、各スピーカ２−ｐの位置r^- _s,pから制御点４−ｉの位置r^- _c,iまでの伝達関数をG(ω,r^- _c,i-r^- _s,p)と置き、既知であるとする。これはあらかじめ測定やシミュレーションにより取得するか、以下のように線音源特性を仮定して計算する。 Assume that the position of each speaker 2-p is r ^- _{s, p} = ( _{xs, p} , 0,0). That is, each speaker 2-p is arranged linearly on the x axis (see FIG. 5). The position r of the speaker 2-p ^- Position ^{_{(s, p ω, r -}} - c, i -r) and ^- _s, the position of the control point 4-i from _p r _c, the transfer function up to _i G Suppose that it is already known. This is obtained in advance by measurement or simulation, or is calculated assuming the line source characteristics as follows.

ただし、H⁽¹⁾ ₀は０次の第一種ハンケル関数を表す。または、第一実施形態の平面状スピーカアレイの場合と同じく、モノポール特性を仮定して計算してもよい。直線状スピーカアレイの場合は、フィルタ計算部１０３におけるスピーカ駆動信号振幅予測分布w_pの計算のみが平面状スピーカアレイの場合と異なる。 Here, H ⁽¹⁾ ₀ represents a zeroth-order first-class Hankel function. Or, similarly to the planar speaker array of the first embodiment, it may be calculated assuming monopole characteristics. In the case of the linear speaker array, only the calculation of the speaker drive signal amplitude prediction distribution w _p in the filter calculation unit 103 is different from the case of the planar speaker array.

フィルタ計算部１０３は、仮想音源の位置r^- _ps'とスピーカ２−ｐの位置r^- _s,pとを用いて、スピーカ２−ｐの駆動信号の振幅の予測分布（以下「スピーカ駆動信号振幅予測分布」ともいう）w_p(ω)を以下のように計算する。 The filter calculation unit 103 uses the virtual sound source position r ⁻ _{ps ′} and the position r ⁻ _{s, p} of the speaker 2-p to predict the distribution of the amplitude of the driving signal of the speaker 2-p (hereinafter “speaker driving signal amplitude”). W _p (ω) (also referred to as “predictive distribution”) is calculated as follows.

ここで、

here,

である。ただし、H⁽¹⁾ ₀は０次の第一種ハンケル関数である。なお、スピーカ駆動信号振幅予測分布w_p(ω)を平面状スピーカアレイの場合と同じように計算してもよい。ただし、直線状スピーカアレイに対するフィルタ係数としては、式（１３）に基づいて生成したもののほうが精度が高い。 It is. However, H ⁽¹⁾ ₀ is a zeroth-order first-class Hankel function. Note that the speaker drive signal amplitude prediction distribution w _p (ω) may be calculated in the same manner as in the case of the planar speaker array. However, as the filter coefficient for the linear speaker array, the filter coefficient generated based on Expression (13) has higher accuracy.

音源が複数存在する場合でも、単一の音源のみを仮定して、式（１３）を利用して構わないが、各音源信号の振幅またはその比がわかれば、次式により、複数音源を仮定してw_p(ω)を計算することもできる。 Even if there are a plurality of sound sources, only a single sound source may be assumed and Equation (13) may be used. However, if the amplitude of each sound source signal or its ratio is known, the following equation is used to assume a plurality of sound sources. And w _p (ω) can also be calculated.

ただし、H⁽¹⁾ ₁は１次の第一種ハンケル関数である。 Where H ⁽¹⁾ ₁ is a first-order first-class Hankel function.

＜効果＞
このような構成により、第一実施形態と同様の効果を得ることができる。なお、第二実施形態においても、第一実施形態の変形例と同様の変形を行うことができる。 <Effect>
With such a configuration, the same effect as that of the first embodiment can be obtained. In the second embodiment, the same modification as the modification of the first embodiment can be performed.

＜その他の変形例＞
音場収音再生装置を構成する各部は、収音側の空間に配置された収音装置と再生側の空間に配置された再生装置の何れに備えられていてもよい。換言すれば、周波数変換部１１１、変換フィルタ部１１３、周波数逆変換部１１５、音源位置検出部１０１及びフィルタ計算部１０３のそれぞれの処理は、収音側の空間に配置された収音装置で実行されてもよいし、再生側の空間に配置された再生装置で実行されてもよい。収音装置で生成された信号は、再生装置に送信される。 <Other variations>
Each unit constituting the sound field sound collecting / reproducing device may be provided in either the sound collecting device arranged in the sound collecting side space or the reproducing device arranged in the reproducing side space. In other words, the processes of the frequency conversion unit 111, the conversion filter unit 113, the frequency inverse conversion unit 115, the sound source position detection unit 101, and the filter calculation unit 103 are executed by a sound collection device arranged in the sound collection side space. Or may be executed by a playback device arranged in a space on the playback side. The signal generated by the sound collection device is transmitted to the reproduction device.

収音側の空間と再生側の空間の位置は、図１や図５に示したものに限定されない。収音側の空間と再生側の空間は、隣接していても互いに離れた位置にあってもよい。また、収音側の空間と再生側の空間の向きもどのようなものであってもよい。 The positions of the sound collection side space and the reproduction side space are not limited to those shown in FIGS. 1 and 5. The space on the sound collection side and the space on the reproduction side may be adjacent to each other or at positions separated from each other. Further, the direction of the sound collection side space and the reproduction side space may be arbitrary.

音場収音再生装置は、変換フィルタ部１１３を含みさえすれば、他の部を備えていなくてもよい。例えば、音場収音再生装置は、変換フィルタ部１１３及び周波数逆変換部１１５から構成されていてもよい。また、音場収音再生装置は、周波数変換部１１１、及び変換フィルタ部１１３から構成されていてもよい。 As long as the sound field sound collecting / reproducing apparatus includes the conversion filter unit 113, the sound field collecting / reproducing device may not include other units. For example, the sound field sound collecting / reproducing device may include a conversion filter unit 113 and a frequency inverse conversion unit 115. Further, the sound field sound collecting / reproducing apparatus may include a frequency conversion unit 111 and a conversion filter unit 113.

音場収音再生装置は、コンピュータによって実現することができる。この場合、この装置の各部の処理内容はプログラムによって記述される。そして、このプログラムをコンピュータで実行することにより、この装置における各部がコンピュータ上で実現される。 The sound field sound collecting / reproducing apparatus can be realized by a computer. In this case, the processing content of each part of this apparatus is described by a program. Then, by executing this program on a computer, each unit in this apparatus is realized on the computer.

この処理内容を記述したプログラムは、コンピュータで読み取り可能な記録媒体に記録しておくことができる。また、この形態では、コンピュータ上で所定のプログラムを実行させることにより、これらの装置を構成することとしたが、これらの処理内容の少なくとも一部をハードウェア的に実現することとしてもよい。 The program describing the processing contents can be recorded on a computer-readable recording medium. In this embodiment, these apparatuses are configured by executing a predetermined program on a computer. However, at least a part of these processing contents may be realized by hardware.

この発明は、上述の実施形態に限定されるものではなく、本発明の趣旨を逸脱しない範囲で適宜変更が可能である。 The present invention is not limited to the above-described embodiment, and can be modified as appropriate without departing from the spirit of the present invention.

Claims

複数のマイクロホンが配置されている空間には音源が一つ以上存在し、複数のスピーカが直線状または平面状に波面の進行方向に対し制御点よりも後方に配置されるものとし、ωを周波数とし、
前記複数のスピーカにより再現される仮想音源の位置と前記複数のスピーカの位置とに基づいて求まるスピーカの駆動信号の振幅の予測分布と、前記複数のスピーカの各位置から複数の前記制御点の各位置までの伝達特性とに基づいて求まり、前記仮想音源の位置からの波面を前記制御点で実現するフィルタF(ω)を、前記複数のマイクロホンで収音された信号それぞれの周波数領域信号P（ω）に対して適用してフィルタ処理後信号D(ω)を生成する変換フィルタ部を含む、
音場収音再生装置。 One or more sound sources exist in the space where the plurality of microphones are arranged, and the plurality of speakers are arranged linearly or planarly behind the control point with respect to the traveling direction of the wavefront, and ω is the frequency age,
A predicted distribution of the amplitude of the driving signal of the speaker obtained based on the position of the virtual sound source reproduced by the plurality of speakers and the position of the plurality of speakers, and each of the plurality of control points from each position of the plurality of speakers. A filter F (ω) that is obtained based on the transfer characteristic up to the position and realizes the wavefront from the position of the virtual sound source at the control point, and each of the frequency domain signals P ( including a transform filter unit applied to ω) to generate a filtered signal D (ω),
Sound field recording and playback device.

請求項１の音場収音再生装置であって、
w_p(ω)をスピーカｐの駆動信号の振幅の予測分布とし、G_p(ω)をスピーカｐの位置r^- _s,pから制御点ｃの位置r^- _c,iまでの伝達特性G(ω,r^- _c,i-r^- _s,p)を要素にもつベクトル、上付き添え字Hを複素共役転置、Λ(ω)を空間相関行列の固有値成分の対角行列、U(ω)を対応する固有ベクトルを列とする行列、αを正則化パラメータ、Iを単位行列、Cを複素数全体の集合とし、前記フィルタF(ω)∈C^L×Mの第p行目は、

またはf^-p(ω)の逆相と定義され、
前記変換フィルタ部は、フィルタ処理後信号D(ω)を

として生成する、
音場収音再生装置。 The sound field collecting and reproducing device according to claim 1,
w _p a (omega) and the predicted distribution of the amplitude of the drive signal of the speaker p, the position of G _p (ω) speaker p r ^- _s, the position of the control point c from _p r ^- _c, transfer to _i characteristic G ( ω, r ^- _{c, i} -r ^- _{s, p} ) elements, superscript H is the complex conjugate transpose, Λ (ω) is the diagonal matrix of the eigenvalue components of the spatial correlation matrix, U (ω) Is a matrix with corresponding eigenvectors as columns, α is a regularization parameter, I is a unit matrix, C is a set of all complex numbers, and the p-th row of the filter F (ω) ∈C ^{L × M} is

Or defined as the reverse phase of f ^- p (ω)
The conversion filter unit converts the filtered signal D (ω)

Generate as
Sound field recording and playback device.

請求項１または２の音場収音再生装置であって、
前記複数のスピーカが平面状に配置されるものとし、γ_pを窓関数、y_s,pをスピーカｐの位置r^- _s,pのｙ座標、y_ps'を仮想音源の位置r^- _ps'のｙ座標、ｊを虚数単位、kを波数、cを音速、k=ω/cとし、スピーカｐの駆動信号の振幅の予測分布w_p(ω)は、

と定義される、
音場収音再生装置。 The sound field sound collecting and reproducing device according to claim 1 or 2,
It is assumed that the plurality of speakers are arranged in a plane, γ _p is a window function, y _{s, p} is the y-coordinate of the position r ⁻ _{s, p} of the speaker p, and y _{ps ′} is the position r ⁻ _{ps ′ of the} virtual sound source. Y coordinate, j is imaginary unit, k is wave number, c is sound velocity, k = ω / c, and the predicted distribution w _p (ω) of the driving signal of the speaker p is

Defined as
Sound field recording and playback device.

請求項１または２の音場収音再生装置であって、
前記複数のスピーカが直線状に配置されるものとし、γ_pを窓関数、y_s,pをスピーカｐの位置r^- _s,pのｙ座標、y_ps'を仮想音源の位置r^- _ps'のｙ座標、ｊを虚数単位、kを波数、cを音速、k=ω/c、H⁽¹⁾ ₀を０次の第一種ハンケル関数とし、スピーカｐの駆動信号の振幅の予測分布w_p(ω)は、

と定義される、
音場収音再生装置。 The sound field sound collecting and reproducing device according to claim 1 or 2,
The plurality of speakers are arranged in a straight line, γ _p is a window function, y _{s, p} is the y-coordinate of the position r ⁻ _{s, p} of the speaker p, and y _{ps ′} is the position r ⁻ _{ps ′ of the} virtual sound source. Y-coordinate, j is imaginary unit, k is wave number, c is sound velocity, k = ω / c, H ⁽¹⁾ ₀ is the first-order first-class Hankel function, and the predicted distribution w of the driving signal of speaker p is w _p (ω) is

Defined as
Sound field recording and playback device.

請求項１から４の何れかの音場収音再生装置であって、
スピーカｐの駆動信号の振幅の予測分布w_p(ω)は、

と定義され、
音源を複数とし、γ_pを窓関数、y_s,pをスピーカｐの位置r^- _s,pのｙ座標、y_ps',qを仮想音源ｑの位置r^- _ps',qのｙ座標、ｊを虚数単位、kを波数、cを音速、k=ω/c、音源の振幅またはその比をa_q、H⁽¹⁾ ₀を０次の第一種ハンケル関数とし、
前記複数のスピーカが平面状に配置される場合には、前記w^_pを

とし、
前記複数のスピーカが直線状に配置される場合には、前記w^_pを

とする、
音場収音再生装置。 The sound field recording and reproducing device according to any one of claims 1 to 4,
The predicted distribution w _p (ω) of the amplitude of the driving signal of the speaker p is

Defined as
A plurality of sound sources, γ _p is a window function, y _{s, p} is the y coordinate of the position r ⁻ _{s, p} of the speaker p, y _{ps ′, q} is the y coordinate of the position r ⁻ _{ps ′, q} of the virtual sound source q, j is the imaginary unit, k is the wave number, c is the speed of sound, k = ω / c, the amplitude of the sound source or its ratio is a _q , and H ⁽¹⁾ ₀ is the first-order Hankel function of the 0th order,
When the plurality of speakers are arranged in a plane, the w ^ _p

age,
If the multiple speakers are arranged in a straight line, the w ^ _p

And
Sound field recording and playback device.

請求項５の音場収音再生装置であって、
前記変換フィルタ部は、周波数ωごとに、異なる音源の位置に対するフィルタF(ω)を適用する、
音場収音再生装置。 The sound field collecting and reproducing device according to claim 5,
The conversion filter unit applies a filter F (ω) for different sound source positions for each frequency ω.
Sound field recording and playback device.

複数のマイクロホンが配置されている空間には音源が一つ以上存在し、複数のスピーカが直線状または平面状に波面の進行方向に対し制御点よりも後方に配置されるものとし、ωを周波数とし、
前記複数のスピーカにより再現される仮想音源の位置と前記複数のスピーカの位置とに基づいて求まるスピーカの駆動信号の振幅の予測分布と、前記複数のスピーカの各位置から複数の前記制御点の各位置までの伝達特性とに基づいて求まり、前記仮想音源の位置からの波面を前記制御点で実現するフィルタF(ω)を、前記複数のマイクロホンで収音された信号それぞれの周波数領域信号P（ω）に対して適用してフィルタ処理後信号D(ω)を生成する変換フィルタステップを含む、
音場収音再生方法。 One or more sound sources exist in the space where the plurality of microphones are arranged, and the plurality of speakers are arranged linearly or planarly behind the control point with respect to the traveling direction of the wavefront, and ω is the frequency age,
A predicted distribution of the amplitude of the driving signal of the speaker obtained based on the position of the virtual sound source reproduced by the plurality of speakers and the position of the plurality of speakers, and each of the plurality of control points from each position of the plurality of speakers. A filter F (ω) that is obtained based on the transfer characteristic up to the position and realizes the wavefront from the position of the virtual sound source at the control point, and each of the frequency domain signals P ( including a transform filter step applied to ω) to generate a filtered signal D (ω),
Sound field recording and playback method.

請求項１から請求項６の何れかの音場収音再生装置として、コンピュータを機能させるためのプログラム。 A program for causing a computer to function as the sound field sound collecting / reproducing apparatus according to any one of claims 1 to 6.