JP2011139263A

JP2011139263A - Stereophonic sound generation system, method for controlling the same and control program

Info

Publication number: JP2011139263A
Application number: JP2009297581A
Authority: JP
Inventors: Sho Ko; 捷黄
Original assignee: University of Aizu
Current assignee: University of Aizu
Priority date: 2009-12-28
Filing date: 2009-12-28
Publication date: 2011-07-14
Anticipated expiration: 2029-12-28
Also published as: JP5472613B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a stereophonic sound generation system using a plane arrangement speaker capable of generating sound images localized, not only in the horizontal direction but also in the arbitrary vertical direction, by a simple method, and to provide a method for controlling the stereophonic sound generation system and a control program. <P>SOLUTION: A sound image localization setting part 301 sets localization positions (θ, α) of the sound images. A speaker selection part 302 selects speakers 40, from which sound signals from a sound source 10 are output, based on the localization positions of the sound images. A head transfer function determination part 303 reads and determines a head transfer function corresponding to the respective speakers 40 from a head transfer function table 305 based on sound image localization. An energy distribution part 304 calculates energy amounts of the sound signals to be output from the respective speakers 40. A convolution processing part 306 convolves the head transfer function determined to the sound signals to be output from the selected speakers 40, and in addition, distributes calculated energy of the sound signals for every speaker 40. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、平面配置スピーカを用いた立体音響生成システムに関するものであって、特に、立体音響を生成する時に頭部伝達関数（ＨＲＴＦ）を左右のスピーカグループに加えることによって立体効果を高めたシステムに係る。 The present invention relates to a stereophonic sound generation system using planarly arranged speakers, and in particular, a system that enhances the stereoeffect by adding a head related transfer function (HRTF) to left and right speaker groups when generating stereophonic sound. Concerning.

従来では、３Ｄの立体音響を再現する手法として、特許文献１に示されるように、全空間をカバーするように立体的にスピーカを配置し、各スピーカの音量比を操作して立体的な音像を生成するＡＰ（Amplitude Panning）法がある。このようなＡＰ法は、ターゲットとする音像方向の近傍のスピーカだけを利用するので、仰角方向の音像を再現するためには高い位置にスピーカを配置する必要がある。 Conventionally, as a technique for reproducing 3D stereophonic sound, as disclosed in Patent Document 1, three-dimensional sound images are arranged by three-dimensionally arranging speakers so as to cover the entire space and operating the volume ratio of each speaker. There is an AP (Amplitude Panning) method for generating. Since such an AP method uses only a speaker in the vicinity of the target sound image direction, it is necessary to place the speaker at a high position in order to reproduce a sound image in the elevation angle direction.

しかしながら、一般的な家庭環境では、立体配置スピーカシステムのように高い位置にスピーカを配置することは困難であるため、この観点から、平面配置スピーカによる立体音響生成システムに関する提案が、例えば特許文献２や特許文献３に示すように従来からなされている。これらは、クロストークのない２チャンネルヘッドホン再生に近い状態をスピーカシステムで実現する方法である。代表的なものとしてクロストークを逆フィルターによってキャンセルするトランスノーラル法などがある。 However, in a general home environment, it is difficult to arrange a speaker at a high position as in a three-dimensional speaker system. From this viewpoint, a proposal regarding a three-dimensional sound generation system using a planar speaker is disclosed in Patent Document 2, for example. Or, as shown in Patent Document 3, it has been conventionally performed. These are methods for realizing a state close to 2-channel headphone playback without crosstalk with a speaker system. A typical example is a transnormal method in which crosstalk is canceled by an inverse filter.

また、特許文献４のように、ＦＩＲフィルタを適用することにより、同一平面に配置された２つのスピーカであっても音像を水平方向だけでなく上下方向にパンニング可能な３次元パンニング装置も提案されている。 Further, as disclosed in Patent Document 4, a three-dimensional panning device that can pan a sound image not only in a horizontal direction but also in a vertical direction by using two speakers arranged in the same plane by applying an FIR filter has been proposed. ing.

特開２００９−７７３７９号公報JP 2009-77379 A 特開平９−２３３６００号公報JP-A-9-233600 特開２００８−１６０２６５号公報JP 2008-160265 A 特許第３１７７７１４号公報Japanese Patent No. 3177714

ところで、上記特許文献２や特許文献３に示すような平面配置スピーカを用いた立体音響生成システムでは、頭部伝達関数を用いたステレオヘッドホン、あるいはクロストークキャンセル処理を加えた２チャンネルのスピーカシステムを使用しているが、処理が複雑で、受聴位置が厳しい制限を受けるといった課題が生じる。すなわち、特許文献２や特許文献３のようなトランスノーラル法を採用してクロストークをキャンセルする方法は、受聴位置が狭く当該位置からズレてしまうとクロストークが含まれてしまい、さらには環境の反響を受け易いといった課題がある。 By the way, in the stereophonic sound generation system using the planar arrangement speaker as shown in Patent Document 2 and Patent Document 3 described above, a stereo headphone using a head-related transfer function or a two-channel speaker system to which a crosstalk cancellation process is added. Although it is used, the problem is that the processing is complicated and the listening position is severely restricted. In other words, the method of canceling crosstalk by adopting the transnoral method such as Patent Document 2 and Patent Document 3 includes crosstalk when the listening position is narrow and deviates from the position, and further, the environment. There is a problem that it is easy to receive a response.

また、上記特許文献４示した方法では、上下方向のパンニングをすべてＦＩＲフィルタで処理するために、その演算処理に時間が掛かるという課題がある。 In addition, the method disclosed in Patent Document 4 has a problem in that it takes time to perform the arithmetic processing because all panning in the vertical direction is processed by the FIR filter.

本発明は上記のような従来技術の課題を解決するために提案されたものであって、その目的は、水平方向だけでなく、任意の垂直方向に定位する音像を簡単な方法により生成することが可能な平面配置スピーカを用いた立体音響生成システム、その制御方法及び制御プログラムを提供することにある。 The present invention has been proposed in order to solve the above-described problems of the prior art, and an object thereof is to generate a sound image localized not only in the horizontal direction but also in an arbitrary vertical direction by a simple method. The present invention provides a stereophonic sound generation system using a planarly arranged speaker, a control method thereof, and a control program.

前記の目的を達成するために、本発明の平面配置スピーカによる立体音響生成システムは、５チャンネルの平面配置スピーカを備え、音源からの音声信号に音像定位処理を施すことで音像を任意の位置に定位させ、当該スピーカを通じて立体音響を生成する立体音響生成システムであって、各種操作を入力する操作部と、音像の定位位置を生成する音像定位処理部と、を備え、前記音像定位処理部は、前記操作部を通じて入力された所望の音像の定位位置を設定する音像定位設定部と、使用するスピーカを決定し、各スピーカから出力する音量配分を決定する音量配分処理部と、前記音像の定位位置に基づいて、各スピーカから出力される音声信号に畳み込む頭部伝達関数を決定する頭部伝達関数決定部と、使用するスピーカにより出力される音声信号に対して、頭部伝達関数決定部により決定された頭部伝達関数を畳み込み、かつ、前記音声エネルギー配分処理部により算出されたエネルギー配分を割り当てる畳み込み処理部と、を有し、前記音量配分処理部は、音像定位設定部による設定に拘らず、音像の第１の基準方向として、受聴者を基準に、仰角が０度で各方位角をスピーカ位置とする方向と、仰角がα_v（０度より大きく９０度より小さければ任意に複数設定可能）で各方位角をセンタースピーカ以外の隣り合うスピーカの中間位置とする方向と、仰角が９０度で各方位角を０度とする方向と、を決定する第１の基準方向決定手段と、前記第１の基準方向のうち仰角が０度の場合には当該第１の基準方向のスピーカに前記音源からの音声信号の全音量を配分し、前記第１の基準方向のうち仰角がα_vの場合には方位角の視点で当該第１の基準方向を挟む２つの各スピーカに１／２音量を配分し、前記第１の基準方向のうち仰角が９０度の場合には受聴者を基準に前方に位置する３つの各スピーカに１／６音量を配分し、後方に位置する２つの各スピーカに１／４音量を配分する第１の音量配分手段と、前記第１の基準方向のうちから、仰角、方位角の視点で、前記音像定位設定部により設定された音像を挟む４つの第２の基準方向を決定する第２の基準方向決定手段と、前記音像定位設定部により設定された音像の仰角が前記α_v以下である場合に、［発明を実施するための形態］において述べる［数１］に基づいて、前記第２の基準方向決定手段により決定された各第２の基準方向の音量配分と所定の結合係数とを乗算し、この値を決定された全ての前記第２の基準方向分で加算することで、スピーカ毎の音量配分を算出する第２の音量配分手段と、を有することを特徴とする。 In order to achieve the above object, a stereophonic sound generation system using a planar speaker according to the present invention includes a five-channel planar speaker and applies a sound image localization process to an audio signal from a sound source to place a sound image at an arbitrary position. A stereophonic sound generation system that localizes and generates stereophonic sound through the speaker, comprising: an operation unit that inputs various operations; and a sound image localization processing unit that generates a localization position of a sound image, and the sound image localization processing unit includes: A sound image localization setting unit that sets a localization position of a desired sound image input through the operation unit; a volume distribution processing unit that determines a speaker to be used and determines a volume distribution output from each speaker; and the localization of the sound image Based on the position, the head-related transfer function determination unit that determines the head-related transfer function to be convoluted with the audio signal output from each speaker, and the output from the speaker used A convolution processing unit that convolves the head-related transfer function determined by the head-related transfer function determination unit with respect to the audio signal and allocates the energy distribution calculated by the audio energy distribution processing unit, and the volume Regardless of the setting by the sound image localization setting unit, the distribution processing unit sets the first reference direction of the sound image as a reference with respect to the listener, a direction in which the elevation angle is 0 degrees and each azimuth angle is the speaker position, and the elevation angle is α _v A direction in which each azimuth is an intermediate position between adjacent speakers other than the center speaker, and a direction in which each elevation is 90 degrees and each azimuth is 0 degrees. And a first reference direction determination means for determining the total volume of the audio signal from the sound source to the speakers in the first reference direction when the elevation angle is 0 degree in the first reference direction. And the first When the elevation angle is α _v among the two reference directions, ½ volume is distributed to each of the two speakers sandwiching the first reference direction from the viewpoint of the azimuth angle, and the elevation angle is 90 ° among the first reference directions. First volume distribution means for allocating 1/6 volume to each of the three speakers located in front of the listener and ¼ volume to each of the two speakers located behind, A second reference direction determining means for determining four second reference directions sandwiching the sound image set by the sound image localization setting unit from the viewpoints of elevation angle and azimuth angle from among the first reference directions; When the elevation angle of the sound image set by the sound image localization setting unit is equal to or less than α _v , the second reference direction determining means determines the angle based on [Formula 1] described in [Mode for Carrying Out the Invention]. Volume distribution in each second reference direction determined and a predetermined coupling coefficient Multiplies, by adding in all the second reference direction component of this value has been determined, and having a second volume allocation means for calculating the volume distribution of each speaker, a.

以上のような本発明によれば、受聴者の耳と同程度の高さの水平面上に配置するスピーカを用いる場合であっても、水平方向（方位角の方向）だけでなく垂直方向（仰角の方向）にも自由に定位する音像を生成し立体音響を実現可能な立体音響システム、その制御方法及び制御プログラムを提供することができる。特に、現在のサラウンドシステムで採用されている５．１チャンネルの標準スピーカシステムを利用することで立体音響を生成することができるため、特別な生成装置や、立体的にスピーカを配置することが必要ない利点を有する。 According to the present invention as described above, not only in the horizontal direction (direction of azimuth) but also in the vertical direction (elevation angle), even when using a speaker placed on a horizontal plane that is as high as the ears of the listener. 3D), a stereophonic sound system capable of generating a stereo image freely and also realizing stereophonic sound, a control method thereof, and a control program can be provided. In particular, since it is possible to generate stereophonic sound by using the 5.1 channel standard speaker system adopted in the current surround system, it is necessary to arrange special speakers and speakers in three dimensions. Has no advantage.

また、本発明は、できるだけ多くのスピーカを用い、多方向からの信号にそれぞれ左か右の頭部伝達関数を畳み込むことによってクロストークの影響を軽減することができるため、クロストークのキャンセル処理を必要とせず、受聴位置の制限を緩くすることが可能である。さらに、通常は受聴位置をスピーカシステムの中心とすることを理想とするが、その近傍においても良好な立体音響を実現することができ、また、受聴者の頭の回転や左右、前後の移動の影響を少なくすることが可能である。 In addition, the present invention can reduce the influence of crosstalk by using as many speakers as possible and convolving the left or right head-related transfer functions with signals from multiple directions. It is possible to relax the listening position restriction without the need. In addition, it is ideal to have the listening position at the center of the speaker system, but good stereophonic sound can be achieved in the vicinity of the speaker system, and the listener's head can be rotated and moved left and right and back and forth. It is possible to reduce the influence.

本発明の実施形態に係る立体音響生成システムの全体構成の一例を示すブロック図。The block diagram which shows an example of the whole structure of the stereophonic sound production | generation system which concerns on embodiment of this invention. 本発明の実施形態に係る立体音響生成システムの平面配置スピーカ（５チャンネル）の配置例を示す図。The figure which shows the example of arrangement | positioning of the planar arrangement | positioning speaker (5 channels) of the stereophonic sound generation system which concerns on embodiment of this invention. 本発明の実施形態に係る立体音響生成システムの音像の定位例を示す図。The figure which shows the localization example of the sound image of the stereophonic sound production | generation system which concerns on embodiment of this invention. 本発明の実施形態に係る立体音響生成システムの平面配置スピーカ（８チャンネル）の配置例を示す図。The figure which shows the example of arrangement | positioning of the planar arrangement | positioning speaker (8 channels) of the stereophonic sound generation system which concerns on embodiment of this invention. 本発明の実施形態に係る立体音響生成システムのスピーカ配置及び基準方向（第１の基準方向）を示す図（５チャンネル）。The figure (5 channels) which shows the speaker arrangement | positioning and reference direction (1st reference direction) of the stereophonic sound production | generation system which concerns on embodiment of this invention. 本発明の実施形態に係る立体音響生成システムのスピーカ配置及び基準方向（第１の基準方向）を示す図（８チャンネル）。The figure (8 channels) which shows the speaker arrangement | positioning and reference direction (1st reference direction) of the stereophonic sound production | generation system which concerns on embodiment of this invention.

［本実施形態］
［１．構成］
次に、本発明の実施形態に係る平面配置スピーカを用いた立体音響システムの構成について、図１〜３を参照して以下に説明する。図１は、本実施形態に係る立体音響システムの全体構成を示す機能ブロック図である。なお、以下では、特別言及のないの場合は５チャンネルの場合を想定し、スピーカ４０は、図２に示すように、左側のスピーカグループ（Ｌ、ＳＬ、８チャンネル場合はＭＬも）と、右のスピーカグループ（Ｒ、ＳＲ、８チャンネルの場合はＭＲも）と、が配置される。 [This embodiment]
[1. Constitution]
Next, the structure of the stereophonic sound system using the planar speaker according to the embodiment of the present invention will be described below with reference to FIGS. FIG. 1 is a functional block diagram showing the overall configuration of the stereophonic sound system according to the present embodiment. In the following, the case of 5 channels is assumed unless otherwise specified, and the speaker 40 is connected to the left speaker group (L, SL, ML in the case of 8 channels) and the right as shown in FIG. Speaker groups (R, SR, and MR in the case of 8 channels) are arranged.

図１に示す通り、本実施形態に係る平面配置スピーカによる立体音響システムは、音源１０と、音像の定位位置やチャンネル数の指定等、各種入力操作を行う操作部２０と、後述する各機能により所望の音像の定位を実現するように、スピーカ４０毎の音量配分や頭部伝達関数の付与等の定位処理を実行する音像定位処理部３０と、スピーカ４０（Ｃ、Ｌ、ＳＬ、Ｒ、ＳＲ）と、から構成されている。 As shown in FIG. 1, the stereophonic sound system using the planar speaker according to the present embodiment includes a sound source 10, an operation unit 20 that performs various input operations such as designation of a localization position of a sound image and the number of channels, and functions described later. A sound image localization processing unit 30 for performing localization processing such as volume distribution for each speaker 40 and provision of a head-related transfer function and a speaker 40 (C, L, SL, R, SR) so as to realize a desired sound image localization. ) And.

ここで、音像定位処理部３０の具体的な構成について以下に詳述する。
まず、図１に示す通り、前記操作部２０による入力操作に応じて、所望の仮想的な音像の定位位置を設定する音像定位設定部３０１を備えている。例えば、図３に示すように、この音像定位設定部３０１は、操作部２０を通じて操作により、受聴者を基準とした音像の方位角θと、上下方向を決める仰角αからなる所望の音像定位（方位角θ、仰角α）を設定する。 Here, a specific configuration of the sound image localization processing unit 30 will be described in detail below.
First, as shown in FIG. 1, a sound image localization setting unit 301 is provided for setting a desired virtual sound image localization position in accordance with an input operation by the operation unit 20. For example, as shown in FIG. 3, the sound image localization setting unit 301 is operated through the operation unit 20, and a desired sound image localization (the desired sound image localization consisting of the azimuth angle θ of the sound image with respect to the listener and the elevation angle α that determines the vertical direction) Azimuth angle θ and elevation angle α) are set.

また、音像定位処理部３０は、音像定位設定部３０１により設定された音像の定位に応じて、図２のように配置されたスピーカ４０のうち、使用するものを選択するスピーカ選択部３０２を備えている。なお、具体的なスピーカ４０の選択態様については［２．３．音声信号のエネルギー配分処理］の項目において説明する。 Further, the sound image localization processing unit 30 includes a speaker selection unit 302 that selects a speaker to be used among the speakers 40 arranged as shown in FIG. 2 according to the localization of the sound image set by the sound image localization setting unit 301. ing. In addition, about the specific selection mode of the speaker 40, [2.3. This will be described in the item of “Audio signal energy distribution processing”.

また、音像定位処理部３０は、音源１０からの音声信号に対して畳み込む左右のスピーカ４０グループ毎の頭部伝達関数を決定する頭部伝達関数決定部３０３を備えている。ここで、頭部伝達関数とは、音像位置から音を発生させる場合の、音源から受聴者の耳に至るまでの音の伝達関数であり、特に、音像位置、受聴者、スピーカ位置等により区別される。そのため、頭部伝達関数決定部３０３は、左右のスピーカ４０のグループに対しては、左側用として頭部伝達関数（ＨＲＴＦ_L）を、右側用として頭部伝達関数（ＨＲＴＦ_R）を、所望の音像定位（θ、α）に基づき決定する。 The sound image localization processing unit 30 includes a head-related transfer function determining unit 303 that determines a head-related transfer function for each of the left and right speaker 40 groups to be convoluted with the sound signal from the sound source 10. Here, the head-related transfer function is a transfer function of sound from the sound source to the listener's ear when sound is generated from the sound image position, and is particularly distinguished by the sound image position, listener, speaker position, etc. Is done. Therefore, the head-related transfer function determination unit 303 selects the head-related transfer function (HRTF _L ) for the left side and the head-related transfer function (HRTF _R ) for the right side as desired for the group of left and right speakers 40. It is determined based on the sound image localization (θ, α).

また、音像定位処理部３０は、音像定位設定部３０１により設定された音像の定位位置に基づいて、スピーカ選択部３０２により選択された各スピーカ４０に対して、配分する音声信号のエネルギー量を演算するエネルギー配分部３０４を備えている。このエネルギー配分部３０４による音声信号のエネルギー配分の計算態様は［２．３．音声信号のエネルギー配分処理］の項目において詳述する。なお、スピーカ選択部３０２とエネルギー配分部３０４が、特許請求の範囲上の「音量配分処理部」に対応する。 Further, the sound image localization processing unit 30 calculates the energy amount of the audio signal to be distributed to each speaker 40 selected by the speaker selection unit 302 based on the localization position of the sound image set by the sound image localization setting unit 301. An energy distribution unit 304 is provided. The calculation mode of the energy allocation of the audio signal by the energy allocation unit 304 is [2.3. This will be described in detail in the section of “Audio signal energy distribution processing”. The speaker selection unit 302 and the energy distribution unit 304 correspond to a “volume distribution processing unit” in the claims.

また、音像定位処理部３０は、頭部伝達関数テーブル３０５を備え、５チャンネル又は８チャンネルの場合における各スピーカ４０に対する受聴者毎の頭部伝達関数を格納する。 The sound image localization processing unit 30 includes a head-related transfer function table 305, and stores a head-related transfer function for each listener for each speaker 40 in the case of 5 channels or 8 channels.

また、音像定位処理部３０は、音源１０からの音声信号に対して頭部伝達関数決定部３０３により決定された頭部伝達関数を畳み込み、かつ、エネルギー配分部３０４により演算された音声信号のエネルギーを各スピーカ４０に割り当てる畳み込み処理部３０６を備えている。この畳み込み処理部３０６は、頭部伝達関数決定部３０３により決定されるこの頭部伝達関数ＨＲＴＦを音源１０からの音声信号に畳み込むことによって、音の方向感覚を強くし、さらに、各スピーカ４０に所定のエネルギーを配分することで立体的な音像を作り出す。 Further, the sound image localization processing unit 30 convolves the head-related transfer function determined by the head-related transfer function determination unit 303 with the sound signal from the sound source 10 and the energy of the sound signal calculated by the energy distribution unit 304. Is provided to each speaker 40. The convolution processing unit 306 convolves the head-related transfer function HRTF determined by the head-related transfer function determining unit 303 with the audio signal from the sound source 10, thereby strengthening the sense of direction of the sound. A three-dimensional sound image is created by allocating predetermined energy.

［２．作用］
［２．１．基本作用］
上記のような構成を有する平面配置スピーカを用いた立体音響システムにおける、音像生成の基本作用を以下に説明する。
まず、操作部２０を通じて入力操作により音像定位設定部３０１は、ユーザが所望する音像の定位位置（θ，α）を設定する。 [2. Action]
[2.1. Basic action]
The basic operation of sound image generation in the stereophonic sound system using the planar speaker having the above configuration will be described below.
First, the sound image localization setting unit 301 sets a localization position (θ, α) of a sound image desired by the user by an input operation through the operation unit 20.

そして、スピーカ選択部３０２が、音像定位設定部３０１により設定された音像定位位置に基づいて、音源１０からの音声信号を出力するスピーカ４０を選択する。すなわち、５チャンネルの場合、設定された音像の定位位置に応じて、図２に示すようなスピーカ４０のＣ、Ｌ、ＳＬ、Ｒ、ＳＲのうちのいずれを使用するかを選択し、また、８チャンネルの場合、図４に示すようなスピーカ４０のＣ、Ｌ、ＳＬ、ＭＬ、ＢＣ、Ｒ、ＳＲ、ＭＲのうちのいずれかを選択する。 Then, the speaker selection unit 302 selects the speaker 40 that outputs the audio signal from the sound source 10 based on the sound image localization position set by the sound image localization setting unit 301. That is, in the case of 5 channels, according to the localization position of the set sound image, select which of C, L, SL, R, SR of the speaker 40 as shown in FIG. In the case of 8 channels, any one of C, L, SL, ML, BC, R, SR, and MR of the speaker 40 as shown in FIG. 4 is selected.

次に、頭部伝達関数決定部３０３は、音像定位設定部３０１により設定された音像定位に基づいて、左右のスピーカ４０グループに対応する頭部伝達関数を頭部伝達関数テーブル３０５から読み出し、決定する（下記［２．２．頭部伝達関数決定及び畳み込み処理］参照。）。 Next, the head-related transfer function determination unit 303 reads out the head-related transfer functions corresponding to the left and right speaker 40 groups from the head-related transfer function table 305 based on the sound image localization set by the sound image localization setting unit 301 and determines them. (See [2.2. Head Transfer Function Determination and Convolution Processing] below.)

また、エネルギー配分部３０４は、スピーカ選択部３０２により選択された各スピーカ４０から出力する音声信号のエネルギー量を算出する（下記［２．３．音声信号のエネルギー配分処理］参照。）。 Further, the energy distribution unit 304 calculates the energy amount of the audio signal output from each speaker 40 selected by the speaker selection unit 302 (see [2.3. Audio signal energy distribution process] below).

そして、畳込み処理部３０６は、選択されたスピーカ４０により出力される音源１０からの音声信号に対して頭部伝達関数決定部３０３により決定された頭部伝達関数を畳み込み、加えて、エネルギー配分部３０４により演算されたスピーカ４０毎の音声信号のエネルギーを配分する。これにより、出力対象となる各スピーカ４０から所定のエネルギー量の音声信号を出力することで、所望の音像の定位を実現する。 Then, the convolution processing unit 306 convolves the head-related transfer function determined by the head-related transfer function determining unit 303 with the audio signal from the sound source 10 output from the selected speaker 40, and adds the energy distribution. The energy of the audio signal for each speaker 40 calculated by the unit 304 is distributed. As a result, a desired sound image is localized by outputting a sound signal having a predetermined energy amount from each speaker 40 to be output.

［２．２．頭部伝達関数の決定及び畳み込み処理］
次に、頭部伝達関数決定部３０３による各スピーカ４０に対する頭部伝達関数の決定処理、及び決定された頭部伝達関数の音源１０からの音声信号への畳み込み処理について、具体的に説明する。 [2.2. Head Transfer Function Determination and Convolution Processing]
Next, the head-related transfer function determination process for each speaker 40 by the head-related transfer function determination unit 303 and the process of convolving the determined head-related transfer function into the audio signal from the sound source 10 will be specifically described.

この頭部伝達関数決定部３０３は、音像定位設定部３０１により設定された音像の定位位置（θ、α）に対応させるように、左右のスピーカ４０のグループ毎に、頭部伝達関数テーブル３０５に格納された所定の頭部伝達関数（音像位置から音を発生させた場合の音源から左と右の耳までの伝達関数ＨＲＴＦ_LとＨＲＴＦ_R）を読み出すことで決定する。そして、畳込み処理部３０６は、この頭部伝達関数決定部３０３により決定された頭部伝達関数を、音源１０からの音声信号に対して畳み込む。 The head-related transfer function determination unit 303 stores the head-related transfer function table 305 for each group of the left and right speakers 40 so as to correspond to the sound image localization position (θ, α) set by the sound image localization setting unit 301. It is determined by reading out the stored predetermined head-related transfer functions (transfer functions HRTF _L and HRTF _R from the sound source to the left and right ears when sound is generated from the sound image position). Then, the convolution processing unit 306 convolves the head-related transfer function determined by the head-related transfer function determining unit 303 with the audio signal from the sound source 10.

より詳細には、５チャンネルのシステムの場合、左側のスピーカ４０のグループ（Ｌ、ＳＬ、８チャンネル場合はＭＬも）には左の耳の頭部伝達関数ＨＲＴＦ_L（θ，α）、右側のスピーカ４０のグループ（Ｒ、ＳＲ、８チャンネルの場合はＭＲも）には右の耳の頭部伝達関数ＨＲＴＦ_R（θ，α）を頭部伝達関数決定部３０３により決定し、頭部伝達関数畳み込み部３０６が、この頭部伝達関数を音源１０から送られる音声信号に畳み込む。 More specifically, in the case of a 5-channel system, the left ear head transfer function HRTF _L (θ, α) and the right-side speaker 40 group (L, SL, ML in the case of 8 channels) The head transfer function HRTF _R (θ, α) of the right ear is determined by the head transfer function determination unit 303 for the group of speakers 40 (R, SR, and MR in the case of 8 channels), and the head transfer function is determined. The convolution unit 306 convolves this head-related transfer function with the audio signal sent from the sound source 10.

これにより、後述する音声信号のエネルギー配分処理により配分されたエネルギーで、左側のスピーカ４０グループからは頭部伝達関数ＨＲＴＦ_L（θ，α）が畳み込まれた音声信号が出力され、右側のスピーカ４０グループからは頭部伝達関数ＨＲＴＦ_R（θ，α）が畳み込まれた音声信号が出力される。 As a result, a sound signal in which the head-related transfer function HRTF _L (θ, α) is convoluted is output from the left speaker 40 group with the energy distributed by the energy distribution process of the sound signal described later, and the right speaker From the 40 groups, audio signals in which the head-related transfer function HRTF _R (θ, α) is convoluted are output.

また、前方（Ｃ）と後方（ＢＣ但し、８チャンネルの場合のみ使用する。）のスピーカ４０では、後述するエネルギー配分処理により当該スピーカ４０に配分されたエネルギーの半分ずつを、左右の頭部伝達関数ＨＲＴＦ_L（θ，α）と頭部伝達関数ＨＲＴＦ_R（θ，α）が畳み込まれた音声信号で合成し、出力される。 Further, in the front (C) and rear (BC, which is used only in the case of 8 channels), half of the energy distributed to the speaker 40 by energy distribution processing described later is transmitted to the left and right heads. The function HRTF _L (θ, α) and the head related transfer function HRTF _R (θ, α) are synthesized with the convoluted audio signal and output.

このように、ＨＲＴＦの特性を音声信号に畳み込むことによって、音の方向感覚を強くし、立体的な音像を作り出す。すなわち、トランスノーラルシステムなどとは異なり、本方法は左右のスピーカ４０のグループにそれぞれ左と右のＨＲＴＦを畳み込むので、スピーカ４０のグループのトータル作用でスピーカ４０の位置による頭部伝達関数の影響とクロストークの影響を弱めることが可能となり、音像の方向特性を作り出すことができる。 Thus, by convolving the HRTF characteristics with the audio signal, the sense of direction of the sound is strengthened and a three-dimensional sound image is created. That is, unlike the transnormal system, the present method convolves the left and right HRTFs in the left and right speaker 40 groups, respectively, and therefore the influence of the head-related transfer function due to the position of the speakers 40 by the total action of the speaker 40 groups. It is possible to weaken the influence of crosstalk and create the direction characteristics of the sound image.

また、このような特徴を実現するためには、シングルユニットのスピーカ４０よりも縦方向に配置したマルチユニットのスピーカ４０を使用した方が音像の方向特性は顕著であり、受聴者の頭の上下動による影響も抑制することが可能である。 In order to realize such a feature, the direction characteristics of the sound image are more prominent when the multi-unit speaker 40 arranged in the vertical direction is used than the single unit speaker 40, and the listener's head is located above and below the head. It is also possible to suppress the effects of movement.

なお、頭部伝達関数は音像方向によってエネルギーが変化し、実測データは方向に関する連続性に欠けるので、頭部伝達関数のエネルギーを全て正規化し、音像方向による頭部伝達関数のエネルギーの違いを後述する拡張ＡＰによって実現する。 Note that the energy of the head-related transfer function changes depending on the direction of the sound image, and the measured data lacks continuity with respect to the direction. This is realized by the extended AP.

［２．３．音声信号のエネルギー配分処理］
次に、上記で示した音像定位設定部３０１により設定した音像の定位位置に基づいた、スピーカ選択部３０２及びエネルギー配分部３０４による決定するスピーカ４０の選択処理及びスピーカ４０毎のエネルギー配分処理を図５及び６、表１〜４を参照して以下に詳述する。 [2.3. Audio signal energy distribution processing]
Next, a speaker 40 selection process and an energy distribution process for each speaker 40 determined by the speaker selection unit 302 and the energy distribution unit 304 based on the sound image localization position set by the sound image localization setting unit 301 described above are shown in FIG. 5 and 6, detailed below with reference to Tables 1-4.

まず、全方向の音像の定位を考える前に、音像の基準方向（特許請求の範囲上の「第１の基準方向」に対応する。以下、第１の基準方向と称する。）となる、仰角０度で各スピーカ４０の置かれる方位角、所定の仰角α_v（以下では、α_v＝３０とするが変更可能。）でスピーカ４０（センタースピーカを除く）位置の中間の方位角、加えて仰角９０度方向（方位角は０度）におけるエネルギー配分Ｅ_pを決定する。ここで、第１の基準方向としては、隣合うスピーカ４０（センタースピーカを除く）の中間位置を方位角にとるのが基本だが、仰角０度の場合は、従前のシステムとの互換性を考慮しスピーカ４０の位置を採用する。また、仰角９０度の場合は、いずれの方位角でも構わないため、０度に設定する。 First, before considering the localization of the sound image in all directions, the elevation angle becomes the reference direction of the sound image (corresponding to the “first reference direction” in the claims, hereinafter referred to as the first reference direction). An azimuth angle at which each speaker 40 is placed at 0 degrees, an azimuth angle in the middle of the position of the speaker 40 (excluding the center speaker) at a predetermined elevation angle α _v (hereinafter, α _v = 30 can be changed), The energy distribution E _p in the 90 ° elevation direction (azimuth angle is 0 °) is determined. Here, as the first reference direction, the intermediate position of the adjacent speakers 40 (excluding the center speaker) is basically taken as the azimuth, but in the case of an elevation angle of 0 degrees, consideration is given to compatibility with the previous system. The position of the speaker 40 is adopted. Further, in the case of an elevation angle of 90 degrees, any azimuth angle may be used, so it is set to 0 degrees.

具体的には、通常の５チャンネル標準ホームシアターシステムの場合は、図２に示すように、スピーカ４０の配置方位はＣ（０度）、Ｌ（３０度）、Ｒ（３３０度）、ＳＬ（１２０度）とＳＲ（２４０度）である。そのため、第１の基準方向及びエネルギー配分として、図５に示すように、仰角０度で方位角０、３０、１２０、２４０、３３０度（実際にスピーカのある方位）、仰角３０度で方位角０、７５、１８０、２８５度（センタースピーカを除いた各スピーカの中間方位）、仰角９０度（方位角任意）で方位角０度、の各スピーカ４０のエネルギー配分Ｅ_p（_p=Ｃ，Ｌ，Ｒ，ＳＬ，ＲＬ）を決め、全空間のエネルギー配分の基準とする。 Specifically, in the case of a normal 5-channel standard home theater system, as shown in FIG. 2, the arrangement orientation of the speaker 40 is C (0 degrees), L (30 degrees), R (330 degrees), SL (120 Degrees) and SR (240 degrees). Therefore, as shown in FIG. 5, as the first reference direction and energy distribution, as shown in FIG. 5, the azimuth angle is 0, 30, 120, 240, 330 degrees (the azimuth where the speaker is actually located) at the elevation angle of 0 degrees, and the azimuth angle at the elevation angle of 30 degrees Energy distribution E _p ( _p = C, L) of each speaker 40 of 0, 75, 180, 285 degrees (intermediate azimuth of each speaker excluding the center speaker), elevation angle 90 degrees (azimuth angle arbitrary) and azimuth angle 0 degree , R, SL, RL) are determined and used as a reference for energy distribution in the entire space.

これらの第１の基準方向における使用するスピーカ４０、各スピーカ４０のエネルギー配分は下記表１に示す通りである。
［表１］

The speaker 40 used in the first reference direction and the energy distribution of each speaker 40 are as shown in Table 1 below.
[Table 1]

また、図４に示すような８チャンネル等間隔スピーカ配置システムの場合は、スピーカの配置方位はＣ（０度）、Ｌ（４５度）、ＭＬ（９０度）、ＳＬ（１３５度）、ＢＣ（１８０度）、ＳＲ（２２５度）、ＭＲ（２７０度）、とＲ（３１５度）である。そのため、８チャンネルの場合の第１の基準方向及びエネルギー配分として、図６に示すように、仰角０度で方位角０、４５、９０、１３５、１８０、２２５、２７０、３１５度（実際にスピーカのある方位）、仰角３０度で方位角０、６７．５、１１２．５、１５７．５度、２０２．５度、２４７．５度と２９２．５度（センタースピーカを除いた各スピーカの中間方位）、仰角９０度で方位角０度（方位角任意）、における各スピーカのエネルギー配分Ｅ_p（_p＝Ｃ, Ｌ, Ｒ, ＭＬ, ＭＲ, ＳＬ, ＲＬ）を決め、全空間のエネルギー配分の基準とする。 In the case of an 8-channel equally spaced speaker arrangement system as shown in FIG. 4, the speaker arrangement directions are C (0 degrees), L (45 degrees), ML (90 degrees), SL (135 degrees), BC ( 180 degrees), SR (225 degrees), MR (270 degrees), and R (315 degrees). Therefore, as the first reference direction and energy distribution in the case of 8 channels, as shown in FIG. 6, the elevation angle is 0 degree and the azimuth angles are 0, 45, 90, 135, 180, 225, 270, 315 degrees (actually speakers Azimuth), elevation angle 30 degrees, azimuth angles 0, 67.5, 112.5, 157.5 degrees, 202.5 degrees, 247.5 degrees and 292.5 degrees (middle of each speaker excluding the center speaker) Azimuth), energy distribution E _p ( _p = C, L, R, ML, MR, SL, RL) is determined for each speaker at an azimuth angle of 90 degrees and an azimuth angle of 0 degrees (any azimuth angle). The standard.

これらの８チャンネルの場合の第１の基準方向における、使用するスピーカ４０、及び各スピーカ４０のエネルギー配分は下記表２に示す通りである。
［表２］

The speaker 40 to be used and the energy distribution of each speaker 40 in the first reference direction in the case of these eight channels are as shown in Table 2 below.
[Table 2]

このように、スピーカ選択部３０２及びエネルギー配分部３０４では、仰角０、α_v、９０度における第１の基準方向を決定する機能（特許請求の範囲上の「第１の基準方向決定手段」に対応する。）と、当該第１の基準方向におけるエネルギー配分を決定する機能（特許請求の範囲上の「第１の音量配分手段」に対応する。）を有している。 As described above, the speaker selection unit 302 and the energy distribution unit 304 have a function of determining the first reference direction at elevation angles 0, α _v , and 90 degrees (in the “first reference direction determination unit” in the claims). And a function for determining energy distribution in the first reference direction (corresponding to “first volume distribution means” in the claims).

次に、これらの第１の基準方向のエネルギー配分を基準に全方向のＡＰ法に拡張する場合を説明する。上記以外の方向（θ,α）の音像に関しては、下記の［数１］に基づいて、音像定位設定部３０１により設定された音像の定位位置を囲む４つの基準方向（特許請求の範囲上の「第２の基準方向」に対応する。以下、第２の基準方向と称する。）のスピーカ４０のエネルギー配分を基に、線形結合により最終的なスピーカのエネルギー配分を決定する。
ここでは、まず、音像定位設定部３０１により設定された音像の仰角αが、第１の基準方向（第１の基準方向）として設定したαｖ以下である場合（α≦α_vの場合）について説明する。 Next, a case where the energy distribution in the first reference direction is extended to the omni-directional AP method based on the reference will be described. For sound images in directions (θ, α) other than the above, based on the following [Equation 1], four reference directions surrounding the sound image localization position set by the sound image localization setting unit 301 (on the claims) The final speaker energy distribution is determined by linear combination based on the energy distribution of the speakers 40 corresponding to the “second reference direction” (hereinafter referred to as the second reference direction).
Here, first, the elevation angle of the sound image which is set by the sound image localization setting unit 301 alpha is, for the first reference direction (the case of alpha ≦ alpha _v) (first reference direction) when it is αv less set as described To do.

［数１］

[Equation 1]

ここで、（θ_rl，α_l）、（θ_ll，α_l）、（θ_rh，α_h）、（θ_lh，α_h）は、音像定位設定部３０１により設定された音像に対して右下、左下、右上、左上に位置する第２の基準方向であり、また、ｋ_rl、ｋ_ll、ｋ_rh、ｋ_lhは、下記［数２］により与えられる。なお、θ_llとθ_lhが０度となる場合には、代わりに３６０度を用いる。 Here, (θ _rl , α _l ), (θ _ll , α _l ), (θ _rh , α _h ), (θ _lh , α _h ) are on the right with respect to the sound image set by the sound image localization setting unit 301. The second reference direction is located at the lower, lower left, upper right, and upper left, and k _rl , k _ll , k _rh , and k _lh are given by the following [ _Equation 2]. When θ _ll and θ _lh are 0 degrees, 360 degrees is used instead.

［数２］
ｋ_rl = ((θ_ll-θ)/(θ_ll-θ_rl)) ((α_h-α)/(α_h-α_l))
ｋ_ll = ((θ-θ_rl)/(θ_ll-θ_rl)) ((α_h-α)/(α_h-α_l))
ｋ_rh = ((θ_lh-θ)/(θ_lh-θ_rh)) ((α-α_l)/(α_h-α_l))
ｋ_lh = ((θ-θ_rh)/(θ_lh-θ_rh)) ((α-α_l)/(α_h-α_l)) [Equation 2]
k _rl = ((θ _ll -θ) / (θ _ll -θ _rl )) ((α _h -α) / (α _h -α _l ))
k _ll = ((θ-θ _rl ) / (θ _ll -θ _rl )) ((α _h -α) / (α _h -α _l ))
k _rh = ((θ _lh -θ) / (θ _lh -θ _rh )) ((α-α _l ) / (α _h -α _l ))
k _lh = ((θ-θ _rh ) / (θ _lh -θ _rh )) ((α-α _l ) / (α _h -α _l ))

例えば、５チャンネルシステムで音像方向が（θ，α）＝（６０，１５）の場合では、４つの第２の基準方向は、（θ_rl， α_l）＝（３０,０），（θ_ll，α_l）＝（１２０，０），（θ_rh，α_h）＝（０，３０），（θ_lh，α_h）＝（７５，３０）となるため、ｋ_rl，ｋ_ll，ｋ_rh，ｋ_lhは、次のように算出される。 For example, when the sound image direction is (θ, α) = (60, 15) in a 5-channel system, the four second reference directions are (θ _rl , α _l ) = (30, 0), (θ _ll , Α _l ) = (120, 0), (θ _rh , α _h ) = (0, 30), (θ _lh , α _h ) = (75, 30), so k _rl , k _ll , k _rh , K _lh are calculated as follows.

［数３］
ｋ_rl＝６０／９０×１５／３０＝１／３
ｋ_ll＝３０／９０×１５／３０＝１／６
ｋ_rh＝１５／７５×１５／３０＝１／１０
ｋ_lh＝６０／７５×１５／３０＝２／５ [Equation 3]
k _rl = _60/90 × _15/30 = 1/3
k _ll = _30/90 × _15/30 = _1/6
k _rh = _15/75 × _15/30 = 1/10
k _lh = _60/75 × _15/30 = _2/5

そのため、音像の定位位置が（θ，α）＝（６０，１５）の場合、各スピーカ４０のエネルギー配分は下記表３の通りとなる。
［表３］

Therefore, when the localization position of the sound image is (θ, α) = (60, 15), the energy distribution of each speaker 40 is as shown in Table 3 below.
[Table 3]

このように、スピーカ選択部３０２及びエネルギー配分部３０４は、音像定位設定部３０１により設定された音像の定位位置に対して４方向の第２の基準方向（特許請求の範囲上の「第２の基準方向決定手段」に対応する。）を決定する機能と、この４方向の第２の基準方向におけるエネルギー配分を決定する機能（特許請求の範囲上の「第２の音量配分手段」に対応する。）を有している。 In this manner, the speaker selection unit 302 and the energy distribution unit 304 have four second reference directions (“second” in the claims) with respect to the localization position of the sound image set by the sound image localization setting unit 301. And a function for determining energy distribution in the second reference direction of the four directions (corresponding to “second volume distribution means” in the claims). .)have.

具体的には、下記のように４つの第２の基準方向を決定している。すなわち、この場合、音像定位設定部３０１により設定された音像の仰角が１５度であるため、４つの第２の基準方向の仰角は、第１の基準方向のうち、この１５度を挟む０度と３０度の２つとなり、α_l＝０，α_h＝３０となる。 Specifically, four second reference directions are determined as follows. That is, in this case, since the elevation angle of the sound image set by the sound image localization setting unit 301 is 15 degrees, the elevation angles of the four second reference directions are 0 degrees sandwiching the 15 degrees among the first reference directions. And 30 degrees, α _l = 0 and α _h = 30.

５チャンネルのシステムを仮定すれば、第１の基準方向より、α_l＝０の方位角は、０，３０，１２０，２４０，３３０度であるから、この６０度を挟んだ３０度と１２０度が第２の基準方向の基準方位となる。つまり、θ_rlは６０度より小さい値の中の最大値（３０度）となり、θ_llは６０度より大きい値の中の最小値（１２０度）となる。 Assuming a 5-channel system, the azimuth angle of α _l = 0 from the first reference direction is 0, 30, 120, 240, and 330 degrees. Therefore, 30 degrees and 120 degrees sandwiching this 60 degrees. Is the reference orientation of the second reference direction. That is, θ _rl is a maximum value (30 degrees) among values smaller than 60 degrees, and θ _ll is a minimum value (120 degrees) among values larger than 60 degrees.

また、第１の基準方向より、α_h＝３０の方位角は、０（３６０），７５，１８０，２８５度となるので、この６０度を挟んだ０度と７５度が第２の基準方向の基準方位となる。つまり、θ_rhは６０度より小さい値の中の最大値（０度）となり、θ_lhは６０度より大きい値の中の最小値（７５度）となる。 Further, the azimuth angle of α _h = 30 from the first reference direction is 0 (360), 75, 180, 285 degrees, so 0 degrees and 75 degrees sandwiching the 60 degrees are the second reference direction. This is the reference orientation. That is, θ _rh is the maximum value (0 degrees) among the values smaller than 60 degrees, and θ _lh is the minimum value (75 degrees) among the values larger than 60 degrees.

一方、音像定位設定部３０１により設定された音像の仰角αが、第１の基準方向として設定したαｖを上回る場合（α＞α_vの場合）について以下に説明する。α＞α_vの場合では、第２の基準方向となる右上、左上の仰角が９０度（α_h＝９０）となるため、θ_rhとθ_lhが同じとなり、音像方向を囲む３つの第２の基準方向のスピーカ４０のエネルギー配分を基に、上記［数１］に基づいて、線形結合により最終的なスピーカのエネルギー配分を決める。 On the other hand, the sound image of the elevation angle alpha which is set by the sound image localization setting unit 301 (in the case of α> α _v) when exceeding the αv set as a first reference direction is described below. In the case of α> α _v , the upper right and upper left elevation angles that are the second reference direction are 90 degrees (α _h = 90), so θ _rh and θ _lh are the same, and the three second surrounding the sound image direction Based on the energy distribution of the speakers 40 in the reference direction, the final speaker energy distribution is determined by linear combination based on the above [Equation 1].

具体的には、上記［数１］のｋ_rl，ｋ_ll，ｋ_rh，ｋ_lhは、下記［数４］により与えられる。なお、θ_llが０度となる場合は代わりに３６０度を用いる。 Specifically, k _rl , k _ll , k _rh , and k _lh in the above [ _Equation 1] are given by the following [ _Equation 4]. If θ _ll is 0 degrees, 360 degrees is used instead.

［数４］
ｋ_rl＝（（θ_ll−θ）／（θ_ll−θ_rl））（（α_h−α）／（α_h−α_l）)
ｋ_ll＝（（θ−θ_rl）／（θ_ll−θ_rl））（（α_h−α）／（α_h−α_l））
ｋ_lh＝ｋ_rh＝（（θ−θ_l）／（α_h−α_l）） [Equation 4]
k _rl = ((θ _ll −θ) / (θ _ll −θ _rl )) ((α _h −α) / (α _h −α _l ))
k _ll = ((θ−θ _rl ) / (θ _ll −θ _rl )) ((α _h −α) / (α _h −α _l ))
k _lh = k _rh = ((θ−θ _l ) / (α _h −α _l ))

例えば、同じく５チャンネルシステムで音像定位設定部３０１により設定された音像の定位位置が（θ，α）＝（６０，６０）の場合では、３つの第２の基準方向は（θ_rl，α_l）＝（０，３０），（θ_ll，α_l）＝（７５，３０），（θ_rh，α_h）＝（θ_lh，α_h）＝（−，９０）となり、また、ｋ_rl，ｋ_ll，ｋ_rh，ｋ_lhは、上記［数４］に基づき次のように算出される。 For example, when the localization position of the sound image set by the sound image localization setting unit 301 is (θ, α) = (60, 60) in the same 5-channel system, the three second reference directions are (θ _rl , α _l ) = (0, 30), (θ _ll , α _l ) = (75, 30), (θ _rh , α _h ) = (θ _lh , α _h ) = (−, 90), and k _rl , k _ll , k _rh , and k _lh are calculated as follows based on the above [ _Equation 4].

［数５］
ｋ_rl＝１５／７５×３０／６０＝１／１０
ｋ_ll＝６０／７５×３０／６０＝２／５
ｋ_rh＝ｋ_lh＝３０／６０＝１／２ [Equation 5]
k _rl = _15/75 × _30/60 = 1/10
k _ll = _60/75 × _30/60 = _2/5
k _rh = k _lh = _30/60 = 1/2

この算出されたｋ_rl，ｋ_ll，ｋ_rh，ｋ_lhの値を上記［数１］に代入することで得られた各スピーカ４０のエネルギー配分は下記表４の通りとなる。
［表４］

The energy distribution of each speaker 40 obtained by substituting the calculated values of k _rl , k _ll , k _rh , and k _lh into the above [Equation 1] is as shown in Table 4 below.
[Table 4]

なお、上記で挙げた３つの第２の基準方向は下記のように決定される。すなわち、音像定位設定部３０１により設定された音像の定位位置が（θ，α）＝（６０，６０）の場合、仰角が６０度であるため、第１の基準方向の仰角０，３０，９０度のうち、当該第２の基準方向の仰角は６０度を挟む３０度と９０度であり、α_l＝３０，α_h＝９０となる。 Note that the three second reference directions mentioned above are determined as follows. That is, when the localization position of the sound image set by the sound image localization setting unit 301 is (θ, α) = (60, 60), the elevation angle is 60 degrees, and thus the elevation angles 0, 30, 90 in the first reference direction. Of the degrees, the elevation angles in the second reference direction are 30 degrees and 90 degrees with 60 degrees interposed therebetween, and α _l = 30 and α _h = 90.

これにより、設定された音像の定位位置に対して右上と左上の第２の基準方向の仰角は９０度となる。ここで、α_h＝９０の場合は、方位に関係なく基準方向は真上の方向となるため、θ_rhとθ_lhは任意の値となり、上述の通り考慮する必要がない。 Thereby, the elevation angle in the second reference direction in the upper right and upper left with respect to the set position of the sound image is 90 degrees. Here, when α _h = 90, the reference direction is directly above regardless of the direction, so θ _rh and θ _lh are arbitrary values and need not be considered as described above.

また、５チャンネルのシステムを仮定すれば、第１の基準方向より、α_l＝３０の方位角は０（３６０），７５，１８０，２８５度となり、６０度を挟んだ０度と７５度が第２の基準方向の基準方位となる。つまり、θ_rlは６０度より小さい値の中の最大値（０度）となり、θ_llは６０度より大きい値の中の最小値（７５度）となる。 Assuming a 5-channel system, the azimuth angle of α _l = 30 is 0 (360), 75, 180, 285 degrees from the first reference direction, and 0 degrees and 75 degrees across 60 degrees. It becomes the reference azimuth of the second reference direction. That, theta _rl is the largest value among the 60-degree smaller value (0 °) and, theta _ll is the minimum value of the 60-degree greater than (75 °).

このように、第１及び２の基準方向のエネルギー配分と、上記［数１］，［数２］，［数４］を用いて、スピーカ選択部３０２及びエネルギー配分部３０４は、使用するスピーカの選択処理と、各スピーカに対する音源１０からの音声信号のエネルギー配分処理が行うことで、水平方向だけでなく垂直方向に任意の仰角を有する音像をも生成することができる。 Thus, using the energy distribution in the first and second reference directions and the above [Equation 1], [Equation 2], and [Equation 4], the speaker selection unit 302 and the energy distribution unit 304 use the speaker to be used. By performing the selection process and the energy distribution process of the audio signal from the sound source 10 for each speaker, it is possible to generate a sound image having an arbitrary elevation angle not only in the horizontal direction but also in the vertical direction.

上記表１〜４によれば、定位された音像の仰角が０度の場合は、一般的なＡＰ法と同様に、音像方向近傍のスピーカ４０だけを用い、スピーカ４０のエネルギー比を変化させることにより当該音像の定位位置を移動させることが可能である。 According to Tables 1-4 above, when the elevation angle of the localized sound image is 0 degree, the energy ratio of the speaker 40 is changed using only the speaker 40 in the vicinity of the sound image direction, as in the general AP method. Thus, the localization position of the sound image can be moved.

また、定位された音像の仰角が０度でない場合は、使用するスピーカ４０の数を増やす必要があり、音像と同じ方位角に実際にスピーカがあっても、そのスピーカではなく近くのスピーカを利用する必要がある。特に、正面方向の音像を生成する場合は、センターのスピーカ４０によるエネルギーを抑えた方が、音像の仰角感覚が得られ易くなることが上記結果から確かめられた。 If the elevation angle of the localized sound image is not 0 degree, it is necessary to increase the number of speakers 40 to be used, and even if there is actually a speaker at the same azimuth angle as the sound image, a nearby speaker is used instead of the speaker. There is a need to. In particular, when generating a sound image in the front direction, it was confirmed from the above results that it is easier to obtain an elevation angle sensation of a sound image when the energy of the center speaker 40 is suppressed.

また、仰角９０度の音像に関しては前後、左右のバランスを考慮に入れて、全てのスピーカ４０を利用する必要がある。なお、全空間での音像移動を考えた時には、どのスピーカもエネルギーの変化が連続でスムーズであることが必要であり、特に、仰角９０度の方向は方位による差がないようにする必要がある。 In addition, regarding a sound image with an elevation angle of 90 degrees, it is necessary to use all the speakers 40 in consideration of the balance between front and rear and left and right. When considering the movement of the sound image in the entire space, it is necessary for any speaker to have a continuous and smooth change in energy, and in particular, the direction at an elevation angle of 90 degrees needs to have no difference depending on the direction. .

［３．効果］
以上のような本実施形態によれば、受聴者の耳と同程度の高さの水平面上に配置するスピーカを用いる場合であっても、水平方向（方位角の方向）だけでなく垂直方向（仰角の方向）にも自由に定位する音像を生成し立体音響を実現可能な立体音響システム、その制御方法及び制御プログラムを提供することができる。特に、現在のサラウンドシステムで採用されている５．１チャンネルの標準スピーカシステムを利用することで立体音響を生成することができるため、特別な生成装置や、立体的にスピーカを配置することが必要ない利点を有する。 [3. effect]
According to the present embodiment as described above, not only in the horizontal direction (direction of azimuth) but also in the vertical direction (even in the direction of the azimuth angle), even when using a speaker arranged on a horizontal plane having the same height as the listener's ear. It is possible to provide a stereophonic sound system capable of generating a sound image that is freely localized in the direction of the elevation angle and realizing stereophonic sound, a control method thereof, and a control program. In particular, since it is possible to generate stereophonic sound by using the 5.1 channel standard speaker system adopted in the current surround system, it is necessary to arrange special speakers and speakers in three dimensions. Has no advantage.

また、本発明は、できるだけ多くのスピーカ用い、多方向からの信号にそれぞれ左か右の頭部伝達関数を畳み込むことによってクロストークの影響を軽減することができるため、クロストークのキャンセル処理を必要とせず、受聴位置の制限を緩くすることが可能である。さらに、通常は受聴位置をスピーカシステムの中心とすることを理想とするが、その近傍においても良好な立体音響を実現することができ、また、受聴者の頭の回転や左右、前後の移動の影響を少なくすることが可能である。 In addition, the present invention can reduce the influence of crosstalk by using as many speakers as possible and convolving the left or right head-related transfer functions with signals from multiple directions, thus requiring crosstalk cancellation processing. Instead, it is possible to loosen the listening position restriction. In addition, it is ideal to have the listening position at the center of the speaker system, but good stereophonic sound can be achieved in the vicinity of the speaker system, and the listener's head can be rotated and moved left and right and back and forth. It is possible to reduce the influence.

［他の実施形態］
なお、本発明は、上記のようにエネルギー配分部３０４が上記［数１］のように線形結合により各スピーカ４０により出力される音声信号のエネルギーを算出する実施形態に限定するものではなく、各スピーカ４０のエネルギー配分を、基準方向の非線形結合によって決定する実施形態も包含する。例えば、２乗正弦関数や両サイドの変化を速くし中間の変化を遅くした補正関数などを用いて算出することが可能である。 [Other Embodiments]
Note that the present invention is not limited to the embodiment in which the energy distribution unit 304 calculates the energy of the audio signal output from each speaker 40 by linear combination as described in [Equation 1] as described above. Also included are embodiments in which the energy distribution of the speaker 40 is determined by non-linear coupling in the reference direction. For example, it is possible to calculate using a square sine function or a correction function in which changes on both sides are made faster and intermediate changes are made slower.

１０…音源
２０…操作部
３０…音像定位処理部
３０１…音像定位設定部
３０２…スピーカ選択部
３０３…頭部伝達関数決定部
３０４…エネルギー配分部
３０５…頭部伝達関数テーブル
３０６…畳み込み処理部
４０…スピーカ
ＨＲＴＦ…頭部伝達関数
ＨＲＴＦ_L…左側スピーカ用頭部伝達関数
ＨＲＴＦ_R…右側スピーカ用頭部伝達関数 DESCRIPTION OF SYMBOLS 10 ... Sound source 20 ... Operation part 30 ... Sound image localization process part 301 ... Sound image localization setting part 302 ... Speaker selection part 303 ... Head-related transfer function determination part 304 ... Energy distribution part 305 ... Head-related transfer function table 306 ... Convolution process part 40 ... Speaker HRTF ... Head related transfer function HRTF _L ... Head transfer function for left speaker HRTF _R ... Head transfer function for right speaker

Claims

５チャンネルの平面配置スピーカを備え、音源からの音声信号に音像定位処理を施すことで音像を任意の位置に定位させ、当該スピーカを通じて立体音響を生成する立体音響生成システムであって、
各種操作を入力する操作部と、
音像の定位位置を生成する音像定位処理部と、を備え、
前記音像定位処理部は、
前記操作部を通じて入力された所望の音像の定位位置を設定する音像定位設定部と、
使用するスピーカを決定し、各スピーカから出力する音量配分を決定する音量配分処理部と、
前記音像の定位位置に基づいて、各スピーカから出力される音声信号に畳み込む頭部伝達関数を決定する頭部伝達関数決定部と、
使用するスピーカにより出力される音声信号に対して、頭部伝達関数決定部により決定された頭部伝達関数を畳み込み、かつ、前記音声エネルギー配分処理部により算出されたエネルギー配分を割り当てる畳み込み処理部と、を有し、
前記音量配分処理部は、
音像定位設定部による設定に拘らず、音像の第１の基準方向として、受聴者を基準に、仰角が０度で各方位角をスピーカ位置とする方向と、仰角がα_v（０度より大きく９０度より小さければ任意に複数設定可能）で各方位角をセンタースピーカ以外の隣り合うスピーカの中間位置とする方向と、仰角が９０度で各方位角を０度とする方向と、を決定する第１の基準方向決定手段と、
前記第１の基準方向のうち仰角が０度の場合には当該第１の基準方向のスピーカに前記音源からの音声信号の全音量を配分し、前記第１の基準方向のうち仰角がα_vの場合には方位角の視点で当該第１の基準方向を挟む２つの各スピーカに１／２音量を配分し、前記第１の基準方向のうち仰角が９０度の場合には受聴者を基準に前方に位置する３つの各スピーカに１／６音量を配分し、後方に位置する２つの各スピーカに１／４音量を配分する第１の音量配分手段と、
前記第１の基準方向のうちから、仰角、方位角の視点で、前記音像定位設定部により設定された音像を挟む４つの第２の基準方向を決定する第２の基準方向決定手段と、
前記音像定位設定部により設定された音像の仰角が前記α_v以下である場合に、［数１］に基づいて、前記第２の基準方向決定手段により決定された各第２の基準方向の音量配分と所定の結合係数とを乗算し、この値を決定された全ての前記第２の基準方向分で加算することで、スピーカ毎の音量配分を算出する第２の音量配分手段と、を有することを特徴とする立体音響システム。
［数１］

A stereophonic sound generating system including a five-channel planar speaker, localizing a sound image to an arbitrary position by performing sound image localization processing on an audio signal from a sound source, and generating stereophonic sound through the speaker,
An operation unit for inputting various operations;
A sound image localization processing unit that generates a localization position of the sound image,
The sound image localization processing unit
A sound image localization setting unit for setting a localization position of a desired sound image input through the operation unit;
A volume distribution processing unit that determines a speaker to be used and determines a volume distribution to be output from each speaker;
A head-related transfer function determination unit that determines a head-related transfer function to be convoluted with a sound signal output from each speaker, based on the localization position of the sound image;
A convolution processing unit that convolves the head-related transfer function determined by the head-related transfer function determination unit with respect to the audio signal output from the speaker to be used, and assigns the energy distribution calculated by the sound energy distribution processing unit; Have
The volume distribution processing unit
Regardless of the setting by the sound image localization setting unit, the first reference direction of the sound image is the direction in which the elevation angle is 0 degrees and each azimuth angle is the speaker position with respect to the listener, and the elevation angle is α _v (greater than 0 degrees). The direction in which each azimuth is set to an intermediate position between adjacent speakers other than the center speaker and the direction in which the elevation angle is 90 degrees and each azimuth is 0 degrees are determined. First reference direction determining means;
When the elevation angle is 0 degree in the first reference direction, the entire volume of the audio signal from the sound source is distributed to the speakers in the first reference direction, and the elevation angle is α _{v in} the first reference direction. In this case, ½ volume is distributed to each of the two speakers sandwiching the first reference direction from the viewpoint of the azimuth, and if the elevation angle is 90 degrees in the first reference direction, the listener is used as a reference. First volume distribution means for distributing 1/6 volume to each of the three speakers positioned in the front and distributing 1/4 volume to each of the two speakers positioned in the rear;
A second reference direction determining means for determining four second reference directions sandwiching the sound image set by the sound image localization setting unit from the viewpoint of elevation angle and azimuth angle from the first reference direction;
When the elevation angle of the sound image set by the sound image localization setting unit is equal to or less than α _v , the sound volume in each second reference direction determined by the second reference direction determination unit based on [Equation 1] A second volume distribution means for calculating the volume distribution for each speaker by multiplying the distribution by a predetermined coupling coefficient and adding this value for all the determined second reference directions; A stereophonic sound system characterized by that.
[Equation 1]

８チャンネルの平面配置スピーカを備え、音源からの音声信号に音像定位処理を施すことで音像を任意の位置に定位させ、当該スピーカを通じて立体音響を生成する立体音響生成システムであって、
各種操作を入力する操作部と、
音像の定位位置を生成する音像定位処理部と、を備え、
前記音像定位処理部は、
前記操作部を通じて入力された所望の音像の定位位置を設定する音像定位設定部と、
使用するスピーカを決定し、各スピーカから出力する音量配分を決定する音量配分処理部と、
前記音像の定位位置に基づいて、各スピーカから出力される音声信号に畳み込む頭部伝達関数を決定する頭部伝達関数決定部と、
使用するスピーカにより出力される音声信号に対して、頭部伝達関数決定部により決定された頭部伝達関数を畳み込み、かつ、前記音声エネルギー配分処理部により算出されたエネルギー配分を割り当てる畳み込み処理部と、を有し、
前記音量配分処理部は、
音像定位設定部による設定に拘らず、音像の第１の基準方向として、受聴者を基準に、仰角が０度で各方位角をスピーカ位置とする方向と、仰角がα_v（０度より大きく９０度より小さければ任意に複数設定可能）で各方位角をセンタースピーカ以外の隣り合うスピーカの中間位置とする方向と、仰角が９０度で各方位角を０度とする方向と、を決定する第１の基準方向決定手段と、
前記第１の基準方向のうち仰角が０度の場合には当該第１の基準方向のスピーカに前記音源からの音声信号の全音量を配分し、前記第１の基準方向のうち仰角がα_vの場合には方位角の視点で当該第１の基準方向を挟む２つの各スピーカに１／２音量を配分し、前記第１の基準方向のうち仰角が９０度の場合には当該第１の基準方向の各スピーカに１／８音量を配分する第１のエネルギー配分手段と、
前記第１の基準方向のうちから、仰角、方位角の視点で、前記音像定位設定部により設定された音像を挟む４つの第２の基準方向を決定する第２の基準方向決定手段と、
前記音像定位設定部により設定された音像の仰角が前記α_v以下である場合に、［数２］に基づいて、前記第２の基準方向決定手段により決定された各第２の基準方向の音量配分と所定の結合係数とを乗算し、この値を決定された全ての前記第２の基準方向分で加算することで、スピーカ毎の音量配分を算出する第２の音量配分手段と、を有することを特徴とする立体音響システム。
［数２］

A stereophonic sound generation system that includes an 8-channel planar speaker, localizes a sound image by performing sound image localization processing on a sound signal from a sound source, and generates stereophonic sound through the speaker.
An operation unit for inputting various operations;
A sound image localization processing unit that generates a localization position of the sound image,
The sound image localization processing unit
A sound image localization setting unit for setting a localization position of a desired sound image input through the operation unit;
A volume distribution processing unit that determines a speaker to be used and determines a volume distribution to be output from each speaker;
A head-related transfer function determination unit that determines a head-related transfer function to be convoluted with a sound signal output from each speaker, based on the localization position of the sound image;
A convolution processing unit that convolves the head-related transfer function determined by the head-related transfer function determination unit with respect to the audio signal output from the speaker to be used, and assigns the energy distribution calculated by the sound energy distribution processing unit; Have
The volume distribution processing unit
Regardless of the setting by the sound image localization setting unit, the first reference direction of the sound image is the direction in which the elevation angle is 0 degrees and each azimuth angle is the speaker position with respect to the listener, and the elevation angle is α _v (greater than 0 degrees). The direction in which each azimuth is set to an intermediate position between adjacent speakers other than the center speaker and the direction in which the elevation angle is 90 degrees and each azimuth is 0 degrees are determined. First reference direction determining means;
When the elevation angle is 0 degree in the first reference direction, the entire volume of the audio signal from the sound source is distributed to the speakers in the first reference direction, and the elevation angle is α _{v in} the first reference direction. In this case, ½ volume is distributed to each of the two speakers sandwiching the first reference direction from the viewpoint of the azimuth, and when the elevation angle is 90 degrees in the first reference direction, the first First energy distribution means for distributing 1/8 volume to each speaker in the reference direction;
A second reference direction determining means for determining four second reference directions sandwiching the sound image set by the sound image localization setting unit from the viewpoint of elevation angle and azimuth angle from the first reference direction;
When the elevation angle of the sound image set by the sound image localization setting unit is equal to or less than α _v , the sound volume in each second reference direction determined by the second reference direction determination means based on [Expression 2] A second volume distribution means for calculating the volume distribution for each speaker by multiplying the distribution by a predetermined coupling coefficient and adding this value for all the determined second reference directions; A stereophonic sound system characterized by that.
[Equation 2]

前記第２の音量配分手段は、前記音像定位設定部により設定された音像の仰角が前記α_vを上回る場合に、［数３］に基づいて、前記第２の基準方向決定手段により決定された各第２の基準方向の音量配分と所定の結合係数とを乗算し、この値を決定された全ての前記第２の基準方向分で加算することで、スピーカ毎の音量配分を算出することを特徴とする請求項１又は２に記載の立体音響システム。
［数３］

The second sound volume distribution unit is determined by the second reference direction determination unit based on [Equation 3] when the elevation angle of the sound image set by the sound image localization setting unit exceeds the α _v . Multiplying the volume distribution in each second reference direction by a predetermined coupling coefficient and adding this value for all the determined second reference directions to calculate the volume distribution for each speaker. The stereophonic sound system according to claim 1 or 2, characterized by the above.
[Equation 3]

前記頭部伝達関数決定部は、受聴者を基準として左側のスピーカには左側用の頭部伝達関数を、右側のスピーカには右側用の頭部伝達関数を、正面のスピーカには左及び右側用の頭部伝達関数を決定することを特徴とする請求項１又は３に記載の立体音響システム。 The head-related transfer function determining unit is based on the listener, the left-side head related transfer function for the left-side speaker, the right-side head-related transfer function for the right-side speaker, and the left and right-side for the front-side speaker. The stereophonic sound system according to claim 1 or 3, wherein a head related transfer function is determined.

前記頭部伝達関数決定部は、受聴者を基準として左側のスピーカには左側用の頭部伝達関数を、右側のスピーカには右側用の頭部伝達関数を、正面及び背面のスピーカには左及び右側用の頭部伝達関数を決定することを特徴とする請求項２又は３に記載の立体音響システム。 The head-related transfer function determining unit is based on the listener, the left-side head related transfer function for the left speaker, the right-side head related transfer function for the right-side speaker, and the left-side head-related transfer function for the front and back speakers. The stereophonic sound system according to claim 2 or 3, wherein a head-related transfer function for the right side and the right side is determined.

前記音像定位位置に応じたスピーカ毎の頭部伝達関数を格納する頭部伝達関数テーブルを備え、
前記頭部伝達関数決定部は、当該頭部伝達関数テーブルから所望の頭部伝達関数を読み出し決定することを特徴とする請求項１〜５のいずれか１項に記載の立体音響システム。 A head-related transfer function table storing a head-related transfer function for each speaker according to the sound image localization position;
The stereophonic sound system according to any one of claims 1 to 5, wherein the head-related transfer function determination unit reads and determines a desired head-related transfer function from the head-related transfer function table.

５チャンネルの平面配置スピーカを備え、音源からの音声信号に音像定位処理を施すことで音像を任意の位置に定位させ、当該スピーカを通じて立体音響を生成する立体音響生成システムの制御方法であって、
各種操作を入力する操作部と、
音像の定位位置を生成する音像定位処理部と、を備え、
前記音像定位処理部は、
前記操作部を通じて入力された所望の音像の定位位置を設定する音像定位設定ステップと、
使用するスピーカを決定し、各スピーカから出力する音量配分を決定する音量配分処理ステップと、
前記音像の定位位置に基づいて、各スピーカから出力される音声信号に畳み込む頭部伝達関数を決定する頭部伝達関数決定ステップと、
使用するスピーカにより出力される音声信号に対して、頭部伝達関数決定ステップで決定された頭部伝達関数を畳み込み、かつ、前記音声エネルギー配分処理ステップで算出されたエネルギー配分を割り当てる畳み込み処理ステップと、を実行し、
前記音量配分処理ステップは、
音像定位設定ステップによる設定に拘らず、音像の第１の基準方向として、受聴者を基準に、仰角が０度で各方位角をスピーカ位置とする方向と、仰角がα_v（０度より大きく９０度より小さければ任意に複数設定可能）で各方位角をセンタースピーカ以外の隣り合うスピーカの中間位置とする方向と、仰角が９０度で各方位角を０度とする方向と、を決定する処理と、
前記第１の基準方向のうち仰角が０度の場合には当該第１の基準方向のスピーカに前記音源からの音声信号の全音量を配分し、前記第１の基準方向のうち仰角がα_vの場合には方位角の視点で当該第１の基準方向を挟む２つの各スピーカに１／２音量を配分し、前記第１の基準方向のうち仰角が９０度の場合には受聴者を基準に前方に位置する３つの各スピーカに１／６音量を配分し、後方に位置する２つの各スピーカに１／４音量を配分する処理と、
前記第１の基準方向のうちから、仰角、方位角の視点で、前記音像定位設定ステップで設定された音像を挟む４つの第２の基準方向を決定する処理と、
前記音像定位設定ステップで設定された音像の仰角が前記α_v以下である場合に、［数４］に基づいて、決定された各第２の基準方向の音量配分と所定の結合係数とを乗算し、この値を決定された全ての前記第２の基準方向分で加算することで、スピーカ毎の音量配分を算出する処理と、を含むことを特徴とする立体音響システムの制御方法。
［数４］

A control method for a stereophonic sound generation system that includes a 5-channel planar speaker, localizes a sound image by performing sound image localization processing on a sound signal from a sound source, and generates stereophonic sound through the speaker.
An operation unit for inputting various operations;
A sound image localization processing unit that generates a localization position of the sound image,
The sound image localization processing unit
A sound image localization setting step for setting a localization position of a desired sound image input through the operation unit;
A volume distribution processing step for determining a speaker to be used and determining a volume distribution to be output from each speaker;
A head-related transfer function determination step for determining a head-related transfer function to be convoluted with an audio signal output from each speaker based on the localization position of the sound image;
A convolution processing step that convolves the head-related transfer function determined in the head-related transfer function determination step with the sound signal output by the speaker to be used, and assigns the energy distribution calculated in the sound energy distribution processing step; Run,
The volume distribution processing step includes:
Regardless of the setting in the sound image localization setting step, the first reference direction of the sound image is the direction in which the elevation angle is 0 degree and each azimuth angle is the speaker position with respect to the listener, and the elevation angle is α _v (greater than 0 degree). The direction in which each azimuth is set to an intermediate position between adjacent speakers other than the center speaker and the direction in which the elevation angle is 90 degrees and each azimuth is 0 degrees are determined. Processing,
When the elevation angle is 0 degree in the first reference direction, the entire volume of the audio signal from the sound source is distributed to the speakers in the first reference direction, and the elevation angle is α _{v in} the first reference direction. In this case, ½ volume is distributed to each of the two speakers sandwiching the first reference direction from the viewpoint of the azimuth, and if the elevation angle is 90 degrees in the first reference direction, the listener is used as a reference. Distributing 1/6 volume to each of the three speakers located in front and distributing 1/4 volume to each of the two speakers located behind;
A process of determining four second reference directions sandwiching the sound image set in the sound image localization setting step from the viewpoints of elevation angle and azimuth angle from among the first reference directions;
When the elevation angle of the sound image set in the sound image localization setting step is less than or equal to α _v , the sound volume distribution in each second reference direction determined based on [Equation 4] is multiplied by a predetermined coupling coefficient. And a process of calculating a volume distribution for each speaker by adding this value for all of the determined second reference directions, and a method for controlling a stereophonic sound system.
[Equation 4]

８チャンネルの平面配置スピーカを備え、音源からの音声信号に音像定位処理を施すことで音像を任意の位置に定位させ、当該スピーカを通じて立体音響を生成する立体音響生成システムの制御方法であって、
各種操作を入力する操作部と、
音像の定位位置を生成する音像定位処理部と、を備え、
前記音像定位処理部は、
前記操作部を通じて入力された所望の音像の定位位置を設定する音像定位設定ステップと、
使用するスピーカを決定し、各スピーカから出力する音量配分を決定する音量配分処理ステップと、
前記音像の定位位置に基づいて、各スピーカから出力される音声信号に畳み込む頭部伝達関数を決定する頭部伝達関数決定ステップと、
使用するスピーカにより出力される音声信号に対して、頭部伝達関数決定ステップで決定された頭部伝達関数を畳み込み、かつ、前記音声エネルギー配分処理ステップで算出されたエネルギー配分を割り当てる畳み込み処理ステップと、を実行し、
前記音量配分処理ステップは、
音像定位設定ステップによる設定に拘らず、音像の第１の基準方向として、受聴者を基準に、仰角が０度で各方位角をスピーカ位置とする方向と、仰角がα_v（０度より大きく９０度より小さければ任意に複数設定可能）で各方位角をセンタースピーカ以外の隣り合うスピーカの中間位置とする方向と、仰角が９０度で各方位角を０度とする方向と、を決定する処理と、
前記第１の基準方向のうち仰角が０度の場合には当該第１の基準方向のスピーカに前記音源からの音声信号の全音量を配分し、前記第１の基準方向のうち仰角がα_vの場合には方位角の視点で当該第１の基準方向を挟む２つの各スピーカに１／２音量を配分し、前記第１の基準方向のうち仰角が９０度の場合には当該第１の基準方向の各スピーカに１／８音量を配分する処理と、
前記第１の基準方向のうちから、仰角、方位角の視点で、前記音像定位設定ステップで設定された音像を挟む４つの第２の基準方向を決定する処理と、
前記音像定位設定ステップで設定された音像の仰角が前記α_v以下である場合に、［数５］に基づいて、決定された各第２の基準方向の音量配分と所定の結合係数とを乗算し、この値を決定された全ての前記第２の基準方向分で加算することで、スピーカ毎の音量配分を算出する処理と、を含むことを特徴とする立体音響システムの制御方法。
［数５］

A control method for a stereophonic sound generation system comprising an 8-channel planar speaker, localizing a sound image to an arbitrary position by applying sound image localization processing to a sound signal from a sound source, and generating stereophonic sound through the speaker,
An operation unit for inputting various operations;
A sound image localization processing unit that generates a localization position of the sound image,
The sound image localization processing unit
A sound image localization setting step for setting a localization position of a desired sound image input through the operation unit;
A volume distribution processing step for determining a speaker to be used and determining a volume distribution to be output from each speaker;
A head-related transfer function determination step for determining a head-related transfer function to be convoluted with an audio signal output from each speaker based on the localization position of the sound image;
A convolution processing step that convolves the head-related transfer function determined in the head-related transfer function determination step with the sound signal output by the speaker to be used, and assigns the energy distribution calculated in the sound energy distribution processing step; Run,
The volume distribution processing step includes:
Regardless of the setting in the sound image localization setting step, the first reference direction of the sound image is the direction in which the elevation angle is 0 degree and each azimuth angle is the speaker position with respect to the listener, and the elevation angle is α _v (greater than 0 degree). The direction in which each azimuth is set to an intermediate position between adjacent speakers other than the center speaker and the direction in which the elevation angle is 90 degrees and each azimuth is 0 degrees are determined. Processing,
When the elevation angle is 0 degree in the first reference direction, the entire volume of the audio signal from the sound source is distributed to the speakers in the first reference direction, and the elevation angle is α _{v in} the first reference direction. In this case, ½ volume is distributed to each of the two speakers sandwiching the first reference direction from the viewpoint of the azimuth, and when the elevation angle is 90 degrees in the first reference direction, the first A process of allocating 1/8 volume to each speaker in the reference direction;
A process of determining four second reference directions sandwiching the sound image set in the sound image localization setting step from the viewpoints of elevation angle and azimuth angle from among the first reference directions;
When the elevation angle of the sound image set in the sound image localization setting step is equal to or less than α _v , the sound volume distribution in each second reference direction determined by [Equation 5] is multiplied by a predetermined coupling coefficient. And a process of calculating a volume distribution for each speaker by adding this value for all of the determined second reference directions, and a method for controlling a stereophonic sound system.
[Equation 5]

前記音量配分処理ステップは、前記音像定位設定ステップで設定された音像の仰角が前記α_vを上回る場合に、［数６］に基づいて、決定された各第２の基準方向の音量配分と所定の結合係数とを乗算し、この値を決定された全ての前記第２の基準方向分で加算することで、スピーカ毎の音量配分を算出する処理を含むことを特徴とする請求項７又は８に記載の立体音響システムの制御方法。
［数６］

In the sound volume distribution processing step, when the elevation angle of the sound image set in the sound image localization setting step exceeds the α _v , the sound volume distribution in each second reference direction determined based on [Equation 6] and a predetermined value are determined. 9. A process of calculating a sound volume distribution for each speaker by multiplying the value by the coupling coefficient of the second and adding the values for all the determined second reference directions. The control method of the stereophonic sound system as described in 2.
[Equation 6]

５チャンネルの平面配置スピーカを備え、コンピュータにより、音源からの音声信号に音像定位処理を施すことで音像を任意の位置に定位させ、当該スピーカを通じて立体音響を生成させる立体音響生成システムの制御プログラムであって、
このプログラムは前記コンピュータに、
操作部を通じて入力された所望の音像の定位位置を設定する音像定位設定処理と、
使用するスピーカを決定し、各スピーカから出力する音量配分を決定する音量配分処理と、
前記音像の定位位置に基づいて、各スピーカから出力される音声信号に畳み込む頭部伝達関数を決定する頭部伝達関数決定処理と、
使用するスピーカにより出力される音声信号に対して、頭部伝達関数決定ステップで決定された頭部伝達関数を畳み込み、かつ、前記音声エネルギー配分処理により算出されたエネルギー配分を割り当てる畳み込み処理と、を実行させ、
前記音量配分処理は、
音像定位設定処理による設定に拘らず、音像の第１の基準方向として、受聴者を基準に、仰角が０度で各方位角をスピーカ位置とする方向と、仰角がα_v（０度より大きく９０度より小さければ任意に複数設定可能）で各方位角をセンタースピーカ以外の隣り合うスピーカの中間位置とする方向と、仰角が９０度で各方位角を０度とする方向と、を決定する処理と、
前記第１の基準方向のうち仰角が０度の場合には当該第１の基準方向のスピーカに前記音源からの音声信号の全音量を配分し、前記第１の基準方向のうち仰角がα_vの場合には方位角の視点で当該第１の基準方向を挟む２つの各スピーカに１／２音量を配分し、前記第１の基準方向のうち仰角が９０度の場合には受聴者を基準に前方に位置する３つの各スピーカに１／６音量を配分し、後方に位置する２つの各スピーカに１／４音量を配分する処理と、
前記第１の基準方向のうちから、仰角、方位角の視点で、前記音像定位設定処理で設定された音像を挟む４つの第２の基準方向を決定する処理と、
前記音像定位設定処理で設定された音像の仰角が前記α_v以下である場合に、［数７］に基づいて、決定された各第２の基準方向の音量配分と所定の結合係数とを乗算し、この値を決定された全ての前記第２の基準方向分で加算することで、スピーカ毎の音量配分を算出する処理と、を含むことを特徴とする立体音響システムの制御プログラム。
［数７］

A control program for a three-dimensional sound generation system that includes a five-channel planar speaker and localizes a sound image to an arbitrary position by performing sound image localization processing on a sound signal from a sound source by a computer and generates a three-dimensional sound through the speaker. There,
This program is stored in the computer
A sound image localization setting process for setting a localization position of a desired sound image input through the operation unit;
A volume distribution process for determining a speaker to be used and determining a volume distribution to be output from each speaker;
A head-related transfer function determination process for determining a head-related transfer function to be convoluted with a sound signal output from each speaker, based on the localization position of the sound image;
A convolution process in which the head-related transfer function determined in the head-related transfer function determination step is convoluted with the sound signal output from the speaker to be used, and the energy distribution calculated by the sound energy distribution process is assigned. Let it run
The volume distribution process is:
Regardless of the setting by the sound image localization setting process, the first reference direction of the sound image is a direction in which the elevation angle is 0 degrees and each azimuth angle is the speaker position with respect to the listener, and the elevation angle is α _v (greater than 0 degrees). The direction in which each azimuth is set to an intermediate position between adjacent speakers other than the center speaker and the direction in which the elevation angle is 90 degrees and each azimuth is 0 degrees are determined. Processing,
When the elevation angle is 0 degree in the first reference direction, the entire volume of the audio signal from the sound source is distributed to the speakers in the first reference direction, and the elevation angle is α _{v in} the first reference direction. In this case, ½ volume is distributed to each of the two speakers sandwiching the first reference direction from the viewpoint of the azimuth, and if the elevation angle is 90 degrees in the first reference direction, the listener is used as a reference. Distributing 1/6 volume to each of the three speakers located in front and distributing 1/4 volume to each of the two speakers located behind;
A process of determining four second reference directions sandwiching the sound image set in the sound image localization setting process from the viewpoints of elevation angle and azimuth angle from among the first reference directions;
When the elevation angle of the sound image set in the sound image localization setting process is equal to or less than α _v , the sound volume distribution in each second reference direction determined based on [Equation 7] is multiplied by a predetermined coupling coefficient. And a process of calculating a volume distribution for each speaker by adding the values for all of the determined second reference directions, a control program for a stereophonic sound system, comprising:
[Equation 7]

８チャンネルの平面配置スピーカを備え、コンピュータにより、音源からの音声信号に音像定位処理を施すことで音像を任意の位置に定位させ、当該スピーカを通じて立体音響を生成させる立体音響生成システムの制御プログラムであって、
このプログラムは前記コンピュータに、
操作部を通じて入力された所望の音像の定位位置を設定する音像定位設定処理と、
使用するスピーカを決定し、各スピーカから出力する音量配分を決定する音量配分処理処理と、
前記音像の定位位置に基づいて、各スピーカから出力される音声信号に畳み込む頭部伝達関数を決定する頭部伝達関数決定処理と、
使用するスピーカにより出力される音声信号に対して、頭部伝達関数決定ステップで決定された頭部伝達関数を畳み込み、かつ、前記音声エネルギー配分処理で算出されたエネルギー配分を割り当てる畳み込み処理と、を実行させ、
前記音量配分処理は、
音像定位設定処理による設定に拘らず、音像の第１の基準方向として、受聴者を基準に、仰角が０度で各方位角をスピーカ位置とする方向と、仰角がα_v（０度より大きく９０度より小さければ任意に複数設定可能）で各方位角をセンタースピーカ以外の隣り合うスピーカの中間位置とする方向と、仰角が９０度で各方位角を０度とする方向と、を決定する処理と、
前記第１の基準方向のうち仰角が０度の場合には当該第１の基準方向のスピーカに前記音源からの音声信号の全音量を配分し、前記第１の基準方向のうち仰角がα_vの場合には方位角の視点で当該第１の基準方向を挟む２つの各スピーカに１／２音量を配分し、前記第１の基準方向のうち仰角が９０度の場合には当該第１の基準方向の各スピーカに１／８音量を配分する処理と、
前記第１の基準方向のうちから、仰角、方位角の視点で、前記音像定位設定処理で設定された音像を挟む４つの第２の基準方向を決定する処理と、
前記音像定位設定処理で設定された音像の仰角が前記α_v以下である場合に、［数８］に基づいて、決定された各第２の基準方向の音量配分と所定の結合係数とを乗算し、この値を決定された全ての前記第２の基準方向分で加算することで、スピーカ毎の音量配分を算出する処理と、を含むことを特徴とする立体音響システムの制御プログラム。
［数８］

A control program for a stereophonic sound generation system that includes an 8-channel planar speaker and localizes a sound image at an arbitrary position by performing sound image localization processing on a sound signal from a sound source by a computer and generates stereophonic sound through the speaker. There,
This program is stored in the computer
A sound image localization setting process for setting a localization position of a desired sound image input through the operation unit;
A volume distribution process for determining a speaker to be used and determining a volume distribution to be output from each speaker;
A head-related transfer function determination process for determining a head-related transfer function to be convoluted with a sound signal output from each speaker, based on the localization position of the sound image;
A convolution process in which the head-related transfer function determined in the head-related transfer function determination step is convoluted with the sound signal output by the speaker to be used, and the energy distribution calculated in the sound energy distribution process is assigned. Let it run
The volume distribution process is:
Regardless of the setting by the sound image localization setting process, the first reference direction of the sound image is a direction in which the elevation angle is 0 degrees and each azimuth angle is the speaker position with respect to the listener, and the elevation angle is α _v (greater than 0 degrees). The direction in which each azimuth is set to an intermediate position between adjacent speakers other than the center speaker and the direction in which the elevation angle is 90 degrees and each azimuth is 0 degrees are determined. Processing,
When the elevation angle is 0 degree in the first reference direction, the entire volume of the audio signal from the sound source is distributed to the speakers in the first reference direction, and the elevation angle is α _{v in} the first reference direction. In this case, ½ volume is distributed to each of the two speakers sandwiching the first reference direction from the viewpoint of the azimuth, and when the elevation angle is 90 degrees in the first reference direction, the first A process of allocating 1/8 volume to each speaker in the reference direction;
A process of determining four second reference directions sandwiching the sound image set in the sound image localization setting process from the viewpoints of elevation angle and azimuth angle from among the first reference directions;
When the elevation angle of the sound image set in the sound image localization setting process is less than or equal to α _v , the sound volume distribution in each second reference direction determined by [Equation 8] is multiplied by a predetermined coupling coefficient. And a process of calculating a volume distribution for each speaker by adding the values for all of the determined second reference directions, a control program for a stereophonic sound system, comprising:
[Equation 8]

前記音量配分処理は、前記音像定位設定処理で設定された音像の仰角が前記α_vを上回る場合に、［数９］に基づいて、決定された各第２の基準方向の音量配分と所定の結合係数とを乗算し、この値を決定された全ての前記第２の基準方向分で加算することで、スピーカ毎の音量配分を算出する処理を含むことを特徴とする請求項１０又は１１に記載の立体音響システムの制御プログラム。
［数９］

In the volume distribution process, when the elevation angle of the sound image set in the sound image localization setting process exceeds α _v , the volume distribution in each second reference direction determined based on [Equation 9] and a predetermined The method according to claim 10 or 11, further comprising a process of calculating a volume distribution for each speaker by multiplying a coupling coefficient and adding the value for all of the determined second reference directions. The control program of the described stereophonic sound system.
[Equation 9]