JP6905332B2

JP6905332B2 - Multi-channel acoustic audio signal converter and its program

Info

Publication number: JP6905332B2
Application number: JP2016244966A
Authority: JP
Inventors: 知美小倉; 大出　訓史; 訓史大出; 一穂小野; 飯田　一博; 一博飯田
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2016-12-19
Filing date: 2016-12-19
Publication date: 2021-07-21
Anticipated expiration: 2036-12-19
Also published as: JP2018101824A

Description

本発明は、後方のチャンネルを含むマルチチャンネル音響の音声信号を実配置されるスピーカに応じて信号変換するマルチチャンネル音響の音声信号変換装置及びそのプログラムに関する。 The present invention relates to a multi-channel acoustic audio signal conversion device and a program thereof that converts a multi-channel acoustic audio signal including a rear channel according to a speaker actually arranged.

近年、２２．２ｃｈ（チャンネル）のマルチチャンネル音響システム（ＩＴＵ−Ｒ勧告ＢＳ.２０５１）や、１１．１ｃｈ、７．１ｃｈ、或いは５．１ｃｈなどの後方にスピーカを配置することを想定したマルチチャンネル音響システムが知られている。 In recent years, a multi-channel that assumes that a speaker is placed behind a 22.2ch (channel) multi-channel sound system (ITU-R recommendation BS.2051), 11.1ch, 7.1ch, or 5.1ch. Sound systems are known.

しかしながら、これらのマルチチャンネル音響システムにおけるスピーカ配置を利用者側で設置しようとしても、場所の制約などにより後方にスピーカを設置できないことがよくある。 However, even if the user tries to install the speaker arrangement in these multi-channel sound systems, it is often impossible to install the speaker in the rear due to space restrictions or the like.

その解決策として、頭部伝達関数（ＨＲＴＦ）を用いたバイノーラル再生方式が知られている（例えば、非特許文献１参照）。この方式では、受聴者本人の頭部伝達関数（ＨＲＴＦ）を鼓膜上で再現した場合に、精度の高い音像制御が実現できる。 As a solution to this, a binaural reproduction method using a head-related transfer function (HRTF) is known (see, for example, Non-Patent Document 1). In this method, highly accurate sound image control can be realized when the head-related transfer function (HRTF) of the listener himself / herself is reproduced on the eardrum.

しかし、頭部伝達関数（ＨＲＴＦ）には頭部や耳介形状に起因する個人性があり、他人の頭部伝達関数を使用してその音声を受聴すると、前後誤判定や音像の上昇などが生じやすいことが知られている（例えば、非特許文献２参照）。そして、頭部伝達関数（ＨＲＴＦ）は特殊実験室で測定装置を用いて測定するため、任意の受聴者の任意の方向の頭部伝達関数（ＨＲＴＦ）を測定することは、事実上不可能である。 However, the head-related transfer function (HRTF) has individuality due to the shape of the head and auricle, and when listening to the sound using another person's head-related transfer function, misjudgment of the front and back and an increase in the sound image occur. It is known that it is likely to occur (see, for example, Non-Patent Document 2). And since the head related transfer function (HRTF) is measured in a special laboratory using a measuring device, it is virtually impossible to measure the head related transfer function (HRTF) in any direction of any listener. be.

また、マルチチャンネル音響の音声信号を入力し、ディスプレイの周囲に配置された複数のスピーカによりマルチチャンネル音響再生を行い、臨場感を高める前方スピーカ装置が開示されている（例えば、特許文献１参照）。また、ディスプレイの下方に設置可能とし、マルチチャンネル音響を再生可能な複数のスピーカを棒状に配置した前方スピーカ装置も知られている。 Further, there is disclosed a front speaker device that inputs an audio signal of a multi-channel sound and reproduces the multi-channel sound by a plurality of speakers arranged around the display to enhance the sense of presence (see, for example, Patent Document 1). .. In addition, a front speaker device that can be installed below the display and has a plurality of speakers capable of reproducing multi-channel sound arranged in a rod shape is also known.

ただし、これらの前方スピーカ装置の技法においても頭部伝達関数の個人性の問題は解消されていない。 However, the technique of these front speaker devices does not solve the problem of individuality of the head-related transfer function.

特許５６０４３６５号明細書Japanese Patent No. 5604365

K. Matsui and A. Ando, “Binaural Reproduction of 22.2 Multichannel Sound over Loudspeakers,” AES 129th Convention, Convention Paper 8272 (2010).K. Matsui and A. Ando, “Binaural Reproduction of 22.2 Multichannel Sound over Loudspeakers,” AES 129th Convention, Convention Paper 8272 (2010). M.Morimoto and Y.Ando, "On the simulation of sound localization", J.Acoust.Soc. Jpn.(E)1,3(1980) pp.167-174M.Morimoto and Y.Ando, "On the simulation of sound localization", J.Acoust.Soc. Jpn. (E) 1,3 (1980) pp.167-174

後方のチャンネルを含むマルチチャンネル音響の音声信号を受聴する際に、後方にスピーカを設置できない場合でも、当該受聴者本人の頭部伝達関数（ＨＲＴＦ）について測定することなく、多くの受聴者が後方に音像を知覚できるようにする技法が望まれる。 When listening to a multi-channel acoustic audio signal that includes a rear channel, many listeners are rearward without measuring the head related transfer function (HRTF) of the listener, even if the speaker cannot be installed behind. A technique that makes it possible to perceive a sound image is desired.

本発明の目的は、上述の問題に鑑みて、後方のチャンネルを含むマルチチャンネル音響の音声信号を側方又は斜め前方に実配置されるスピーカから再生して後方に音像を知覚可能とするために、音声信号の信号変換を行うマルチチャンネル音響の音声信号変換装置及びそのプログラムを提供することにある。 In view of the above problems, an object of the present invention is to reproduce an audio signal of a multi-channel sound including a rear channel from a speaker actually arranged laterally or diagonally forward so that a sound image can be perceived rearward. , A present invention is to provide a multi-channel acoustic audio signal conversion device and a program thereof for signal conversion of an audio signal.

本発明におけるマルチチャンネル音響の音声信号変換装置は、後方にスピーカが設置できない場合でも、後方のチャンネルの音声信号に対し、可聴周波数帯域の１つ又は複数の帯域を抑圧する帯域抑圧フィルタを適用して信号変換を行うことで、斜め前方又は側方に実配置される一対のスピーカから後方感を強調して再生可能とし、多くの受聴者が後方に音像を知覚させることができるようにした。 The multi-channel audio audio signal converter of the present invention applies a band suppression filter that suppresses one or more audible frequency bands to the audio signal of the rear channel even when a speaker cannot be installed in the rear. By performing signal conversion, it is possible to reproduce by emphasizing the rear feeling from a pair of speakers actually arranged diagonally forward or sideways, so that many listeners can perceive the sound image in the rear.

即ち、本発明の音声信号変換装置は、後方のチャンネルを含むマルチチャンネル音響の音声信号に対して信号変換を行う音声信号変換装置であって、可聴周波数帯域内の１つ又は複数の帯域を抑圧する帯域抑圧フィルタを記憶する帯域抑圧フィルタ記憶部と、前記マルチチャンネル音響の音声信号のうち後方のチャンネルの音声信号に対し、前記帯域抑圧フィルタを適用することにより、後方感強調処理を施した音響信号を生成する後方感強調処理部と、を備え、前記マルチチャンネル音響の音声信号のうち後方に設置されるスピーカから再生することが想定されていたチャンネルの音声信号について周波数スペクトルを分析し、この分析結果を基に前記帯域抑圧フィルタ記憶部に記憶されている帯域抑圧フィルタの形状及び抑圧帯域のいずれか一方又は双方を自動的に変更可能にする分析部を更に備えることを特徴とする。 That is, the audio signal conversion device of the present invention is an audio signal conversion device that performs signal conversion for a multi-channel audio audio signal including a rear channel, and suppresses one or a plurality of bands in the audible frequency band. By applying the band suppression filter to the band suppression filter storage unit that stores the band suppression filter to be used and the audio signal of the rear channel of the multi-channel sound audio signal, the sound that has been subjected to the rear feeling enhancement processing. It is equipped with a rear feeling enhancement processing unit that generates a signal, and analyzes the frequency spectrum of the audio signal of the channel that is supposed to be reproduced from the speaker installed behind the multi-channel acoustic audio signal, and this results of the analysis and further comprising said Rukoto the analyzer to one or both of the shape and the suppression band of the band suppression filter stored in the band suppression filter storage unit based on the automatically changeable.

また、本発明の音声信号変換装置において、前記後方感強調処理を施した音響信号は、受聴者の側方又は斜め前方に実配置されるスピーカから再生して後方に音像を知覚可能とすることを特徴とする。 Further, in the audio signal conversion device of the present invention, the acoustic signal subjected to the rear feeling enhancement processing is reproduced from a speaker actually arranged on the side or diagonally forward of the listener so that the sound image can be perceived rearward. It is characterized by.

また、本発明の音声信号変換装置において、前記マルチチャンネル音響の音声信号のうち後方以外のチャンネルの音声信号と、前記後方感強調処理を施した音声信号とを重み付け加算により合成し、実配置されるスピーカから再生するための再生信号を生成する再生信号生成部を更に備えることを特徴とする。 Further, in the audio signal conversion device of the present invention, the audio signals of channels other than the rear of the multi-channel audio audio signals and the audio signals subjected to the rear feeling enhancement processing are combined by weighting addition and actually arranged. It is characterized by further including a reproduction signal generation unit for generating a reproduction signal for reproduction from a speaker.

また、本発明の音声信号変換装置において、前記再生信号生成部は、前記マルチチャンネル音響の音声信号のうち左右後方のチャンネルの音声信号に対して、左右に異なる信号レベルで前記後方感強調処理を施した音声信号と重み付け加算により合成し、側方もしくは斜め前方の実配置のスピーカから再生するための再生信号を生成することを特徴とする。 Further, in the audio signal conversion device of the present invention, the reproduction signal generation unit performs the rear feeling enhancement processing at different signal levels on the left and right with respect to the audio signals of the left and right rear channels of the multi-channel audio audio signals. It is characterized in that it is combined with the applied audio signal by weighting addition to generate a reproduction signal for reproduction from a speaker in an actual arrangement on the side or diagonally forward.

また、本発明の音声信号変換装置において、前記後方感強調処理部は、前記マルチチャンネル音響の音声信号のうち後方のチャンネルが複数ある際に、該チャンネル毎に異なる帯域抑圧フィルタを適用可能に構成されていることを特徴とする。 Further, in the audio signal conversion device of the present invention, the rear feeling enhancement processing unit is configured to be able to apply a different band suppression filter for each of the plurality of rear channels of the multi-channel acoustic audio signal. It is characterized by being done.

また、本発明の音声信号変換装置において、前記後方感強調処理部は、当該帯域抑圧フィルタについて実配置されるスピーカに応じて選択変更可能に構成されていることを特徴とする。 Further, in the audio signal conversion device of the present invention, the rear feeling enhancement processing unit is configured to be selectable and changeable according to the speaker actually arranged for the band suppression filter.

また、本発明の音声信号変換装置において、前記分析部は、前記分析結果を基に前記帯域抑圧フィルタ記憶部に記憶されている複数の帯域抑圧フィルタのうち前記後方感強調処理部によって使用する帯域抑圧フィルタを自動的に選択設定可能に構成されていることを特徴とする。 Further, in the audio signal conversion device of the present invention, the analysis unit is a band used by the rear feeling enhancement processing unit among a plurality of band suppression filters stored in the band suppression filter storage unit based on the analysis result. The feature is that the suppression filter can be automatically selected and set.

また、本発明の音声信号変換装置において、前記帯域抑圧フィルタ記憶部は、予め定めた抑圧量で略２ｋＨｚから略８ｋＨｚの間で可変帯域となる複数種の帯域抑圧フィルタを記憶していることを特徴とする。 Further, in the audio signal conversion device of the present invention, the band suppression filter storage unit stores a plurality of types of band suppression filters having a variable band between approximately 2 kHz and approximately 8 kHz with a predetermined suppression amount. It is a feature.

また、本発明の音声信号変換装置において、前記帯域抑圧フィルタ記憶部は、当該マルチチャンネル音響の音声信号における後方のチャンネルの音声信号用の頭部伝達関数の４ｋＨｚ以上の周波数帯域の中で最も低い周波数のノッチ周波数を抑圧可能とする上側遮断周波数を持つ帯域抑圧フィルタを、前記後方感強調処理部によって使用する帯域抑圧フィルタとして定め記憶していることを特徴とする。 Further, in the voice signal conversion device of the present invention, the band suppression filter storage unit is the lowest in the frequency band of 4 kHz or more of the head transmission function for the voice signal of the rear channel in the voice signal of the multi-channel sound. A band suppression filter having an upper cutoff frequency capable of suppressing the notch frequency of the frequency is defined and stored as a band suppression filter used by the rear feeling enhancement processing unit.

また、本発明のプログラムは、コンピュータを、本発明のマルチチャンネル音響の音声信号変換装置として機能させるためのプログラムとして構成される。
Also, the program of the present invention comprises a computer, a program for functioning as a multi-channel audio of the audio signal conversion apparatus of the present invention.

本発明によれば、後方にスピーカが設置できない場合でも、後方のスピーカから再生することを想定した音声信号を、受聴者本人の頭部伝達関数（ＨＲＴＦ）について測定することなく、多くの受聴者が後方に音像を知覚できるように再生できるようになる。 According to the present invention, even if a speaker cannot be installed in the rear, many listeners do not need to measure the head-related transfer function (HRTF) of the listener himself / herself to reproduce an audio signal assumed to be reproduced from the rear speaker. Will be able to reproduce the sound image backwards so that it can be perceived.

また、本発明におけるマルチチャンネル音響の音声信号変換装置は、特許文献１に開示されるディスプレイの周囲に配置された複数のスピーカによりマルチチャンネル音響再生を可能とする前方スピーカ装置や、マルチチャンネル音響を再生可能な複数のスピーカを棒状に配置した前方スピーカ装置に対しても、多くの受聴者が後方に音像を知覚できるようになる。 Further, the audio signal conversion device for multi-channel sound in the present invention includes a front speaker device that enables multi-channel sound reproduction by a plurality of speakers arranged around the display disclosed in Patent Document 1, and multi-channel sound. Even for a front speaker device in which a plurality of reproducible speakers are arranged in a rod shape, many listeners can perceive a sound image in the rear.

本発明による第１実施形態のマルチチャンネル音響の音声信号変換装置と、その周辺装置の概略構成を示すブロック図である。It is a block diagram which shows the schematic structure of the audio signal conversion apparatus of the multi-channel acoustic of 1st Embodiment by this invention, and the peripheral apparatus thereof. （ａ）乃至（ｅ）は、それぞれ本発明による第１実施形態のマルチチャンネル音響の音声信号変換装置における帯域抑圧フィルタ記憶部で記憶するフィルタ例を示す図である。(A) to (e) are diagrams showing examples of filters stored in the band suppression filter storage unit in the audio signal conversion device for the multi-channel acoustic of the first embodiment according to the present invention, respectively. 本発明による第１実施形態のマルチチャンネル音響の音声信号変換装置における後方感強調処理部の処理を示すフローチャートである。It is a flowchart which shows the processing of the back feeling enhancement processing part in the audio signal conversion apparatus of the multi-channel sound of 1st Embodiment by this invention. （ａ），（ｂ）は、それぞれ本発明による第１実施形態のマルチチャンネル音響の音声信号変換装置に係る後方に音像を知覚できる割合と帯域抑圧フィルタの抑圧帯域の関係を実験的に評価した結果を示す図である。In each of (a) and (b), the relationship between the ratio at which a sound image can be perceived rearward and the suppression band of the band suppression filter according to the audio signal conversion device of the first embodiment according to the present invention was experimentally evaluated. It is a figure which shows the result. 本発明による第２実施形態のマルチチャンネル音響の音声信号変換装置と、その周辺装置の概略構成を示すブロック図である。It is a block diagram which shows the schematic structure of the audio signal conversion device of the multi-channel sound of 2nd Embodiment by this invention, and the peripheral device thereof. 本発明による第２実施形態のマルチチャンネル音響の音声信号変換装置における再生信号処理部の処理を示すフローチャートである。It is a flowchart which shows the processing of the reproduction signal processing part in the audio signal conversion apparatus of the multi-channel sound of 2nd Embodiment by this invention. 本発明による第２実施形態のマルチチャンネル音響の音声信号変換装置における再生信号処理部の処理に係る、再生するチャンネルへの利得係数の一例を示す図である。It is a figure which shows an example of the gain coefficient to the channel to reproduce which concerns on the processing of the reproduction signal processing part in the audio signal conversion apparatus of the multi-channel sound of 2nd Embodiment by this invention. （ａ），（ｂ）は、それぞれ本発明による第１実施形態のマルチチャンネル音響の音声信号変換装置に係る典型的なスピーカ配置例を示す図である。(A) and (b) are diagrams showing typical speaker arrangement examples according to the multi-channel acoustic audio signal conversion device of the first embodiment according to the present invention, respectively. 本発明による第３実施形態のマルチチャンネル音響の音声信号変換装置と、その周辺装置の概略構成を示すブロック図である。It is a block diagram which shows the schematic structure of the audio signal conversion device of the multi-channel sound of 3rd Embodiment by this invention, and the peripheral device thereof. 本発明による第３実施形態のマルチチャンネル音響の音声信号変換装置における分析部のメイン処理を示すフローチャートである。It is a flowchart which shows the main processing of the analysis part in the audio signal conversion apparatus of the multi-channel acoustic of 3rd Embodiment by this invention. 本発明による第３実施形態のマルチチャンネル音響の音声信号変換装置における分析部のサブ処理を示すフローチャートである。It is a flowchart which shows the sub-processing of the analysis part in the audio signal conversion apparatus of the multi-channel acoustic of 3rd Embodiment by this invention. 本発明による第３実施形態のマルチチャンネル音響の音声信号変換装置における分析部のサブ処理を示すフローチャートである。It is a flowchart which shows the sub-processing of the analysis part in the audio signal conversion apparatus of the multi-channel acoustic of 3rd Embodiment by this invention. （ａ）は、本発明による第３実施形態のマルチチャンネル音響の音声信号変換装置における帯域抑圧フィルタ記憶部内に保持される典型的な基準の帯域抑圧フィルタと入力信号（後方チャンネル音声信号）の周波数スペクトラム分布の関係がずれている様子を例示する図であり、（ｂ）乃至（ｄ）は分析部の処理によって帯域抑圧フィルタのフィルタ形状を変更する様子を例示する図である。(A) is the frequency of a typical reference band suppression filter and input signal (rear channel audio signal) held in the band suppression filter storage unit in the audio signal converter of the multi-channel audio according to the third embodiment according to the present invention. It is a figure which illustrates the state that the relationship of a spectrum distribution is deviated, and (b) to (d) are a figure which illustrates how the filter shape of a band suppression filter is changed by the processing of the analysis unit. （ａ），（ｂ）は、それぞれ本発明による第３実施形態のマルチチャンネル音響の音声信号変換装置における分析部の処理によって帯域抑圧フィルタのフィルタ形状を変更する様子を例示する図である。(A) and (b) are diagrams illustrating how the filter shape of the band suppression filter is changed by the processing of the analysis unit in the audio signal conversion device for the multi-channel acoustic according to the third embodiment of the present invention, respectively.

以下、図面を参照して、本発明による各実施形態のマルチチャンネル音響の音声信号変換装置について説明する。 Hereinafter, the audio signal conversion device for multi-channel acoustics of each embodiment according to the present invention will be described with reference to the drawings.

〔第１実施形態〕
図１は、本発明による第１実施形態のマルチチャンネル音響の音声信号変換装置１０と、その周辺装置の概略構成を示すブロック図である。 [First Embodiment]
FIG. 1 is a block diagram showing a schematic configuration of a multi-channel acoustic audio signal conversion device 10 according to the first embodiment of the present invention and its peripheral devices.

図１に示す音声信号供給装置１００は、複数の音源が発音した複数の音響（歌唱音や楽器音等）の混合音の波形を示すマルチチャンネル音響の音声信号を供給可能とする装置である。特に、図１に示す音声信号供給装置１００は、本来、例示する５ｃｈのマルチチャンネル音響システムのスピーカ配置２００（図示破線）用に、後方のチャンネルの音声信号（後方チャンネル音声信号Ｘｍ）と、前方のチャンネルの音声信号（前方チャンネル音声信号Ｘｏ）と、を含むマルチチャンネル音響の音声信号を供給可能となっている。 The audio signal supply device 100 shown in FIG. 1 is a device capable of supplying an audio signal of multi-channel sound showing a waveform of a mixed sound of a plurality of sounds (singing sound, musical instrument sound, etc.) produced by a plurality of sound sources. In particular, the audio signal supply device 100 shown in FIG. 1 originally has an audio signal of the rear channel (rear channel audio signal Xm) and a front audio signal (Xm) for the speaker arrangement 200 (broken line in the figure) of the 5ch multi-channel audio system illustrated. It is possible to supply the audio signal of the channel (forward channel audio signal Xo) and the audio signal of the multi-channel sound including.

スピーカ配置２００は、受聴者の頭部Ｈを基準に前方のスピーカ（前方中央のスピーカＦＣ及び斜め前方左右に一対のスピーカＦＬ，ＦＲ）と、後方のスピーカ（斜め後方左右に一対のスピーカＢＬ，ＢＲ）が配置されることが想定され、音声信号供給装置１００から供給するマルチチャンネル音響の音声信号は、これら各スピーカによって音像を定位させることが想定されたものである。尚、音声信号供給装置１００は、低音用の放音装置（ウーハ）をスピーカ配置２００に対し追加することで５.１ｃｈのサラウンドシステムを構成することも可能なマルチチャンネル音響の音声信号を供給する装置でもよい。 The speaker arrangement 200 includes a front speaker (a speaker FC in the front center and a pair of speakers FL and FR diagonally forward and left and right) and a rear speaker (a pair of speakers BL and diagonally rear left and right) with reference to the listener's head H. BR) is assumed to be arranged, and the audio signal of the multi-channel sound supplied from the audio signal supply device 100 is assumed to localize the sound image by each of these speakers. The audio signal supply device 100 supplies a multi-channel audio audio signal that can form a 5.1ch surround system by adding a bass sound emitting device (woofer) to the speaker arrangement 200. It may be a device.

ただし、図１に示す本実施形態の例では、音声信号供給装置１００から供給するマルチチャンネル音響の音声信号は、例示する５ｃｈの変形マルチチャンネル音響システムとして実配置されるスピーカ配置３００にて再生されるようになっている。尚、本発明に直接関係しない要素、例えばスピーカ配置３００にて再生するためのパワーアンプ等の設置の図示は省略している。 However, in the example of the present embodiment shown in FIG. 1, the audio signal of the multi-channel acoustic supplied from the audio signal supply device 100 is reproduced by the speaker arrangement 300 actually arranged as the exemplary 5ch modified multi-channel acoustic system. It has become so. It should be noted that the illustration of the installation of elements not directly related to the present invention, such as a power amplifier for reproducing with the speaker arrangement 300, is omitted.

つまり、スピーカ配置３００は、受聴者の頭部Ｈを基準に前方のスピーカ（前方中央のスピーカＦＣ及び斜め前方左右に一対のスピーカＦＬ，ＦＲ）についてはスピーカ配置２００と同配値であるが、後方のスピーカ（斜め後方左右に一対のスピーカＢＬ，ＢＲ）が設置できない等の理由で、側方のスピーカ（側方左右に一対のスピーカＳＬ，ＳＲ）として設置されている。 That is, the speaker arrangement 300 has the same distribution value as the speaker arrangement 200 for the front speakers (speaker FC in the front center and a pair of speakers FL and FR diagonally forward and left and right) with reference to the listener's head H. It is installed as a side speaker (a pair of speakers SL, SR on the side left and right) because the rear speaker (a pair of speakers BL, BR on the left and right diagonally rear) cannot be installed.

このようなスピーカ配置３００において、音声信号供給装置１００から供給するマルチチャンネル音響の音声信号について、前方チャンネル音声信号Ｘｏをそれぞれ対応する前方中央のスピーカＦＣ及び斜め前方左右に一対のスピーカＦＬ，ＦＲにて再生し、後方チャンネル音声信号Ｘｍをそれぞれ斜め後方左右に一対のスピーカＢＬ，ＢＲの代わりに設置した側方左右に一対のスピーカＳＬ，ＳＲにて再生すると、音像が定位する位置がずれてしまい受聴者が誤認識してしまうことになる。 In such a speaker arrangement 300, regarding the audio signal of the multi-channel audio supplied from the audio signal supply device 100, the front channel audio signal Xo is applied to the corresponding front center speaker FC and diagonally forward left and right pair of speakers FL and FR. When the rear channel audio signal Xm is played back diagonally backward and left and right by a pair of speakers SL and SR installed in place of the pair of speakers BL and BR, the position where the sound image is localized shifts. The listener will misrecognize it.

そこで、図１に示す例では、このようなマルチチャンネル音響の音声信号のうち後方チャンネル音声信号Ｘｍについては本発明による第１実施形態の音声信号変換装置１０へ供給されるようになっている。一方、前方チャンネル音声信号Ｘｏはそれぞれ対応する前方中央のスピーカＦＣ及び斜め前方左右に一対のスピーカＦＬ，ＦＲに供給されるようになっている。 Therefore, in the example shown in FIG. 1, among such multi-channel acoustic audio signals, the rear channel audio signal Xm is supplied to the audio signal conversion device 10 of the first embodiment according to the present invention. On the other hand, the front channel audio signal Xo is supplied to the corresponding front center speaker FC and a pair of diagonal front left and right speakers FL and FR, respectively.

そして、本実施形態の音声信号変換装置１０は、帯域抑圧フィルタ記憶部１１と、後方感強調処理部１２とを備えるよう構成されている。 The audio signal conversion device 10 of the present embodiment is configured to include a band suppression filter storage unit 11 and a rear feeling enhancement processing unit 12.

帯域抑圧フィルタ記憶部１１は、その詳細は後述するが、所定数（１つを含む）の帯域抑圧フィルタを記憶している。この所定数の帯域抑圧フィルタは、フィルタ形状として複数種類あるときは「形状種別」で分類されるとともに、それぞれの形状種別毎に、予め定めた抑圧量で予め定めた抑圧帯域幅の基準の帯域抑圧フィルタが定められ、抑圧帯域として複数種類あるときは各基準の帯域抑圧フィルタに対し異なる抑圧帯域幅となる帯域抑圧フィルタが「帯域種別」で分類されている。 The band suppression filter storage unit 11 stores a predetermined number (including one) of band suppression filters, the details of which will be described later. When there are a plurality of types of filter shapes, the predetermined number of band suppression filters are classified by "shape type", and each shape type has a predetermined suppression amount and a predetermined suppression bandwidth reference band. When the suppression filter is defined and there are a plurality of types of suppression bands, the band suppression filters having different suppression bandwidths for each reference band suppression filter are classified by "band type".

「形状種別」としては、図２を参照して後述するが、矩形状（図２（ａ））、対称三角状（図２（ｂ））、非対称三角状（図２（ｃ））、変形対称三角状又は変形非対称三角状（図２（ｄ））、複数多重した変形三角状（図２（ｅ））がある。帯域抑圧フィルタは、所謂デジタルフィルタによるデジタル信号処理か、又は所謂アクティブフィルタ回路によるアナログ信号処理で実現できる。 The "shape type" will be described later with reference to FIG. 2, but is rectangular (FIG. 2 (a)), symmetrical triangle (FIG. 2 (b)), asymmetric triangle (FIG. 2 (c)), and deformed. There are symmetric triangles, deformed asymmetric triangles (FIG. 2 (d)), and multiple multiple variants (FIG. 2 (e)). The band suppression filter can be realized by digital signal processing by a so-called digital filter or analog signal processing by a so-called active filter circuit.

「帯域種別」としては、所定オクターブバンド（例えば１／３オクターブバンド）の中心周波数（例えば２ｋＨｚから８ｋＨｚの間で予め規定される周波数）を対象にして、予め定めた抑圧量（例えば９６ｄＢ）で予め定めた抑圧帯域幅（２ｋＨｚから６．３ｋＨｚ）のものを基準の帯域抑圧フィルタとして１つ用意しておき、更に、当該基準の帯域抑圧フィルタに対し異なる抑圧帯域幅（２ｋＨｚから８ｋＨｚ，２．５ｋＨｚから８ｋＨｚ等）を１つ以上用意している。 The "band type" is a predetermined suppression amount (for example, 96 dB) for the center frequency (for example, a frequency predetermined between 2 kHz and 8 kHz) of a predetermined octave band (for example, 1/3 octave band). A predetermined suppression bandwidth (2 kHz to 6.3 kHz) is prepared as a reference band suppression filter, and a suppression bandwidth different from that of the reference band suppression filter (2 kHz to 8 kHz, 2. One or more (5 kHz to 8 kHz, etc.) are prepared.

即ち、本実施形態の例では、「形状種別」で分類される所定種類数の帯域抑圧フィルタについて、それぞれの形状種別毎に、「帯域種別」で分類される複数の帯域抑圧フィルタを帯域抑圧フィルタ記憶部１１に記憶している。 That is, in the example of the present embodiment, with respect to a predetermined number of band suppression filters classified by "shape type", a plurality of band suppression filters classified by "band type" are used as band suppression filters for each shape type. It is stored in the storage unit 11.

ただし、帯域抑圧フィルタ記憶部１１にて、矩形状（図２（ａ））のもの１種類のみについて１つの基準の帯域抑圧フィルタのみを保持する形態であっても、多くの受聴者に対応させることができ一定の作用・効果が得られるが、後述する理由で、より多くの受聴者に対応させるために、「帯域種別」で分類される複数の帯域抑圧フィルタを保持する形態が望ましい。 However, even if the band suppression filter storage unit 11 holds only one reference band suppression filter for only one type of rectangular shape (FIG. 2A), it corresponds to many listeners. However, for the reason described later, it is desirable to hold a plurality of band suppression filters classified by "band type" in order to accommodate a larger number of listeners.

後方感強調処理部１２は、その詳細は後述するが、入力される後方チャンネル音声信号Ｘｍに対して、帯域抑圧フィルタ記憶部１１から読み出した帯域抑圧フィルタを適用することで後方感強調処理を行い、これにより側方のチャンネルの音声信号（後方感強調処理済みの音声信号Ｘｍ’）へと変換し、この後方感強調処理済みの音声信号Ｘｍ’を側方左右に一対のスピーカＳＬ，ＳＲへ出力する機能部である。 Although the details will be described later, the rear feeling enhancement processing unit 12 performs the rear feeling enhancement processing by applying the band suppression filter read from the band suppression filter storage unit 11 to the input rear channel audio signal Xm. As a result, it is converted into the audio signal of the side channel (audio signal Xm'with rear feeling enhancement processing), and this audio signal Xm'with rear feeling enhancement processing is converted into a pair of speakers SL, SR on the left and right sides. It is a functional part that outputs.

ここで、当該受聴者本人の頭部伝達関数（ＨＲＴＦ）を測定することなく、より多くの受聴者が後方に音像を知覚できるようにするために、当該受聴者によって、帯域抑圧フィルタ記憶部１１に保持されている種々の帯域抑圧フィルタの形状種別及び帯域種別を選択的に外部設定できるように、当該外部設定に対応する選択設定信号を発生する外部インターフェース（図示せず）が音声信号変換装置１０に設けられている。そして、後方感強調処理部１２は、この選択設定信号に対応する帯域抑圧フィルタを帯域抑圧フィルタ記憶部１１から読み出し、後方感強調処理を行うよう構成されている。 Here, in order to allow more listeners to perceive the sound image backwards without measuring the head related transfer function (HRTF) of the listener himself / herself, the band suppression filter storage unit 11 is used by the listener. An external interface (not shown) that generates a selection setting signal corresponding to the external setting is an audio signal conversion device so that the shape type and band type of various band suppression filters held in the device can be selectively externally set. It is provided in 10. Then, the rear feeling enhancement processing unit 12 is configured to read out the band suppression filter corresponding to this selection setting signal from the band suppression filter storage unit 11 and perform the rear feeling enhancement processing.

複数の帯域抑圧フィルタが帯域抑圧フィルタ記憶部１１に記憶してある場合、当該帯域抑圧フィルタを識別できる情報、例えば帯域抑圧フィルタの名称や番号なども帯域抑圧フィルタ記憶部１１に記憶させておくようにする。当該外部インターフェースを用いて、受聴者が複数の帯域抑圧フィルタの名称や番号に対して使用する帯域抑圧フィルタを選択するよう構成することができる。 When a plurality of band suppression filters are stored in the band suppression filter storage unit 11, information that can identify the band suppression filter, such as the name and number of the band suppression filter, is also stored in the band suppression filter storage unit 11. To. The external interface can be configured to allow the listener to select the band suppression filter to use for the names and numbers of the plurality of band suppression filters.

尚、本実施形態の例では、側方左右に対称に実配置される一対のスピーカＳＬ，ＳＲに対し、共通の帯域抑圧フィルタを適用する例としているが、左側方用スピーカＳＬと右側方用スピーカＳＲとで、異なる帯域抑圧フィルタを適用可能に構成してもよい。このようにチャンネル毎に異なる帯域抑圧フィルタを適用する場合、どのチャンネルに対してどの帯域抑圧フィルタを使用するのかを識別できるように、帯域抑圧フィルタを使用するチャンネルの番号も対応付けて記憶しておき、当該外部インターフェースを用いて受聴者が選択できるよう構成することができる。 In the example of this embodiment, a common band suppression filter is applied to a pair of speakers SL and SR that are actually arranged symmetrically on the left and right sides, but the left side speaker SL and the right side speaker SL are used. A different band suppression filter may be applied to the speaker SR. When applying a different band suppression filter for each channel in this way, the number of the channel that uses the band suppression filter is also stored in association with each other so that which band suppression filter is used for which channel can be identified. It can be configured to be selectable by the listener using the external interface.

このように、例示する本実施形態の音声信号変換装置１０は、後方感強調処理部１２により、入力される後方チャンネル音声信号Ｘｍに対して帯域抑圧フィルタを適用して後方感強調処理を行い、これにより側方のチャンネルの音声信号（後方感強調処理済みの音声信号Ｘｍ’）へと変換し、この後方感強調処理済みの音声信号Ｘｍ’を側方左右に一対のスピーカＳＬ，ＳＲへ出力するように構成されている。 As described above, in the audio signal conversion device 10 of the present embodiment illustrated, the rear sense enhancement processing unit 12 applies a band suppression filter to the input rear channel audio signal Xm to perform the rear sense enhancement processing. As a result, it is converted into an audio signal of a side channel (audio signal Xm'with rear feeling enhancement processing), and this audio signal Xm'with rear feeling enhancement processing is output to a pair of speakers SL and SR on the left and right sides. It is configured to do.

ただし、この音声信号変換装置１０は、前方チャンネル音声信号Ｘｏ及び後方チャンネル音声信号Ｘｍを入力する構成とすることもでき、この場合も後方感強調処理部１２により、後方チャンネル音声信号Ｘｍに対して帯域抑圧フィルタ記憶部１１から読み出した帯域抑圧フィルタを適用して後方感強調処理を行って側方のチャンネルの音声信号（後方感強調処理済みの音声信号Ｘｍ’）へと変換して出力するとともに、入力された前方チャンネル音声信号Ｘｏをそのまま出力する構成とすることもできる。 However, the audio signal conversion device 10 may be configured to input the front channel audio signal Xo and the rear channel audio signal Xm. In this case as well, the rear feeling enhancement processing unit 12 may refer to the rear channel audio signal Xm. Band suppression filter The band suppression filter read from the storage unit 11 is applied to perform backward feeling enhancement processing, and the audio signal of the side channel (audio signal Xm'with backward feeling enhancement processing) is converted and output. , The input front channel audio signal Xo can be output as it is.

尚、本実施形態の例では、前方チャンネルの音声信号Ｘｏと後方チャンネル音声信号Ｘｍを区別して入力する例を示しているが、音声信号に付随しているメタデータからどの音声信号が後方のチャンネルの音声信号であるか否かを判断し、音声信号変換装置１０内で自動的に前方（或いは側方）のチャンネルの音声信号と後方のチャンネルの音声信号に分離させる構成とすることもできる。 In the example of the present embodiment, an example in which the audio signal Xo of the front channel and the audio signal Xm of the rear channel are separately input is shown, but which audio signal is the rear channel from the metadata attached to the audio signal. It is also possible to determine whether or not it is the audio signal of the above, and automatically separate the audio signal of the front (or side) channel and the audio signal of the rear channel in the audio signal conversion device 10.

また、音声信号変換装置１０は、その入力段にマルチチャンネル音響のアナログ音声信号を入力してアナログ・デジタル変換処理を施してデジタル信号処理（本実施形態の例では、後方感強調処理）を行い、デジタル信号処理後にデジタル・アナログ変換処理を施してスピーカ配置３００へ出力する構成とすることもできる。 Further, the audio signal conversion device 10 inputs an analog audio signal of multi-channel sound to the input stage, performs analog-to-digital conversion processing, and performs digital signal processing (in the example of the present embodiment, backward feeling enhancement processing). It is also possible to perform digital-to-analog conversion processing after digital signal processing and output to the speaker arrangement 300.

図２（ａ）乃至（ｅ）には、それぞれ本発明による第１実施形態のマルチチャンネル音響の音声信号変換装置１０における帯域抑圧フィルタ記憶部１１で記憶するフィルタ例を示している。図２（ａ）乃至（ｅ）には対応する帯域抑圧フィルタの「形状種別」を示している。 2 (a) to 2 (e) show examples of filters stored by the band suppression filter storage unit 11 in the audio signal conversion device 10 for the multi-channel acoustic of the first embodiment according to the present invention, respectively. 2 (a) to 2 (e) show the "shape type" of the corresponding band suppression filter.

前述したように、帯域抑圧フィルタの「形状種別」として、矩形状（図２（ａ））、対称三角状（図２（ｂ））、非対称三角状（図２（ｃ））、変形対称三角状又は変形非対称三角状（図２（ｄ））、複数多重した変形三角状（図２（ｅ））がある。 As described above, the "shape type" of the band suppression filter includes a rectangular shape (FIG. 2 (a)), a symmetric triangle shape (FIG. 2 (b)), an asymmetric triangle shape (FIG. 2 (c)), and a deformed symmetric triangle. There are a shape or a deformed asymmetric triangle (FIG. 2 (d)) and a plurality of multiple deformed triangles (FIG. 2 (e)).

図２（ａ），（ｂ）の帯域抑圧フィルタは、下側遮断周波数Ｆ_Ｌと上側遮断周波数Ｆ_Ｕ、並びに予め定めた抑圧量Ｆ_Ｇ（例えば９６ｄＢ）、減衰傾度（例えば−９６ｄＢ／（１／２４ｏｃｔ.））によって規定することができる。減衰傾度を急峻にした場合、図２（ａ）のような矩形状になり、減衰傾度を緩やかにした場合、図２（ｂ）のような対称三角状となる。 FIG. 2 (a), the band suppression filter (b) is lower cutoff frequency _{F L} and an upper cutoff frequency _{F U,} and a predetermined amount of suppression _{F G} (e.g. 96 dB), the attenuation slope (e.g. -96 dB / (1 It can be specified by / 24oct.)). When the damping gradient is steep, it becomes a rectangular shape as shown in FIG. 2 (a), and when the damping gradient is made gentle, it becomes a symmetrical triangular shape as shown in FIG. 2 (b).

非対称三角状（図２（ｃ））の帯域抑圧フィルタは、下側遮断周波数Ｆ_Ｌと上側遮断周波数Ｆ_Ｕ、並びに下側遮断周波数Ｆ_Ｌに関する第１減衰傾度、上側遮断周波数Ｆ_Ｕに関する第２減衰傾度、抑圧量Ｆ_Ｇ（例えば９６ｄＢ）によって規定することができる。 Asymmetric triangular band suppression filter (FIG. 2 (c)), the lower cut-off frequency F _L and an upper cutoff frequency F _U, and the first attenuation slope about the lower cut-off frequency F _L, the upper cut-off frequency F _U for the second can be defined by the attenuation gradient, suppression amount _{F G} (e.g. 96 dB).

変形対称三角状又は変形非対称三角状（図２（ｄ））の帯域抑圧フィルタは、下側遮断第１周波数Ｆ_Ｌ１と上側遮断第１周波数Ｆ_Ｕ１、下側遮断第２周波数Ｆ_Ｌ２と上側遮断第２周波数Ｆ_Ｕ２、下側強調量Ｅ_Ｌと上側強調量Ｅ_Ｕ並びに下側遮断第１周波数Ｆ_Ｌ１に関する第１減衰傾度、下側遮断第２周波数Ｆ_Ｌ２に関する第２減衰傾度、上側遮断第１周波数Ｆ_Ｕ１に関する第３減衰傾度、上側遮断第２周波数Ｆ_Ｕ２に関する第４減衰傾度、及び抑圧量Ｆ_Ｇ（例えば９６ｄＢ）によって規定することができる。 The band suppression filter having a deformed symmetric triangular shape or a deformed asymmetric triangular shape (FIG. 2 (d)) has a lower cutoff first frequency _FL1 and an upper cutoff first frequency _FU1 , a lower cutoff second frequency _FL2 and an upper cutoff. a second frequency _{F U2,} lower enhancement amount _{E L} and the upper enhancement amount _{E U} and a lower cut-off first frequency _{F L1} for the first attenuation slope, the second attenuation gradient about the lower cut-off second frequency _{F L2,} the upper cutoff can be defined by a frequency _{F U1} relates third attenuation slope, a fourth attenuation gradient about the upper cut-off second frequency _{F U2,} and suppression amount _{F G} (e.g. 96 dB).

また、複数多重した変形三角状（図２（ｅ））の帯域抑圧フィルタは、複数のノッチに個別に下側遮断周波数Ｆ_Ｌと上側遮断周波数Ｆ_Ｕ、下側の減衰傾度、上側の減衰傾度、抑圧量によって規定することができる。例えばノッチＳ_Ｎ１，Ｓ_Ｎ１に対応させる抑圧量Ｆ_Ｇ１，Ｆ_Ｇ２をそれぞれ９０ｄＢ，９６ｄＢとして予め規定しておくことができる。 Further, the band suppression filter multiple multiplexed deformation triangular (FIG. 2 (e)), the lower cut-off frequency individually to a plurality of notches F _L and an upper cutoff frequency F _U, lower attenuation slope, the upper attenuation slope , Can be defined by the amount of suppression. For example the notch _S _{N1, S} suppression amount to correspond to the _N1 _{_F G1,} _F _G2 respectively 90 dB, may have been defined in advance as 96 dB.

（後方感強調処理）
図３を参照して、後方感強調処理部１２による後方感強調処理について説明する。まず、後方感強調処理部１２は、入力される後方チャンネル音声信号Ｘｍに対し、帯域抑圧フィルタ記憶部１２に保持される初期設定された帯域抑圧フィルタを適用する（ステップＳ１）。ここで、初期設定された帯域抑圧フィルタとして、矩形状（図２（ａ））の帯域抑圧フィルタによる予め定めた抑圧量（例えば９６ｄＢ）で予め定めた抑圧帯域幅（下側遮断周波数Ｆ_Ｌ＝２ｋＨｚ、上側遮断周波数Ｆ_Ｕ＝６．３ｋＨｚ）、減衰傾度（−９６ｄＢ／（１／２４ｏｃｔ.））の基準の帯域抑圧フィルタを用いるものとする。 (Backward feeling enhancement processing)
The rear feeling enhancement processing by the rear feeling enhancement processing unit 12 will be described with reference to FIG. First, the rear feeling enhancement processing unit 12 applies the initially set band suppression filter held in the band suppression filter storage unit 12 to the input rear channel audio signal Xm (step S1). _{Here, as the initially set band suppression filter, the suppression bandwidth (lower cutoff frequency FL} =) predetermined with a predetermined suppression amount (for example, 96 dB) by the rectangular (FIG. 2A) band suppression filter. 2 kHz, the upper cut-off frequency _F U = _6.3kHz), and those using reference band suppression filter of the attenuation slope (-96dB / (1 / 24oct. )).

続いて、後方感強調処理部１２は、外部からの選択設定信号の有無を監視し（ステップＳ２）、当該選択設定信号が無ければ（ステップＳ２：Ｎｏ）、後方チャンネルの音声信号Ｘｍに対し当該基準の帯域抑圧フィルタを適用した後方感強調処理済みの音声信号Ｘｍ’を、本例では側方チャンネル音声信号として出力する（ステップＳ４）。 Subsequently, the rear feeling enhancement processing unit 12 monitors the presence or absence of the selection setting signal from the outside (step S2), and if there is no selection setting signal (step S2: No), the rear channel audio signal Xm is said to be the same. In this example, the audio signal Xm'that has been subjected to the rearward enhancement processing to which the reference band suppression filter is applied is output as a side channel audio signal (step S4).

一方、当該選択設定信号が有るとき（ステップＳ２：Ｙｅｓ）、後方感強調処理部１２は、当該選択設定信号に対応する帯域抑圧フィルタを、帯域抑圧フィルタ記憶部１１から読み出して、入力される後方チャンネル音声信号Ｘｍに対して適用し（ステップＳ３）、当該選択設定信号に対応する帯域抑圧フィルタを適用した後方感強調処理済みの音声信号Ｘｍ’を、本例では側方チャンネル音声信号として出力する（ステップＳ４）。 On the other hand, when the selection setting signal is present (step S2: Yes), the rear feeling enhancement processing unit 12 reads the band suppression filter corresponding to the selection setting signal from the band suppression filter storage unit 11 and inputs it to the rear. The audio signal Xm'applied to the channel audio signal Xm (step S3) and subjected to the rearward enhancement processing to which the band suppression filter corresponding to the selection setting signal is applied is output as a side channel audio signal in this example. (Step S4).

本実施形態の例では、斜め後方左右に一対のスピーカＢＬ，ＢＲ用として想定されていた後方チャンネル音声信号Ｘｍに対して帯域抑圧フィルタを適用し、それぞれ側方左右に一対のスピーカＳＬ，ＳＲ用の側方チャンネル音声信号（後方感強調処理済みの音声信号Ｘｍ’）へと変換する例を示しているが、斜め前方左右に一対のスピーカＦＬ，ＦＲ用の斜め前方チャンネル音声信号（後方感強調処理済みの音声信号Ｘｍ’）へと変換するよう構成してもよい。この場合には、受聴者が複数の帯域抑圧フィルタの名称や番号に対して使用する帯域抑圧フィルタを選択する際に、その時の抑圧量も選択可能にして調整できるようにすることが好ましい。従って、後方感強調処理部１２は、当該帯域抑圧フィルタについて実配置されるスピーカに応じて選択変更可能に構成することができる。また、実配置されるスペーカ位置を制限しないようにするためには、マルチチャンネル音響の音声信号のうち後方に設置されたスピーカから再生することが想定された複数のチャンネルがある際に、該チャンネル毎に異なる帯域抑圧フィルタを適用可能に構成すればよい。 In the example of this embodiment, a band suppression filter is applied to the rear channel audio signal Xm, which is supposed to be used for a pair of speakers BL and BR diagonally to the left and right, and a pair of speakers SL and SR are applied to the sides and left and right, respectively. An example of converting to a side channel audio signal (audio signal Xm'with rear feeling enhancement processing) is shown, but the diagonal front channel audio signal for a pair of speakers FL and FR diagonally forward and left and right (rear feeling enhancement). It may be configured to be converted into a processed audio signal Xm'). In this case, when the listener selects the band suppression filter to be used for the names and numbers of the plurality of band suppression filters, it is preferable that the suppression amount at that time can also be selected and adjusted. Therefore, the rear feeling enhancement processing unit 12 can be configured to be selectable and changeable according to the speakers actually arranged for the band suppression filter. Further, in order not to limit the position of the actually placed spacer, when there are a plurality of channels of the multi-channel acoustic audio signal that are supposed to be reproduced from the speaker installed behind, the channel is used. A different band suppression filter may be applied to each case.

ところで、このように斜め後方左右に一対のスピーカＢＬ，ＢＲ用として想定されていた後方チャンネル音声信号Ｘｍに対して帯域抑圧フィルタを適用し、それぞれ側方左右に一対のスピーカＳＬ，ＳＲ用の側方チャンネル音声信号（後方感強調処理済みの音声信号Ｘｍ’）へと変換することで、多くの受聴者が後方に音像を知覚できるようになるか否かについて実験により検証した。 By the way, a band suppression filter is applied to the rear channel audio signal Xm that is supposed to be used for the pair of speakers BL and BR diagonally backward and left and right, and the side for the pair of speakers SL and SR are applied to the side and left and right respectively. It was experimentally verified whether or not many listeners could perceive the sound image backward by converting it into a square channel audio signal (audio signal Xm'with rearward enhancement processing).

より具体的には、１／３オクターブバンドの中心周波数１．６ｋＨｚから１２．５ｋＨｚまでの周波数帯域を様々な帯域幅で抑圧した広帯域白色雑音を、受聴者の側方左右に一対のスピーカＳＬ，ＳＲから再生して、複数の受聴者に知覚した音像の方向を回答させる主観評価実験を行った。 More specifically, a pair of speakers SL on the left and right sides of the listener suppresses wideband white noise in which the center frequency of the 1/3 octave band is suppressed from 1.6 kHz to 12.5 kHz with various bandwidths. A subjective evaluation experiment was conducted in which multiple listeners were asked to answer the direction of the perceived sound image by reproducing from the SR.

音源は２００Ｈｚ〜１７ｋＨｚの広帯域白色雑音とした。帯域抑圧フィルタは、１．６ｋＨｚから１２．５ｋＨｚの１／３オクターブバンドを１〜１０バンドまで連続して抑圧した矩形状帯域抑圧フィルタの帯域幅を可変とした５５種類とし、帯域抑圧フィルタなしを含めて全５６種類である。帯域抑圧フィルタの抑圧量はＦ_Ｇ＝９６ｄＢで固定した。刺激の時間長は１．２秒とし、定常状態を１秒、ハミング窓によるフェードイン、フェードアウトを０．１秒ずつ設けた側方チャンネル音声信号（後方感強調処理済みの音声信号Ｘｍ’）を複数の受聴者に受聴させた。尚、実験に用いたスピーカはＦＯＳＴＥＸ製ＦＥ８３Ｅ、パワーアンプはＹＡＭＡＨＡ製ＰＡ４０５０、オーディオインターフェースはＲＭＥ製ＦｉｒｅｆａｃｅＵＣ、音声信号変換装置１０を構成したパーソナルコンピュータとして、エプソン製ＥｎｄｅｖｏｒＰｒｏ５６００で構成した。 The sound source was wideband white noise of 200 Hz to 17 kHz. There are 55 types of band suppression filters with variable bandwidth, which is a rectangular band suppression filter that continuously suppresses 1/3 octave bands from 1.6 kHz to 12.5 kHz from 1 to 10 bands. There are 56 types in total including. Suppression amount of band suppression filter was fixed with _F G = 96 dB. The duration of the stimulus is 1.2 seconds, and the side channel audio signal (audio signal Xm'with rearward enhancement processing) provided with a steady state of 1 second and fade-in and fade-out of 0.1 seconds each by the humming window is used. Multiple listeners were asked to listen. The speaker used in the experiment was FE83E manufactured by FOSTEX, the power amplifier was PA4050 manufactured by YAMAHA, the audio interface was Yamaha UC manufactured by RME, and Endever Pro5600 manufactured by Epson was used as a personal computer constituting the audio signal conversion device 10.

そして、水平面内の方位角±９０°に設置した側方に左右一対のスピーカＳＬ，ＳＲから同相で刺激を提示した。スピーカＳＬ，ＳＲから当該受聴者の頭部Ｈの中心までの距離は１．２ｍであり、高さは、１．２ｍ（ほぼ当該受聴者の耳の高さ）に調整した。提示音圧レベルは、頭部中心位置で、６０ｄＢ（Ａ特性）である。当該受聴者には、音像の上昇角をマッピング法で回答させた。回答時問は６秒とした。１試行で全５６種類の実験音源を１回ずつランダムに提示し、提示順を変えて各々の当該受聴者について１０試行繰り返した。複数の当該受聴者は成人１０名である。 Then, the stimuli were presented in the same phase from the pair of left and right speakers SL and SR to the sides installed at an azimuth angle of ± 90 ° in the horizontal plane. The distance from the speakers SL and SR to the center of the listener's head H was 1.2 m, and the height was adjusted to 1.2 m (approximately the height of the listener's ears). The presented sound pressure level is 60 dB (A characteristic) at the center position of the head. The listener was asked to answer the rising angle of the sound image by the mapping method. The answer time was 6 seconds. A total of 56 types of experimental sound sources were randomly presented once in one trial, and the presentation order was changed, and 10 trials were repeated for each of the relevant listeners. The plurality of such listeners is 10 adults.

全５６種類の実験音源を基に、多くの受聴者が後方に音像を知覚できる割合の高いものから順に、上位１１パターン（Ｎｏ．１〜Ｎｏ．１１）の帯域抑圧フィルタの抑圧帯域についてまとめた結果を図４（ａ）に示している。 Based on all 56 types of experimental sound sources, the suppression bands of the band suppression filters of the top 11 patterns (No. 1 to No. 11) are summarized in order from the one with the highest rate at which many listeners can perceive the sound image backward. The results are shown in FIG. 4 (a).

図４（ａ）にてＮｏ．１として示す帯域抑圧フィルタは、矩形状（図２（ａ））の帯域抑圧フィルタによる予め定めた抑圧量（例えば９６ｄＢ）で予め定めた抑圧帯域幅（下側遮断周波数Ｆ_Ｌ＝２ｋＨｚ、上側遮断周波数Ｆ_Ｕ＝６．３ｋＨｚ）、減衰傾度−９６ｄＢ／（１／２４ｏｃｔ.）の基準の帯域抑圧フィルタに対応している。 In FIG. 4A, No. Band suppression filter shown as 1, a rectangular shape (FIG. 2 (a)) predetermined amount of suppression (e.g. 96 dB) in a predetermined suppression bandwidth (lower cut-off frequency F _{L =} 2 kHz by the band suppression filter, the upper cut-off frequency _F U = _6.3kHz), and corresponds to the reference band suppression filter of the attenuation slope -96dB / (1 / 24oct.) .

即ち、Ｎｏ．１として示す帯域抑圧フィルタのみを帯域抑圧フィルタ記憶部１２に保持しておき、後方感強調処理部１２により読み出して後方感強調処理を適用するだけでも、７８％の確率で後方に音像を知覚できている。 That is, No. Even if only the band suppression filter shown as 1 is held in the band suppression filter storage unit 12, read out by the rear feeling enhancement processing unit 12 and the rear feeling enhancement processing is applied, the sound image can be perceived backward with a probability of 78%. ing.

また、図４（ｂ）に示すように、Ｎｏ．１，２，３として示す帯域抑圧フィルタを帯域抑圧フィルタ記憶部１２に保持しておき、当該受聴者によって選択設定させ、後方感強調処理部１２により読み出して後方感強調処理を適用することで、９１％の確率で、後方に音像を知覚できるようになることが分かった。 In addition, as shown in FIG. 4 (b), No. The band suppression filters shown as 1, 2 and 3 are held in the band suppression filter storage unit 12, selected and set by the listener, and read out by the rear feeling enhancement processing unit 12 to apply the rear feeling enhancement processing. It was found that there was a 91% chance that the sound image could be perceived backwards.

一方、Ｎｏ．１，２，３として示す帯域抑圧フィルタでは抑圧帯域が広く、音響品質を損ないかねないことから、より狭い帯域幅となる複数の帯域抑圧フィルタを当該受聴者によって選択設定させるよう構成することができる。例えば、図４（ｂ）に示すように、Ｎｏ．６，７，１０，１１として示す帯域抑圧フィルタを当該受聴者によって選択設定させることで、８１％の確率で後方に音像を知覚できるようになることが分かった。また、更に狭い帯域幅となる複数の帯域抑圧フィルタとして、Ｎｏ．６，１１として示す帯域抑圧フィルタを当該受聴者によって選択設定させることで、７９％の確率で後方に音像を知覚できるようになることが分かった。 On the other hand, No. Since the band suppression filters shown as 1, 2 and 3 have a wide suppression band and may impair the acoustic quality, a plurality of band suppression filters having a narrower bandwidth can be selectively set by the listener. .. For example, as shown in FIG. 4 (b), No. It was found that by selecting and setting the band suppression filters shown as 6, 7, 10 and 11 by the listener, the sound image can be perceived backward with a probability of 81%. Further, as a plurality of band suppression filters having a narrower bandwidth, No. It was found that by selecting and setting the band suppression filters shown as 6 and 11 by the listener, the sound image can be perceived backward with a probability of 79%.

このように、概ね２ｋＨｚ〜８ｋＨｚの間で可変帯域となる複数種の帯域抑圧フィルタを帯域抑圧フィルタ記憶部１２に保持しておき、受聴者によって選択設定可能に構成することで、より多くの受聴者が後方に音像を知覚できるようになる。そして、図２に示したように、形状種別として複数種の形状の帯域抑圧フィルタを用意することで、更に一層、高品質でより多くの受聴者が後方に音像を知覚できるようになることが期待できる。 In this way, by holding a plurality of types of band suppression filters having a variable band between approximately 2 kHz and 8 kHz in the band suppression filter storage unit 12 and configuring them so that they can be selected and set by the listener, more reception is possible. The listener will be able to perceive the sound image backwards. Then, as shown in FIG. 2, by preparing band suppression filters having a plurality of shapes as shape types, it is possible for more listeners to perceive the sound image backwards with even higher quality. You can expect it.

特に、頭部伝達関数（ＨＲＴＦ）は個人性があるが、平均的な後方チャンネル音声信号用の頭部伝達関数（ＨＲＴＦ）において、４ｋＨｚ以上の周波数帯域の中で最も低い周波数のノッチ周波数は約６ｋＨｚ〜１０ｋＨｚに集中している。そこで、前述の頭部伝達関数（ＨＲＴＦ）の４ｋＨｚ以上の周波数帯域の中で最も低い周波数のノッチ周波数を抑圧可能とする上側遮断周波数を持つ帯域抑圧フィルタを、後方感強調処理部１２によって使用する帯域抑圧フィルタとして定め、帯域抑圧フィルタ記憶部１１に記憶しておくようにすることが好ましい。これにより、より多くの受聴者が後方に音像を知覚する汎用的な後方音像知覚効果が得られるようになる。 In particular, the head related transfer function (HRTF) is personal, but in the head related transfer function (HRTF) for the average rear channel audio signal, the notch frequency of the lowest frequency in the frequency band of 4 kHz or more is about. It is concentrated at 6kHz to 10kHz. Therefore, the rearward enhancement processing unit 12 uses a band suppression filter having an upper cutoff frequency that enables suppression of the notch frequency of the lowest frequency in the frequency band of 4 kHz or more of the above-mentioned head transmission function (HRTF). It is preferable to define it as a band suppression filter and store it in the band suppression filter storage unit 11. This makes it possible to obtain a general-purpose backward sound image perception effect in which more listeners perceive the sound image backward.

〔第２実施形態〕
図５は、本発明による第２実施形態のマルチチャンネル音響の音声信号変換装置１０と、その周辺装置の概略構成を示すブロック図である。尚、図５において、図１と同様な構成要素には同一の参照番号を付している。第２実施形態では、入力されるマルチチャンネル音響の音声信号のチャンネル数より少ない出力チャンネル数（スピーカ数）で斜め前方（側方でもよい）から再生する場合を想定している。 [Second Embodiment]
FIG. 5 is a block diagram showing a schematic configuration of the audio signal conversion device 10 for multi-channel acoustics according to the second embodiment of the present invention and its peripheral devices. In FIG. 5, the same reference numbers are assigned to the same components as those in FIG. In the second embodiment, it is assumed that the audio signal of the input multi-channel sound is reproduced from diagonally forward (may be sideways) with a number of output channels (number of speakers) smaller than the number of channels.

図５に示す音声信号供給装置１００は、第１実施形態と同様に、本来、例示する５ｃｈのマルチチャンネル音響システムのスピーカ配置２００（図示破線）用に、後方のチャンネルの音声信号（後方チャンネル音声信号Ｘｍ）と、前方のチャンネルの音声信号（前方チャンネル音声信号Ｘｏ）と、を含むマルチチャンネル音響の音声信号を供給可能とする装置として構成される。 Similar to the first embodiment, the audio signal supply device 100 shown in FIG. 5 originally has an audio signal of the rear channel (rear channel audio) for the speaker arrangement 200 (broken line in the figure) of the 5ch multi-channel audio system illustrated. It is configured as a device capable of supplying a multi-channel acoustic audio signal including a signal Xm) and an audio signal of the front channel (forward channel audio signal Xo).

そして、スピーカ配置２００は、受聴者の頭部Ｈを基準に前方のスピーカ（前方中央のスピーカＦＣ及び斜め前方左右に一対のスピーカＦＬ，ＦＲ）と、後方のスピーカ（斜め後方左右に一対のスピーカＢＬ，ＢＲ）に配置されることが想定され、音声信号供給装置１００から供給するマルチチャンネル音響の音声信号は、これら各スピーカによって音像を定位させることが想定されたものである。尚、音声信号供給装置１００は、低音用の放音装置（ウーハ）をスピーカ配置２００に対し追加することで５.１ｃｈのサラウンドシステムを構成することも可能なマルチチャンネル音響の音声信号を供給する装置とすることもできる。 The speaker arrangement 200 includes a front speaker (a front center speaker FC and a pair of diagonally front left and right speakers FL and FR) and a rear speaker (a pair of diagonally rear left and right speakers) with reference to the listener's head H. It is assumed that they are arranged in BL, BR), and the audio signal of the multi-channel sound supplied from the audio signal supply device 100 is assumed to localize the sound image by each of these speakers. The audio signal supply device 100 supplies a multi-channel audio audio signal that can form a 5.1ch surround system by adding a bass sound emitting device (woofer) to the speaker arrangement 200. It can also be a device.

ただし、図５に示す例では、音声信号供給装置１００から供給するマルチチャンネル音響の音声信号は、例示する３ｃｈの変形マルチチャンネル音響システムとして実配置されるスピーカ配置４００にて再生されるようになっている。スピーカ配置４００は、受聴者の頭部Ｈを基準に前方のスピーカＦＣと、斜め前方のスピーカ（スピーカＦＬ，ＦＲ）が設置されている。 However, in the example shown in FIG. 5, the audio signal of the multi-channel acoustic supplied from the audio signal supply device 100 is reproduced by the speaker arrangement 400 actually arranged as the illustrated 3ch modified multi-channel acoustic system. ing. In the speaker arrangement 400, a front speaker FC and diagonally front speakers (speakers FL, FR) are installed with reference to the listener's head H.

スピーカ配置４００において音像が定位する位置がずれて受聴者が誤認識することを抑制するために、５ｃｈのマルチチャンネル音響の音声信号は、本発明による第２実施形態の音声信号変換装置１０に供給され、この音声信号変換装置１０からスピーカ配置４００の３ｃｈの変形マルチチャンネル音響システムのスピーカ配置４００にて再生されるようになっている。尚、本発明に直接関係しない要素、例えばスピーカ配置４００にて再生するためのパワーアンプ等の設置の図示は省略している。 In order to prevent the listener from erroneously recognizing the position where the sound image is localized in the speaker arrangement 400, the audio signal of the 5ch multi-channel acoustic is supplied to the audio signal conversion device 10 of the second embodiment according to the present invention. Then, it is reproduced from the audio signal conversion device 10 by the speaker arrangement 400 of the 3ch modified multi-channel acoustic system of the speaker arrangement 400. It should be noted that the illustration of the installation of elements not directly related to the present invention, such as a power amplifier for reproducing with the speaker arrangement 400, is omitted.

そして、本実施形態の音声信号変換装置１０は、帯域抑圧フィルタ記憶部１１と、後方感強調処理部１２と、再生信号生成部１３とを備えるよう構成される。即ち、本実施形態の音声信号変換装置１０は、第１実施形態と比較して、再生信号生成部１３を更に備える点で相違する。 The audio signal conversion device 10 of the present embodiment is configured to include a band suppression filter storage unit 11, a rear feeling enhancement processing unit 12, and a reproduction signal generation unit 13. That is, the audio signal conversion device 10 of the present embodiment is different from the first embodiment in that it further includes a reproduction signal generation unit 13.

帯域抑圧フィルタ記憶部１１及び後方感強調処理部１２の動作は、第１実施形態と同様であり更なる詳細な説明は省略する。 The operation of the band suppression filter storage unit 11 and the rear feeling enhancement processing unit 12 is the same as that of the first embodiment, and further detailed description thereof will be omitted.

再生信号生成部１３は、後方感強調処理部１２から出力される後方感強調処理済みの音声信号Ｘｍ’（２チャンネル分）と前方チャンネル音声信号Ｘｏ（３チャンネル分）を入力し、予め定めた利得係数を基に重み付け加算合成処理を施して、スピーカ配置４００にて再生されるべき３チャンネル分のマルチチャンネル音響の音声信号（再生信号）を生成し、スピーカ配置４００に向けて出力する。 The reproduction signal generation unit 13 inputs the rear-sensing-enhanced processed audio signal Xm'(for 2 channels) and the front-channel audio signal Xo (for 3 channels) output from the rear-sensing enhancing processing unit 12, and determines in advance. A weighted addition / synthesis process is performed based on the gain coefficient to generate a multi-channel acoustic audio signal (reproduced signal) for three channels to be reproduced in the speaker arrangement 400, and output the audio signal (reproduced signal) toward the speaker arrangement 400.

つまり、前述した第１実施形態では、後方感強調処理部１２から出力される後方感強調処理済みの音声信号Ｘｍ’（２チャンネル分）を、側方チャンネル音声信号としてスピーカ配置３００に向けて出力するよう構成していたが、本実施形態では、前方チャンネル音声信号Ｘｏ（３チャンネル分）との重み付け加算合成処理を経て、斜め前方のスピーカ（スピーカＦＬ，ＦＲ）用の前方チャンネル音声信号が生成される。 That is, in the above-described first embodiment, the rear feeling enhancement processed audio signal Xm'(for 2 channels) output from the rear feeling enhancement processing unit 12 is output as a side channel audio signal toward the speaker arrangement 300. However, in the present embodiment, the front channel audio signal for the diagonally forward speaker (speaker FL, FR) is generated through the weighted addition / synthesis process with the front channel audio signal Xo (for 3 channels). Will be done.

（再生信号生成処理）
ここで、図６を参照して、再生信号生成部１３による再生信号生成処理について説明する。まず、再生信号生成部１３は、後方感強調処理部１２によって帯域抑圧フィルタを適用した後方感強調処理済みの音声信号Ｘｍ’（２チャンネル分）と、音声信号供給装置１００から得られる前方チャンネル音声信号Ｘｏ（３チャンネル分）を入力する（ステップＳ１１）。 (Reproduction signal generation processing)
Here, the reproduction signal generation process by the reproduction signal generation unit 13 will be described with reference to FIG. First, the reproduction signal generation unit 13 has the rear feeling enhancement processed audio signal Xm'(for 2 channels) to which the band suppression filter is applied by the rear feeling enhancement processing unit 12, and the front channel audio obtained from the audio signal supply device 100. The signal Xo (for 3 channels) is input (step S11).

続いて、再生信号生成部１３は、各チャンネルの音声信号に対し、再生するチャンネルへの予め定められた利得係数ｋ_ｉｎ（添え字ｉは出力チャンネル番号、添え字ｎは入力チャンネル番号を示す）を乗算する（ステップＳ１２）。 Then, the reproduction signal generation section 13, to the audio signals of each channel, the gain factor predetermined to the channel to be reproduced k _{in (suffix} i output channel number, the subscript n indicates the input channel number) Is multiplied (step S12).

利得係数ｋ_ｉｎの一例を図７に示している。入力信号は、前方チャンネル音声信号Ｘｏ（３チャンネル分）と、後方感強調処理済みの音声信号Ｘｍ’（２チャンネル分）の５チャンネル分であり、それぞれ本来、５ｃｈのマルチチャンネル音響システムのスピーカ配置２００における前方中央のスピーカＦＣ用、斜め前方左右に一対のスピーカＦＬ，ＦＲ用、及び斜め後方左右に一対のスピーカＢＬ，ＢＲ用の音声信号に対応しており、図７では便宜上、順番に“１ｃｈ”〜“５ｃｈ”として表し、利得係数ｋ_ｉｎにおける添え字ｎはその入力チャンネル番号に対応させている。そして、再生信号生成部１３の出力信号（再生信号）は、スピーカ配置４００における斜め前方左右に一対のスピーカＦＬ，ＦＲ用、及び前方中央のスピーカＦＣ用の音声信号に対応しており、図７では便宜上、順番に“１ｃｈ”〜“３ｃｈ”として表し、利得係数ｋ_ｉｎにおける添え字ｉはその出力チャンネル番号に対応させている。 Shows an example of a gain factor k _in Figure 7. The input signals are 5 channels of the front channel audio signal Xo (for 3 channels) and the audio signal Xm'(for 2 channels) that has been subjected to the rear feeling enhancement processing, and each is originally a speaker arrangement of a 5ch multi-channel sound system. In 200, the audio signals for the front center speaker FC, the pair of speakers FL and FR diagonally forward and left and right, and the pair of speakers BL and BR diagonally backward and left and right are supported. It is represented as "1ch" to "5ch", and _{the subscript n in} the gain coefficient quin corresponds to the input channel number. The output signal (reproduction signal) of the reproduction signal generation unit 13 corresponds to the audio signals for the pair of speakers FL, FR, and the speaker FC in the front center diagonally forward and left and right in the speaker arrangement 400, and is shown in FIG. Then, for convenience, they are sequentially represented as “1ch” to “3ch”, and _{the subscript i in} the gain coefficient kin corresponds to the output channel number.

続いて、再生信号生成部１３は、利得係数を乗算した各チャンネルの音声信号を、再生するチャンネル用に加算し、再生信号を生成する（ステップＳ１３）。即ち、図７に示すように予め規定された再生するチャンネル毎の利得係数ｋ_ｉｎの割り当てを参照して、式（１）のように５チャンネル分の入力信号ｘ_ｎ（添え字ｎは入力チャンネル番号）に利得係数ｋ_ｉｎを乗算した音声信号を加算して３チャンネル分の再生信号を生成する。 Subsequently, the reproduction signal generation unit 13 adds the audio signal of each channel multiplied by the gain coefficient for the channel to be reproduced, and generates a reproduction signal (step S13). That is, by referring to the assignment of gain factor k _in the respective channels for reproduction defined in advance as shown in FIG. 7, the input signal x _{n (subscript} n of 5 channels as shown in Equation (1) input channel It generates a reproduction signal of the addition to three channels of the audio signal multiplied by the gain factor k _in the number).

そして、再生信号生成部１３は、再生するチャンネル毎の再生信号をスピーカ配置４００に向けて出力する（ステップＳ１４）。 Then, the reproduction signal generation unit 13 outputs the reproduction signal for each channel to be reproduced toward the speaker arrangement 400 (step S14).

尚、本実施形態では、図８（ａ）に示すように、本来、受聴者の頭部Ｈの背面からの配置角度θで斜め後方左右に一対のスピーカＢＬ，ＢＲが配置されることが想定されている後方チャンネル音声信号Ｘｍであるときに、後方感強調処理済みの音声信号Ｘｍ’に対し、受聴者の頭部Ｈの正面からの配置角度θ_０で斜め前方左右に一対のスピーカＦＬ，ＦＲにおける左右のレベルバランスを考慮した利得係数ｋ_ｉｎが設定されている。 In this embodiment, as shown in FIG. 8A, it is assumed that a pair of speakers BL and BR are originally arranged diagonally backward and left and right at an arrangement angle θ from the back surface of the listener's head H. When the rear channel audio signal Xm is set, a pair of speakers FL, diagonally forward, left and right, with respect to the audio signal Xm'that has been subjected to the rear feeling enhancement processing, at an arrangement angle θ _{0 from the front of the listener's head H.} The gain coefficient quin is set _in consideration of the left and right level balance in FR.

即ち、ｓｉｎθとｓｉｎθ_０の比が、“前方左用のスピーカＦＬのための利得係数ｋ_Ｌ及び前方右用のスピーカＦＲのための利得係数ｋ_Ｒの差分”と“利得係数ｋ_Ｌ及び利得係数ｋ_Ｒの和”の比と等しくなるよう利得係数ｋ_ｉｎが設定されている。 That is, the ratio of sin [theta and sin [theta ₀ is a "gain factor" gain factor _k difference _R for gain factor _{k L} and the speaker FR for the front right for speaker FL for the front left _{"k L} and gain factor k gain factor k _in to be equal to the ratio of the sum of _R "is set.

より具体的には、図７における利得係数ｋ_１４，ｋ_２４，ｋ_１５，ｋ_２５は、例えば図８（ａ）において左右対称でθ_０＝６０°の斜め前方スピーカＦＬ，ＦＲにより、左右対称でθ＝４５°の斜め後方スピーカＢＬ，ＢＲ用の入力信号に対し後方感強調処理を施して再生する場合を想定すると、この左右のレベルバランスの調整のために従来から知られているサイン則（例えば、安藤彰男,“音場の高臨場感技術”，映像情報メディア学会誌Vol.66, No.8, pp.671-677(2012)参照）を用いることができる。 More specifically, the gain coefficients k ₁₄ , k ₂₄ , k ₁₅ , and k ₂₅ in FIG. 7 are symmetrical in FIG. 8 (a), for example, by the diagonally forward speakers FL and FR of _{θ 0 = 60 °.} Assuming that the input signals for the diagonal rear speakers BL and BR with θ = 45 ° are reproduced by applying the rear feeling enhancement processing, the sine rule conventionally known for adjusting the left and right level balance is assumed. (For example, see Akio Ando, “High Realism Technology of Sound Field”, Journal of the Institute of Image Information and Television Engineers Vol.66, No.8, pp.671-677 (2012)).

つまり、図８（ａ）において、例えば受聴者の頭部Ｈの中心から後方感強調済みの音声信号を再生するスピーカＦＬ，ＦＲまでの角度θ_０＝６０°、受聴者の頭部Ｈの中心から後方に定位させたい音像の位置での角度θを４５°、入力信号（５チャンネル分の音声信号）を前方左側のスピーカＦＬから再生する音声の利得係数をｋ_１４、前方右側のスピーカＦＲから再生する音声の利得係数をｋ_２４とすると、式（２）から求めることができる。 _{That is, in FIG. 8A, for example, the angle θ 0} = 60 ° from the center of the listener's head H to the speakers FL and FR for reproducing the audio signal whose rear feeling is emphasized, and the center of the listener's head H. The angle θ at the position of the sound image that you want to localize from is 45 °, the gain coefficient of the sound that reproduces the input signal (audio signal for 5 channels) from the front left speaker FL is k ₁₄ , and the front right speaker FR. Assuming that the gain coefficient of the reproduced sound is k ₂₄ , it can be obtained from the equation (2).

同じく、図８（ｂ）に示すように、本来、受聴者の頭部Ｈの背面からの配置角度θで斜め後方左右に一対のスピーカＢＬ，ＢＲが配置されることが想定されている後方チャンネル音声信号Ｘｍであるときに、後方感強調処理済みの音声信号Ｘｍ’に対し、受聴者の頭部Ｈの正面からの配置角度θ_０で側方左右に一対のスピーカＳＬ，ＳＲにおける左右のレベルバランスを考慮した利得係数ｋ_ｉｎを設定することができる。 Similarly, as shown in FIG. 8B, the rear channels are originally assumed to have a pair of speakers BL and BR diagonally rearward and left and right at an arrangement angle θ from the back surface of the listener's head H. When the audio signal is Xm, the left and right levels of the pair of speakers SL and SR on the left and right sides at _{an arrangement angle θ 0} from the front of the listener's head H with respect to the audio signal Xm'that has been subjected to the rearward enhancement processing. The gain coefficient quin can be set _{in consideration of the balance.}

つまり、図８（ｂ）において、例えば受聴者の頭部Ｈの中心から後方感強調済みの音声信号を再生するスピーカＳＬ，ＳＲまでの角度θ_０＝９０°、受聴者の頭部Ｈの中心から後方に定位させたい音像の位置での角度θを４５°、入力信号（５チャンネル分の音声信号）を側方左側のスピーカＳＬから再生する音声の利得係数をｋ_１４、側方右側のスピーカＳＲから再生する音声の利得係数をｋ_２４とすると、同様に、式（２）から求めることができる。 _{That is, in FIG. 8B, for example, the angle θ 0} = 90 ° from the center of the listener's head H to the speakers SL and SR for reproducing the audio signal whose rear feeling is emphasized, and the center of the listener's head H. The angle θ at the position of the sound image to be localized backward from is 45 °, the gain coefficient of the sound reproduced from the speaker SL on the left side of the input signal (audio signal for 5 channels) is k ₁₄ , and the speaker on the right side of the side. Assuming that the gain coefficient of the sound reproduced from the SR is k ₂₄ , it can be similarly obtained from the equation (2).

このような左右のレベルバランスはサイン則の他にもタンゼント則など別の方法を用いて利得係数を算出してもよい。また、本実施形態の例では、５ｃｈ或いは５．１ｃｈのマルチチャンネル音響を例に説明しているが、再生チャンネルへの割り当てとして、２２．２ｃｈや、１１．１ｃｈ、或いは１０．２ｃｈの場合など、入力チャンネル数と出力チャンネル数の組み合わせについてそれぞれの利得係数のパターンを、再生信号生成部１３により読み出し可能な記憶部（図示せず）に予め記憶しておき、図示しない外部インターフェースを用いて受聴者がその割り当てを選択可能にしておくこともできる。これにより、生信号生成部１３は、入力チャンネル数や出力チャンネル数に応じて、尚且つ左右に異なる信号レベルで重み付け加算で合成して側方もしくは斜め前方の実配置のスピーカから再生するための再生信号を生成することができる。 For such a left-right level balance, the gain coefficient may be calculated by using another method such as the Tangent's law in addition to the sine law. Further, in the example of the present embodiment, the multi-channel sound of 5ch or 5.1ch is described as an example, but the allocation to the reproduction channel is 22.2ch, 11.1ch, 10.2ch, or the like. , The pattern of each gain coefficient for the combination of the number of input channels and the number of output channels is stored in advance in a storage unit (not shown) readable by the reproduction signal generation unit 13, and received using an external interface (not shown). The assignment can also be made selectable by the listener. As a result, the raw signal generation unit 13 synthesizes the signals by weighting addition at different signal levels on the left and right according to the number of input channels and the number of output channels, and reproduces the signals from the speakers in the actual arrangement on the side or diagonally forward. A reproduction signal can be generated.

尚、このように側方又は前方のチャンネルの音声信号に対して、帯域抑圧フィルタによるフィルタ処理に遅延が生じるときは、再生信号生成部１３の出力チャンネルの音声信号として音声信号変換装置１０への入力信号のチャンネルの音声信号の位相関係を維持するよう同期を確保するために、一定量の遅延を加えてもよい。 When there is a delay in the filter processing by the band suppression filter for the audio signals of the side or front channels in this way, the audio signals of the output channel of the reproduction signal generation unit 13 are sent to the audio signal conversion device 10. A certain amount of delay may be added to ensure synchronization so as to maintain the phase relationship of the audio signals of the channels of the input signal.

以上のように、入力されるマルチチャンネル音響の音声信号のチャンネル数より少ない出力チャンネル数（スピーカ数）で側方又は斜め前方から再生する場合でも、第１実施形態と同様に、より多くの受聴者が後方に音像を知覚することができようになる。 As described above, even when reproducing from the side or diagonally forward with a number of output channels (number of speakers) smaller than the number of channels of the input multi-channel acoustic audio signal, a larger number of receivers are received as in the first embodiment. The listener will be able to perceive the sound image backwards.

従って、概ね２ｋＨｚ〜８ｋＨｚの間で可変帯域となる複数種の帯域抑圧フィルタを帯域抑圧フィルタ記憶部１２に保持しておき、受聴者によって選択設定可能に構成することで、より多くの受聴者が後方に音像を知覚できるようになる。そして、図２に示したように、形状種別として複数種の形状の帯域抑圧フィルタを用意することで、更に一層、高品質でより多くの受聴者が後方に音像を知覚できるようになることが期待できる。 Therefore, by holding a plurality of types of band suppression filters having a variable band between approximately 2 kHz and 8 kHz in the band suppression filter storage unit 12 and configuring them so that they can be selected and set by the listener, more listeners can use them. You will be able to perceive the sound image backwards. Then, as shown in FIG. 2, by preparing band suppression filters having a plurality of shapes as shape types, it is possible for more listeners to perceive the sound image backwards with even higher quality. You can expect it.

特に、頭部伝達関数（ＨＲＴＦ）は個人性があるが、平均的に、後方チャンネル音声信号用の頭部伝達関数（ＨＲＴＦ）において、４ｋＨｚ以上の周波数帯域の中で最も低い周波数のノッチ周波数は約６ｋＨｚ〜１０ｋＨｚに集中している。そこで、後方チャンネル音声信号用の頭部伝達関数（ＨＲＴＦ）の４ｋＨｚ以上の周波数帯域の中で最も低い周波数のノッチ周波数を抑圧可能に上側遮断周波数を持つ帯域抑圧フィルタを、或る帯域種別の基準の帯域抑圧フィルタとして予め設定しておくことで、より多くの受聴者が後方に音像を知覚する汎用的な後方音像知覚効果が得られるようになる。 In particular, the head related transfer function (HRTF) is unique, but on average, the head related transfer function (HRTF) for rear channel audio signals has the lowest notch frequency in the frequency band above 4 kHz. It is concentrated at about 6kHz to 10kHz. Therefore, a band suppression filter having an upper cutoff frequency that can suppress the notch frequency of the lowest frequency in the frequency band of 4 kHz or more of the head transmission function (HRTF) for the rear channel audio signal is used as a reference for a certain band type. By setting in advance as the band suppression filter of, it becomes possible to obtain a general-purpose rear sound image perception effect in which more listeners perceive the sound image rearward.

〔第３実施形態〕
図９は、本発明による第３実施形態のマルチチャンネル音響の音声信号変換装置１０と、その周辺装置の概略構成を示すブロック図である。尚、図９において、図５と同様な構成要素には同一の参照番号を付している。第３実施形態における音声信号変換装置１０では、第２実施形態の構成に加えて分析部１４が設けられ、分析部１４により入力信号（入力される後方チャンネル音声信号Ｘｍ）を周波数分析し、分析結果によって帯域抑圧フィルタ記憶部１１に記憶されている或る帯域種別の基準の帯域抑圧フィルタを変形するか、又は或る帯域種別の基準の帯域抑圧フィルタに換えて別の帯域抑圧フィルタを選択するか、が変更可能となっている。 [Third Embodiment]
FIG. 9 is a block diagram showing a schematic configuration of the audio signal conversion device 10 for multi-channel acoustics according to the third embodiment of the present invention and its peripheral devices. In FIG. 9, the same reference numbers are assigned to the same components as those in FIG. In the voice signal conversion device 10 of the third embodiment, an analysis unit 14 is provided in addition to the configuration of the second embodiment, and the analysis unit 14 frequency-analyzes and analyzes the input signal (input rear channel voice signal Xm). Depending on the result, the reference band suppression filter of a certain band type stored in the band suppression filter storage unit 11 is modified, or another band suppression filter is selected in place of the reference band suppression filter of a certain band type. Or can be changed.

尚、以下に説明する第３実施形態では、上述した第２実施形態の音声信号変換装置１０に対して更に分析部１４を設ける例を説明するが、上述した第１実施形態の音声信号変換装置１０に対して更に同様の分析部１４を設ける構成とすることができる。 In the third embodiment described below, an example in which the analysis unit 14 is further provided for the audio signal conversion device 10 of the second embodiment described above will be described, but the audio signal conversion device of the first embodiment described above will be described. A similar analysis unit 14 may be further provided for 10.

即ち、本実施形態の音声信号変換装置１０は、帯域抑圧フィルタ記憶部１１と、後方感強調処理部１２と、再生信号生成部１３と、分析部１４とを備えるよう構成され、第２実施形態と比較して、分析部１４を更に備える点で相違する。 That is, the audio signal conversion device 10 of the present embodiment is configured to include a band suppression filter storage unit 11, a rear feeling enhancement processing unit 12, a reproduction signal generation unit 13, and an analysis unit 14. The difference is that the analysis unit 14 is further provided as compared with the above.

帯域抑圧フィルタ記憶部１１、後方感強調処理部１２、及び再生信号生成部１３の動作は、第１及び第２実施形態と同様であり更なる詳細な説明は省略する。 The operations of the band suppression filter storage unit 11, the rear feeling enhancement processing unit 12, and the reproduction signal generation unit 13 are the same as those in the first and second embodiments, and further detailed description thereof will be omitted.

分析部１４は、後方チャンネル音声信号Ｘｍを入力して周波数分析を行い、その分析結果によって帯域抑圧フィルタ記憶部１１に記憶されている或る帯域種別の基準の帯域抑圧フィルタを変形するか、又は或る帯域種別の基準の帯域抑圧フィルタから別の帯域抑圧フィルタに選択することで変更可能とし、変更した際にはその旨を後方感強調処理部１２に指示して変更後の帯域抑圧フィルタを利用させる機能部である。 The analysis unit 14 inputs the rear channel audio signal Xm to perform frequency analysis, and depending on the analysis result, modifies the reference band suppression filter of a certain band type stored in the band suppression filter storage unit 11 or modifies the band suppression filter. It can be changed by selecting another band suppression filter from the reference band suppression filter of a certain band type, and when it is changed, it is instructed to the rear feeling enhancement processing unit 12 to indicate the changed band suppression filter. It is a functional part to be used.

（分析処理）
ここで、図１０乃至図１２を参照して、分析部１４による分析処理について説明する。図１０は、本実施形態のマルチチャンネル音響の音声信号変換装置１０における分析部１４のメイン処理を示すフローチャートであり、図１１及び図１２は、分析部１４のサブ処理を示すフローチャートである。 (Analytical processing)
Here, the analysis process by the analysis unit 14 will be described with reference to FIGS. 10 to 12. FIG. 10 is a flowchart showing the main processing of the analysis unit 14 in the multi-channel acoustic audio signal conversion device 10 of the present embodiment, and FIGS. 11 and 12 are flowcharts showing the sub-processing of the analysis unit 14.

尚、図１０乃至図１２に示す分析処理は、帯域抑圧フィルタ記憶部１１にて図２に示すような複数種の形状種別の帯域抑圧フィルタが保持され、更にその形状種別毎に、複数の帯域種別の帯域抑圧フィルタが保持されている場合を説明する。 In the analysis processing shown in FIGS. 10 to 12, the band suppression filter storage unit 11 holds a plurality of types of band suppression filters as shown in FIG. 2, and further, a plurality of bands are held for each shape type. A case where a type of band suppression filter is held will be described.

まず、図１０を参照するに、分析部１４は、所定帯域（例えば±１オクターブバンド）内に入力される後方チャンネル音声信号Ｘｍに対し、周波数スペクトルの分析を行う（ステップＳ２１）。 First, referring to FIG. 10, the analysis unit 14 analyzes the frequency spectrum of the rear channel audio signal Xm input in a predetermined band (for example, ± 1 octave band) (step S21).

続いて、分析部１４は、帯域抑圧フィルタ記憶部１１に保持される或る帯域種別の基準の帯域抑圧フィルタについて、変更判定処理を実行する（ステップＳ２２）。 Subsequently, the analysis unit 14 executes a change determination process for the reference band suppression filter of a certain band type held in the band suppression filter storage unit 11 (step S22).

ここで、帯域抑圧フィルタ記憶部１１にて図２（ａ）乃至（ｅ）に示すような複数種の形状種別の帯域抑圧フィルタが保持され、更にその形状種別毎に、複数の帯域種別の帯域抑圧フィルタが保持されている。このため、その形状種別毎に、或る帯域種別の基準の帯域抑圧フィルタが設定されているものとし、この或る帯域種別の基準の帯域抑圧フィルタに対して異なる帯域幅の複数の帯域種別の帯域抑圧フィルタが保持されている。 Here, the band suppression filter storage unit 11 holds a plurality of types of band suppression filters as shown in FIGS. 2 (a) to 2 (e), and further, each of the shape types holds a plurality of band types of bands. The suppression filter is retained. Therefore, it is assumed that a reference band suppression filter of a certain band type is set for each shape type, and a plurality of bandwidth types having different bandwidths with respect to the reference band suppression filter of a certain band type. The band suppression filter is retained.

そして、後述する図１１及び図１２に示すサブ処理（変更判定処理）により、その形状種別毎に、或る帯域種別の基準の帯域抑圧フィルタの形状が変更され、併せて当該異なる帯域幅の複数の帯域種別の帯域抑圧フィルタの形状が変更される。 Then, by the sub-processing (change determination processing) shown in FIGS. 11 and 12 described later, the shape of the reference band suppression filter of a certain band type is changed for each shape type, and a plurality of the different bandwidths are combined. The shape of the band suppression filter for each band type is changed.

尚、帯域抑圧フィルタ記憶部１１にて１つの或る帯域種別の基準の帯域抑圧フィルタのみが保持されているときは、その帯域抑圧フィルタの形状が変更される。 When the band suppression filter storage unit 11 holds only one reference band suppression filter for a certain band type, the shape of the band suppression filter is changed.

また、帯域抑圧フィルタの変更は、帯域抑圧フィルタ記憶部１１にて１種類の或る帯域種別の基準の帯域抑圧フィルタについて異なる帯域幅の複数の帯域種別の帯域抑圧フィルタが保持されているときに、複数有るうちの１つの帯域抑圧フィルタを選択することを含む。 Further, the band suppression filter is changed when the band suppression filter storage unit 11 holds a plurality of bandwidth suppression filters having different bandwidths for one type of reference band suppression filter of a certain bandwidth type. , Including selecting one of the plurality of band suppression filters.

図１０を参照するに、分析部１４は、当該変更判定処理後に、帯域抑圧フィルタについて変更した際にはその旨を後方感強調処理部１２に指示する（ステップＳ２３）。 With reference to FIG. 10, when the band suppression filter is changed after the change determination process, the analysis unit 14 instructs the rearward feeling enhancement processing unit 12 to that effect (step S23).

そして、分析部１４からの指示を受け付けた後方感強調処理部１２は、当該分析部１４によって変更された帯域抑圧フィルタについて、これまで使用していた形状種別の帯域抑圧フィルタに対応させてその変更後の帯域抑圧フィルタを適用するよう構成される。 Then, the rear feeling enhancement processing unit 12 that receives the instruction from the analysis unit 14 changes the band suppression filter changed by the analysis unit 14 in correspondence with the band suppression filter of the shape type that has been used so far. It is configured to apply a later band suppression filter.

（変更判定処理）
次に、変更判定処理を説明する。図１１を参照するに、分析部１４は、入力される後方チャンネル音声信号Ｘｍの周波数スペクトルのうち、周波数スペクトルの分析を行った所定帯域（例えば±１オクターブバンド）の範囲で、帯域抑圧フィルタ記憶部１１に既設定されている帯域抑圧フィルタの変更判定処理を行う。 (Change judgment processing)
Next, the change determination process will be described. With reference to FIG. 11, the analysis unit 14 stores the band suppression filter in the range of a predetermined band (for example, ± 1 octave band) in which the frequency spectrum is analyzed in the frequency spectrum of the input rear channel audio signal Xm. The change determination process of the band suppression filter already set in the unit 11 is performed.

まず、分析部１４は、当該所定帯域内に高域側ピークＳ_Ｐ２から見て所定レベル以上、例えば２０ｄＢ以上で所定幅以上（例えば０．１２５％）のノッチが有るか否かを監視する（ステップＳ３１）。 First, the analysis portion 14, a predetermined level or higher as viewed from the high frequency side peak S _P2 within the predetermined band, monitors whether a notch is present in more than a predetermined width (e.g., 0.125%), for example, 20dB or more ( Step S31).

当該所定帯域内にて２０ｄＢ以上のノッチが有るとき（ステップＳ３１：Ｙｅｓ）、分析部１４は、後述する図１３（ａ）の上側図に示すような当該所定帯域内におけるピークＳ_Ｐ１に基づく半値Ｓ_Ｌ、ピークＳ_Ｐ２に基づく半値Ｓ_Ｕ、及びノッチＳ_Ｎ（後述する図１４（ａ）の上側図に示すＳ_Ｎ１, Ｓ_Ｎ２を含む）を保持する（ステップＳ３２）。 When there is a notch of 20 dB or more in the predetermined band (step S31: Yes), the analysis unit 14 determines the half price based on the _{peak SP1 in the predetermined band as shown in the upper diagram of FIG. 13 (a) described later.} S _L, half _{S U} based on the peak _{S P2,} and to hold the notch _{S N} (including _{S N1,} S _N2 shown in the upper diagram of FIG. 14 to be described later (a)) (step S32).

続いて、分析部１４は、帯域抑圧フィルタ記憶部１１に保持されるうちの帯域抑圧フィルタにおける特定の形状種別について、特定の帯域種別のものを選択する（ステップＳ３３）。 Subsequently, the analysis unit 14 selects a specific band type for the specific shape type of the band suppression filter held in the band suppression filter storage unit 11 (step S33).

続いて、分析部１４は、図１２を参照するに、当該特定の帯域種別の帯域抑圧フィルタＳ_Ｕを基に｜Ｓ_Ｕ−Ｆ_Ｕ｜が閾値Ｔｈを超えてずれていないかを判定する（ステップＳ３４）。 Next, the analysis portion 14, referring to FIG. 12, based on the band suppression filter S _U of the specific band type _| S U _-F U | determines is not displaced beyond a threshold value Th ( Step S34).

閾値Ｔｈを超えてずれているとき（ステップＳ３４：Ｎｏ）、分析部１４は、当該特定の形状種別の帯域抑圧フィルタにおける全ての帯域種別のものについて変更なしとして判定し（ステップＳ３５）、ステップＳ３８に移行する。 When the deviation exceeds the threshold value Th (step S34: No), the analysis unit 14 determines that there is no change in all the band types in the band suppression filter of the specific shape type (step S35), and steps S38. Move to.

一方、閾値Ｔｈを超えてずれていないとき（ステップＳ３４：Ｙｅｓ）、分析部１４は、当該特定の帯域種別のものを閾値Ｔｈ以上、上側遮断周波数Ｆ_Ｕが下限周波数Ｆ_Ｕｍｉｎ以上、上限周波数Ｆ_Ｕｍａｘ以下となるよう所定値で変更する（ステップＳ３６）。この変更処理は、閾値Ｔｈ以上、上側遮断周波数Ｆ_Ｕが予め定めた下限周波数Ｆ_Ｕｍｉｎ以上、及び予め定めた上限周波数Ｆ_Ｕｍａｘ以下となる複数有るうちの１つを選択することを含む。つまり、当該特定の帯域種別のフィルタ（各帯域種別における基準の帯域抑圧フィルタ）の形状を直接的に所定値で形状変更するか、或いは複数の帯域抑圧フィルタが保持されているうちの閾値Ｔｈ以上、上側遮断周波数Ｆ_Ｕが予め定めた下限周波数Ｆ_Ｕｍｉｎ以上、及び予め定めた上限周波数Ｆ_Ｕｍａｘ以下となる１つを選択するよう構成することができる。例えば、Ｆ_Ｕｍｉｎ＝６．３ｋＨｚ、上限周波数Ｆ_Ｕｍａｘ＝８ｋＨｚとすることができる。 Meanwhile, when no deviation exceeds the threshold value Th (step S34: Yes), the analysis unit 14, the specific band type ones the threshold value Th or more, the upper cut-off frequency _{F U} is a lower limit frequency _{F Umin} above, the upper limit frequency F It is changed by a predetermined value so as to be _{Umax or less (step S36).} This change process includes the threshold Th or more, the upper cut-off frequency F _U is predetermined lower limit frequency F _Umin above, and selecting one of the plurality of equal to or less than the upper limit frequency F _Umax a predetermined. That is, the shape of the filter of the specific band type (reference band suppression filter in each band type) is directly changed by a predetermined value, or the threshold Th of or more while a plurality of band suppression filters are held. may be configured such that the upper cut-off frequency _{F U} is predetermined lower limit frequency _{F Umin} above, and selects one of the predetermined upper limit frequency _{F Umax} below. For _example, it is possible to F Umin = 6.3 kHz, the upper limit frequency _F Umax = 8 kHz to.

続いて、分析部１４は、当該特定の形状種別の帯域抑圧フィルタについて、当該所定値を用いて全ての帯域種別のものを変更する（ステップＳ３７）。これは、図９に示すように、外部インターフェースで受聴者が形状種別・帯域種別に関する選択する余地を残しているためである。 Subsequently, the analysis unit 14 changes all the band types of the band suppression filter of the specific shape type by using the predetermined value (step S37). This is because, as shown in FIG. 9, the listener has room to select the shape type and band type on the external interface.

続いて、分析部１４は、全ての形状種別の帯域抑圧フィルタの変更判定が終了したか否かを判定する（ステップＳ３８）。全ての形状種別の帯域抑圧フィルタの変更判定が終了していないとき（ステップＳ３８：Ｎｏ）、図１１に示すステップＳ３３に移行し、別の形状種別の帯域抑圧フィルタの選択を行う。全ての形状種別の帯域抑圧フィルタの変更判定が終了していれば（ステップＳ３８：Ｙｅｓ）、帯域抑圧フィルタの変更判定処理を終了する。 Subsequently, the analysis unit 14 determines whether or not the change determination of the band suppression filters of all the shape types is completed (step S38). When the change determination of the band suppression filters of all the shape types is not completed (step S38: No), the process proceeds to step S33 shown in FIG. 11 to select a band suppression filter of another shape type. If the change determination of the band suppression filter of all the shape types is completed (step S38: Yes), the change determination process of the band suppression filter is completed.

尚、分析部１４は、帯域抑圧フィルタの変更判定に係る帯域内に複数のノッチを検出した場合、全てのノッチを順に走査（例えば、高域のノッチから順に走査）して、変更判定処理を実行する。 When a plurality of notches are detected in the band related to the change determination of the band suppression filter, the analysis unit 14 scans all the notches in order (for example, scans in order from the high frequency notch) to perform the change determination process. Execute.

このように、分析部１４は、音声信号変換装置に入力されるマルチチャンネル音響の音声信号のうち後方に設置されるスピーカから再生することが想定されていたチャンネルの音声信号について周波数スペクトルを分析し、この分析結果を基に帯域抑圧フィルタ記憶部１１に記憶されている帯域抑圧フィルタの形状及び抑圧帯域のいずれか一方又は双方を自動的に変更可能に構成される。 In this way, the analysis unit 14 analyzes the frequency spectrum of the audio signal of the channel that is supposed to be reproduced from the speaker installed behind the audio signal of the multi-channel audio input to the audio signal converter. Based on this analysis result, one or both of the shape of the band suppression filter and the suppression band stored in the band suppression filter storage unit 11 can be automatically changed.

特に、分析部１４は、その分析結果を基に帯域抑圧フィルタ記憶部１１に記憶されている複数の帯域抑圧フィルタのうち後方感強調処理部１２によって使用する帯域抑圧フィルタを自動的に選択設定するよう構成することができる。 In particular, the analysis unit 14 automatically selects and sets the band suppression filter to be used by the rear feeling enhancement processing unit 12 among the plurality of band suppression filters stored in the band suppression filter storage unit 11 based on the analysis result. Can be configured as

図１３（ａ）は、例えば図２（ａ）に示す矩形状の帯域抑圧フィルタについて、入力信号（後方チャンネル音声信号）の周波数スペクトラム分布に対し、帯域抑圧フィルタ記憶部１１に記憶されている特定の帯域種別の帯域抑圧フィルタの｜Ｓ_Ｕ−Ｆ_Ｕ｜が閾値Ｔｈを超えてずれていない様子を示している。 13 (a) shows, for example, the rectangular band suppression filter shown in FIG. 2 (a), which is stored in the band suppression filter storage unit 11 with respect to the frequency spectrum distribution of the input signal (rear channel audio signal). the band suppression filter band type _| S U _-F U | indicates a state that does not deviate by more than a threshold value Th.

そして、分析部１４による帯域抑圧フィルタの変更処理によって、例えば図２（ａ）に示す矩形状の帯域抑圧フィルタについては、図１３（ｂ）に示すように、｜Ｓ_Ｕ−Ｆ_Ｕ｜＜Ｔｈのときに、Ｆ_Ｕ＞Ｓ_Ｎ、且つＴｈ≦｜Ｓ_Ｕ−Ｆ_Ｕ｜、且つＦ_Ｕｍｉｎ≦Ｆ_Ｕ≦Ｆ_Ｕｍａｘとなるよう所定値で中心周波数の変更を行い基準の帯域抑圧フィルタとして更新する。或いは複数有るうちの選択によるときは、これを満たす中心周波数のものを基準の帯域抑圧フィルタとして選択更新する。 Then, the processing of changing the band suppression filter according to the analysis unit 14, the rectangular band suppression filter shown in FIG. 2 (a) for example, as shown in FIG. _{_{13 (b), | S U}} -F U | <Th when _the, F U> _{S _N,} and Th ≦ of _| S U _-F U _|, and updates as _{_{_F} Umin} ≦ _F _U ≦ _F _Umax become as reference band suppression filter to change the center frequency by a predetermined value .. Alternatively, when a selection is made among a plurality of them, the one having a center frequency satisfying this is selected and updated as a reference band suppression filter.

また、図１３（ｃ）に示すように、｜Ｓ_Ｕ−Ｆ_Ｕ｜＜Ｔｈのときに、Ｆ_Ｕ＞Ｓ_Ｎ、且つＴｈ≦｜Ｓ_Ｕ−Ｆ_Ｕ｜、且つＦ_Ｕｍｉｎ≦Ｆ_Ｕ≦Ｆ_Ｕｍａｘとなるよう所定値で帯域幅の変更を行い基準の帯域抑圧フィルタとして更新する。或いは複数有るうちの選択によるときは、これを満たす帯域幅のものを基準の帯域抑圧フィルタとして選択更新する。 Further, as shown in FIG. _{_{13 (c), | S U}} -F U | < when _Th, F U> _{S _N,} and _{_{Th ≦ | S U -F U |}} , and _{_F Umin} ≦ _F _U ≦ _F _{The bandwidth is changed by a predetermined value so as to be Umax,} and the bandwidth is updated as a reference band suppression filter. Alternatively, when a selection is made among a plurality of them, the one having a bandwidth satisfying this is selected and updated as a reference band suppression filter.

また、図１３（ｄ）に示すように、Ｔｈ≦｜Ｓ_Ｕ−Ｆ_Ｕ｜、且つＦ_Ｕｍｉｎ≦Ｆ_Ｕ≦Ｆ_Ｕｍａｘを満たすが、Ｓ_Ｎ＜Ｆ_Ｇのときに、Ｆ_Ｇ’＝Ｓ_Ｎ＋α＞Ｆ_Ｇ（α：マージン）となるよう所定値で抑圧量の変更を行い基準の帯域抑圧フィルタとする。或いは複数有るうちの選択によるときは、これを満たす抑圧量のものを基準の帯域抑圧フィルタとして選択更新する。 Further, as shown in FIG. _{13 (d), Th ≦ |} S U -F U |, but and satisfy _{_{_{F Umin ≦ F U ≦ F Umax}}} , when _{_{S N <F G, F G}} '= S N + α> F G: a band suppression filter criteria to change the reduction amount with a predetermined value so that the _(alpha margin). Alternatively, when a selection is made among a plurality of them, the suppression amount satisfying the selection is selected and updated as the reference band suppression filter.

更に、図２（ｂ）乃至図２（ｄ）に示す三角状ベースの帯域抑圧フィルタについては、図１４（ａ）に示すように、｜Ｓ_Ｕ−Ｆ_Ｕ｜＜Ｔｈのときに、Ｆ_Ｕ＞Ｓ_Ｎ、且つＴｈ≦｜Ｓ_Ｕ−Ｆ_Ｕ｜、且つＦ_Ｕｍｉｎ≦Ｆ_Ｕ≦Ｆ_Ｕｍａｘとなるよう所定値で減衰傾度の変更を行い基準の帯域抑圧フィルタとして更新する。或いは複数有るうちの選択によるときは、これを満たす減衰傾度のものを基準の帯域抑圧フィルタとして選択更新する。 Furthermore, for the triangular base of the band suppression filter shown in FIG. 2 (b) to FIG. 2 (d), the as shown in FIG. _{_{14 (a), | S U}} -F U | when <Th, _{F U} > _{S N,} and _{_{Th ≦ | S U -F U |}} , and updates as a band suppression filter criteria to change the attenuation slope at a predetermined value so as to be _{_{_{F Umin ≦ F U ≦ F Umax}}} . Alternatively, when a selection is made among a plurality of them, the one having an attenuation gradient satisfying this is selected and updated as a reference band suppression filter.

帯域抑圧フィルタの変更判定処理は、当該判定対象の所定帯域内における入力信号のノッチ数と帯域抑圧フィルタのノッチ数が一致するか否かで帯域抑圧フィルタの変更判定を行うようにしてもよい。例えば、図２（ｅ）に示す複数ノッチの帯域抑圧フィルタについては、図１４（ｂ）に示すように、入力信号のノッチ数と帯域抑圧フィルタのノッチ数が一致するときに、ノッチ数が一致しないよう形状変更を行い基準の帯域抑圧フィルタとして更新する。或いは複数有るうちの選択によるときは、これを満たすノッチ数のものを基準の帯域抑圧フィルタとして選択更新する。 In the band suppression filter change determination process, the band suppression filter change determination may be performed based on whether or not the number of notches of the input signal and the number of notches of the band suppression filter in the predetermined band to be determined match. For example, with respect to the multi-notch band suppression filter shown in FIG. 2 (e), as shown in FIG. 14 (b), when the number of notches in the input signal and the number of notches in the band suppression filter match, the number of notches matches. Change the shape so that it does not occur, and update it as a reference band suppression filter. Alternatively, when a plurality of notches are selected, the one having the number of notches satisfying this is selected and updated as the reference band suppression filter.

以上のように、分析部１４は、入力信号（後方チャンネル音声信号Ｘｍ）の周波数スペクトルの分析処理によって、入力信号にノッチもしくはピークがみられる場合、基準とする帯域抑圧フィルタよりも、図１３に示すように抑圧する中心周波数が高い又は低いフィルタとなるように変更又は選択することや、抑圧する帯域幅がより広い又は狭いフィルタとなるように変更又は選択することや、抑圧量を変更又は選択することや、減衰傾度がより急峻な又は緩やかなフィルタとなるように変形又は選択することや、ノッチ数を変更するように変形または選択することで、基準の帯域抑圧フィルタを変更することができる。上記の例では、入力信号の特徴としてピークやノッチを用いたが、入力信号が抑圧帯域を含むか否かなど、帯域抑圧による周波数特性の変化が確認できるものであればよい。 As described above, when a notch or a peak is found in the input signal by the analysis processing of the frequency spectrum of the input signal (rear channel audio signal Xm), the analysis unit 14 is shown in FIG. 13 rather than the reference band suppression filter. As shown, changing or selecting a filter with a high or low center frequency to suppress, changing or selecting a filter with a wider or narrower bandwidth to suppress, and changing or selecting the amount of suppression. The reference band suppression filter can be changed by modifying or selecting so that the filter has a steeper or gentler attenuation gradient, or by modifying or selecting so as to change the number of notches. .. In the above example, peaks and notches are used as the characteristics of the input signal, but any change in frequency characteristics due to band suppression, such as whether or not the input signal includes a suppression band, can be confirmed.

そして、音声信号が入力されている間、周波数分析は４００ｍｓ毎など一定時間毎に所定オクターブバンド単位で分析を行うことで、動的に帯域抑圧フィルタを変更することができる。 Then, while the audio signal is being input, the band suppression filter can be dynamically changed by performing the frequency analysis in units of predetermined octave bands at regular time intervals such as every 400 ms.

これにより、帯域抑圧フィルタ記憶部１１にて図２（ａ）乃至（ｅ）に示すような複数種の形状種別の帯域抑圧フィルタが保持され、更にその形状種別毎に、複数の帯域種別の帯域抑圧フィルタが保持されているときに、その形状種別毎に、或る帯域種別の基準の帯域抑圧フィルタの形状が変更され、併せて当該異なる帯域幅の複数の帯域種別の帯域抑圧フィルタの形状が変更される。この場合、図９に示すように、外部インターフェースで受聴者が形状種別・帯域種別に関する選択する余地を残されている。 As a result, the band suppression filter storage unit 11 holds the band suppression filters of a plurality of types of shape types as shown in FIGS. 2A to 2E, and further, the bands of a plurality of band types are held for each shape type. When the suppression filter is held, the shape of the reference band suppression filter of a certain band type is changed for each shape type, and the shape of the band suppression filter of a plurality of band types having the different bandwidth is changed. Be changed. In this case, as shown in FIG. 9, there is room for the listener to select the shape type and band type on the external interface.

ただし、本分析処理によって、入力信号（入力される後方チャンネル音声信号Ｘｍ）に応じた或る帯域種別の基準の帯域抑圧フィルタを変更可能となるため、外部インターフェースで受聴者が形状種別・帯域種別に関する選択するよう構成していなくても、高精度に、多くの受聴者が後方に音像を知覚できるようになる。 However, since this analysis processing makes it possible to change the reference band suppression filter of a certain band type according to the input signal (input rear channel audio signal Xm), the listener can use the external interface to change the shape type / band type. Many listeners will be able to perceive the sound image backwards with high accuracy, even if they are not configured to make choices about.

そして、帯域抑圧フィルタ記憶部１１に保持される図２（ａ）乃至（ｅ）に示すような複数種の形状種別の帯域抑圧フィルタの変更処理によって、入力信号に応じて動的に更新されるようになる。 Then, it is dynamically updated according to the input signal by the change processing of the band suppression filters of a plurality of types of shape types as shown in FIGS. 2 (a) to 2 (e) held in the band suppression filter storage unit 11. Will be.

また、各実施形態の音声信号変換装置１０をコンピュータとして機能させることができ、当該コンピュータに、本発明に係る各構成要素を実現させるためのプログラムは、当該コンピュータの内部又は外部に備えられるメモリに記憶される。コンピュータに備えられる中央演算処理装置（ＣＰＵ）などの制御で、各構成要素の機能を実現するための処理内容が記述されたプログラムを、適宜、メモリから読み込んで、各実施形態の音声信号変換装置１０の各構成要素の機能をコンピュータに実現させることができる。ここで、各構成要素の機能をハードウェアの一部で実現してもよい。 Further, the audio signal conversion device 10 of each embodiment can be made to function as a computer, and a program for realizing each component according to the present invention in the computer is stored in a memory provided inside or outside the computer. It will be remembered. A program in which processing contents for realizing the functions of each component are appropriately read from a memory under the control of a central processing unit (CPU) provided in a computer, and the audio signal conversion device of each embodiment is read as appropriate. The functions of each of the ten components can be realized in a computer. Here, the function of each component may be realized by a part of hardware.

以上、特定の実施形態の例を挙げて本発明を説明したが、本発明は前述の実施形態の例に限定されるものではなく、その技術思想を逸脱しない範囲で種々変形可能である。例えば、上述した各実施形態の例では、５ｃｈ（又は５．１ｃｈ）のマルチチャンネル音響の音声信号を入力し変換する例を説明したが、２２．２ｃｈ，１０．２ｃｈ，１１．１ｃｈ，７．１ｃｈ等のマルチチャンネル音響の音声信号を入力し後方感強調処理により後方チャンネル音声信号を側方又は斜め前方のチャンネルの音声信号へ後方感強調処理を施して変換するよう構成することができる。尚、２２．２ｃｈ等の複数階層のマルチチャンネル音響の音声信号を入力し複数階層の出力チャンネルの音声信号へ後方感強調処理により変換する際には、対応する階層内で後方感強調処理を施すように構成する。これにより、上層部後方と中層部後方の音像を分けて知覚できるようにすることができる。 Although the present invention has been described above with reference to examples of specific embodiments, the present invention is not limited to the examples of the above-described embodiments, and various modifications can be made without departing from the technical idea. For example, in the example of each of the above-described embodiments, an example of inputting and converting an audio signal of 5ch (or 5.1ch) multi-channel audio has been described, but 22.2ch, 10.2ch, 11.1ch, 7. It is possible to input an audio signal of a multi-channel sound such as 1ch and convert the rear channel audio signal into a lateral or diagonally forward channel audio signal by performing the rear feeling enhancement processing. When inputting a multi-layer multi-channel acoustic audio signal such as 22.2ch and converting it into an audio signal of a multi-layer output channel by backward feeling enhancement processing, the backward feeling enhancement processing is performed in the corresponding layer. It is configured as follows. As a result, the sound images behind the upper layer and the rear of the middle layer can be perceived separately.

また、出力チャンネルも上述した実施形態の例に限らず、種々のスピーカ配置に適用することができる。例えば、本発明におけるマルチチャンネル音響の音声信号変換装置は、特許文献１に開示されるディスプレイの周囲に配置された複数のスピーカによりマルチチャンネル音響再生を可能とする前方スピーカ装置や、マルチチャンネル音響を再生可能な複数のスピーカを棒状に配置した前方スピーカ装置に対しても、多くの受聴者が後方に音像を知覚できるようになる。 Further, the output channel is not limited to the example of the above-described embodiment, and can be applied to various speaker arrangements. For example, the audio signal conversion device for multi-channel sound in the present invention includes a front speaker device that enables multi-channel sound reproduction by a plurality of speakers arranged around a display disclosed in Patent Document 1, and multi-channel sound. Even for a front speaker device in which a plurality of reproducible speakers are arranged in a rod shape, many listeners can perceive a sound image in the rear.

また、上述した各実施形態の音声信号変換装置では、所謂クロストークキャンセルを行うことなく、後方感強調効果を実現することができるが、クロストークキャンセルを行う再生機と組み合わせてもよい。更に、クロストークキャンセル処理は、後方感強調部の前段又は後段に設けることもできる。従って、本発明に係る音声信号変換装置は、上述した実施形態の例に限定されるものではなく、特許請求の範囲の記載によってのみ制限される。 Further, in the audio signal conversion device of each of the above-described embodiments, the effect of enhancing the rearward feeling can be realized without performing so-called crosstalk cancellation, but it may be combined with a player that performs crosstalk cancellation. Further, the crosstalk canceling process can be provided in the front stage or the rear stage of the rear feeling emphasizing portion. Therefore, the audio signal conversion device according to the present invention is not limited to the example of the above-described embodiment, but is limited only by the description of the claims.

本発明によれば、後方にスピーカが設置できない場合でも、後方のスピーカから再生することを想定した音声信号を、受聴者本人の頭部伝達関数（ＨＲＴＦ）を測定することなく、多くの人が後方に音像を知覚できるように再生することができるので、後方のチャンネルを含むマルチチャンネル音響の音声信号を実配置されるスピーカに応じて信号変換する用途に有用である。 According to the present invention, even if a speaker cannot be installed in the rear, many people can obtain an audio signal that is supposed to be reproduced from the rear speaker without measuring the head related transfer function (HRTF) of the listener himself / herself. Since the sound image can be reproduced so that it can be perceived in the rear, it is useful for signal conversion of the audio signal of the multi-channel sound including the rear channel according to the speaker actually arranged.

１０音声信号変換装置
１１帯域抑圧フィルタ記憶部
１２後方感強調処理部
１３再生信号生成部
１４分析部
１００音声信号供給装置
２００５ｃｈのマルチチャンネル音響システム
３００５ｃｈの変形マルチチャンネル音響システム
４００３ｃｈの変形マルチチャンネル音響システム 10 Audio signal conversion device 11 Band suppression filter storage unit 12 Rear feeling enhancement processing unit 13 Reproduction signal generation unit 14 Analysis unit 100 Audio signal supply device 200 5ch multi-channel audio system 300 5ch modified multi-channel acoustic system 400 3ch modified multi Channel sound system

Claims

後方のチャンネルを含むマルチチャンネル音響の音声信号に対して信号変換を行う音声信号変換装置であって、
可聴周波数帯域内の１つ又は複数の帯域を抑圧する帯域抑圧フィルタを記憶する帯域抑圧フィルタ記憶部と、
前記マルチチャンネル音響の音声信号のうち後方のチャンネルの音声信号に対し、前記帯域抑圧フィルタを適用することにより、後方感強調処理を施した音響信号を生成する後方感強調処理部と、を備え、
前記マルチチャンネル音響の音声信号のうち後方に設置されるスピーカから再生することが想定されていたチャンネルの音声信号について周波数スペクトルを分析し、この分析結果を基に前記帯域抑圧フィルタ記憶部に記憶されている帯域抑圧フィルタの形状及び抑圧帯域のいずれか一方又は双方を自動的に変更可能にする分析部を更に備えることを特徴とする、マルチチャンネル音響の音声信号変換装置。 An audio signal conversion device that performs signal conversion for multi-channel acoustic audio signals including the rear channel.
A band suppression filter storage unit that stores a band suppression filter that suppresses one or more bands in the audible frequency band, and a band suppression filter storage unit.
A rear feeling enhancement processing unit that generates an acoustic signal to which the rear feeling enhancement processing is applied by applying the band suppression filter to the sound signal of the rear channel among the audio signals of the multi-channel sound is provided.
The frequency spectrum of the audio signal of the channel that was supposed to be reproduced from the speaker installed behind the audio signal of the multi-channel sound is analyzed, and the frequency spectrum is stored in the band suppression filter storage unit based on the analysis result. A multi-channel audio audio signal conversion device further comprising an analysis unit that can automatically change one or both of the shape of the band suppression filter and the suppression band.

前記後方感強調処理を施した音響信号は、受聴者の側方又は斜め前方に実配置されるスピーカから再生して後方に音像を知覚可能とすることを特徴とする、請求項１に記載のマルチチャンネル音響の音声信号変換装置。 The first aspect of the present invention, wherein the acoustic signal subjected to the rearward feeling enhancement processing is reproduced from a speaker actually arranged on the side or diagonally forward of the listener so that the sound image can be perceived rearward. Multi-channel acoustic audio signal converter.

前記マルチチャンネル音響の音声信号のうち後方以外のチャンネルの音声信号と、前記後方感強調処理を施した音声信号とを重み付け加算により合成し、実配置されるスピーカから再生するための再生信号を生成する再生信号生成部を更に備えることを特徴とする、請求項１又は２に記載のマルチチャンネル音響の音声信号変換装置。 Of the multi-channel audio signals, the audio signals of channels other than the rear and the audio signals subjected to the rear feeling enhancement processing are combined by weighting addition to generate a reproduction signal for reproduction from the actually arranged speaker. The audio signal conversion device for multi-channel audio according to claim 1 or 2, further comprising a reproduction signal generation unit.

前記再生信号生成部は、前記マルチチャンネル音響の音声信号のうち左右後方のチャンネルの音声信号に対して、左右に異なる信号レベルで前記後方感強調処理を施した音声信号と重み付け加算により合成し、側方もしくは斜め前方の実配置のスピーカから再生するための再生信号を生成することを特徴とする、請求項３に記載のマルチチャンネル音響の音声信号変換装置。 The reproduction signal generation unit synthesizes the audio signals of the left and right rear channels of the multi-channel acoustic audio signals with the audio signals subjected to the rear feeling enhancement processing at different signal levels on the left and right by weighting addition. The audio signal conversion device for multi-channel audio according to claim 3, wherein a reproduction signal for reproduction is generated from a speaker arranged in an actual arrangement on the side or diagonally forward.

前記後方感強調処理部は、前記マルチチャンネル音響の音声信号のうち後方のチャンネルが複数ある際に、該チャンネル毎に異なる帯域抑圧フィルタを適用可能に構成されていることを特徴とする、請求項１から４のいずれか一項に記載のマルチチャンネル音響の音声信号変換装置。 The rear feeling enhancement processing unit is characterized in that, when there are a plurality of rear channels in the audio signal of the multi-channel acoustic, a different band suppression filter can be applied to each channel. The audio signal conversion device for multi-channel acoustics according to any one of 1 to 4.

前記後方感強調処理部は、当該帯域抑圧フィルタについて実配置されるスピーカに応じて選択変更可能に構成されていることを特徴とする、請求項１から５のいずれか一項に記載のマルチチャンネル音響の音声信号変換装置。 The multi-channel according to any one of claims 1 to 5, wherein the rear feeling enhancement processing unit is configured to be selectable and changeable according to a speaker actually arranged for the band suppression filter. Acoustic audio signal converter.

前記分析部は、前記分析結果を基に前記帯域抑圧フィルタ記憶部に記憶されている複数の帯域抑圧フィルタのうち前記後方感強調処理部によって使用する帯域抑圧フィルタを自動的に選択設定可能に構成されていることを特徴とする、請求項１から６のいずれか一項に記載のマルチチャンネル音響の音声信号変換装置。 Based on the analysis result, the analysis unit can automatically select and set the band suppression filter used by the rear feeling enhancement processing unit from among the plurality of band suppression filters stored in the band suppression filter storage unit. The audio signal conversion device for multi-channel acoustics according to any one of claims 1 to 6, wherein the device is characterized by the above.

前記帯域抑圧フィルタ記憶部は、予め定めた抑圧量で略２ｋＨｚから略８ｋＨｚの間で可変帯域となる複数種の帯域抑圧フィルタを記憶していることを特徴とする、請求項１から７のいずれか一項に記載のマルチチャンネル音響の音声信号変換装置。 Any of claims 1 to 7, wherein the band suppression filter storage unit stores a plurality of types of band suppression filters having a variable band between about 2 kHz and about 8 kHz with a predetermined suppression amount. The multi-channel audio audio signal converter according to item 1.

前記帯域抑圧フィルタ記憶部は、当該マルチチャンネル音響の音声信号における後方のチャンネルの音声信号用の頭部伝達関数の４ｋＨｚ以上の周波数帯域の中で最も低い周波数のノッチ周波数を抑圧可能とする上側遮断周波数を持つ帯域抑圧フィルタを、前記後方感強調処理部によって使用する帯域抑圧フィルタとして定め記憶していることを特徴とする、請求項１から８のいずれか一項に記載のマルチチャンネル音響の音声信号変換装置。 The band suppression filter storage unit is an upper cutoff capable of suppressing the notch frequency of the lowest frequency in the frequency band of 4 kHz or more of the head transmission function for the audio signal of the rear channel in the audio signal of the multi-channel acoustic. The voice of the multi-channel acoustic according to any one of claims 1 to 8, wherein the band suppression filter having a frequency is defined and stored as the band suppression filter used by the rear feeling enhancement processing unit. Signal converter.

コンピュータを、請求項１から９のいずれか一項に記載のマルチチャンネル音響の音声信号変換装置として機能させるためのプログラム。 A program for causing a computer to function as an audio signal converter for multi-channel acoustics according to any one of claims 1 to 9.