JP2014178578A

JP2014178578A - Sound processor

Info

Publication number: JP2014178578A
Application number: JP2013053485A
Authority: JP
Inventors: Janner Geordi; ジェイナージョルディ; Marxer Ricardo; マークサーリカルド; Bonada Jordi; ボナダジョルディ; Kazunobu Kondo; 多伸近藤
Original assignee: Yamaha Corp
Current assignee: Yamaha Corp
Priority date: 2013-03-15
Filing date: 2013-03-15
Publication date: 2014-09-25

Abstract

PROBLEM TO BE SOLVED: To provide a sound processor capable of preventing a residual component out of a sound component to be suppressed from auditorily being perceived.SOLUTION: A separation filter formation part 32 forms a separation filter G (τ) having a coefficient value g (k,τ) of each frequency set so as to suppress a specific component of a sound signal for each unit period overlapping each other on a time base. A phase shift filter formation part 34 forms a phase shift filter P (τ) varying only the amount of phase shift θ (k,τ) corresponding to a coefficient value g (k,τ) of the frequency out of the separation filter G (τ) and a variation n (k,τ) varied for each unit period, for each unit period.

Description

本発明は、複数の音響成分が混合された音響信号から特定の音響成分を分離（抽出または抑圧）する技術に関する。 The present invention relates to a technique for separating (extracting or suppressing) a specific acoustic component from an acoustic signal in which a plurality of acoustic components are mixed.

音響信号の目的音成分と非目的音成分とが混在した音響信号について非目的音成分を抑圧（各成分を分離）するための各種の技術が従来から提案されている。例えば特許文献１には、音響信号の単位期間（フレーム）毎のスペクトルに抑圧係数列（例えばウィナーフィルタ）を作用させることで音響信号のうち目的音成分以外の非目的音成分を抑圧（目的音成分を強調）する技術が開示されている。 Various techniques for suppressing non-target sound components (separating each component) have been proposed in the past for acoustic signals in which target sound components and non-target sound components are mixed. For example, Patent Document 1 discloses that a non-target sound component other than a target sound component is suppressed (target sound) in an acoustic signal by applying a suppression coefficient sequence (for example, a Wiener filter) to a spectrum for each unit period (frame) of the sound signal. Techniques for emphasizing ingredients are disclosed.

特開２０１２−１１３１９０公報JP2012-113190A

しかし、非目的音成分を確実に抑圧し得る完全な処理係数列を推定することは現実には困難であるから、処理係数列を適用した処理後の音響信号には非目的音成分の一部の成分（以下「残存成分」という）が残存し得る。以上の事情を背景として、本発明は、抑圧対象の音響成分のうちの残存成分を聴感的に知覚され難くすることを目的とする。 However, since it is actually difficult to estimate a complete processing coefficient sequence that can reliably suppress the non-target sound component, a part of the non-target sound component is included in the processed acoustic signal to which the processing coefficient sequence is applied. May remain (hereinafter referred to as “residual component”). In view of the above circumstances, an object of the present invention is to make it difficult to perceptually perceive residual components among acoustic components to be suppressed.

以上の課題を解決するために、本発明の音響処理装置は、音響信号の特定成分が抑圧されるように複数の周波数の各々の係数値（例えば係数値ｇ(k,τ)）が設定された分離フィルタを、時間軸上で相互に重複する単位期間毎に生成する分離フィルタ生成手段と、複数の周波数の各々における位相を、分離フィルタのうち当該周波数の係数値と単位期間毎に変動する変動値（例えば変動値ｎ(k,τ)）とに応じた移相量だけ変化させる移相フィルタを、単位期間毎に生成する移相フィルタ生成手段とを具備する。例えば、移相量は、各周波数の係数値が特定成分を抑圧する度合が大きいほど大きい数値となる制御値（例えば制御値Ｆ(k,τ)）と変動値との乗算に応じて設定される。 In order to solve the above problems, in the sound processing apparatus of the present invention, coefficient values (for example, coefficient values g (k, τ)) of a plurality of frequencies are set so that specific components of the sound signal are suppressed. The separation filter generating means for generating the separation filter for each unit period that overlaps each other on the time axis, and the phase of each of the plurality of frequencies are varied for each coefficient period of the separation filter and for each unit period. Phase shift filter generation means for generating a phase shift filter that changes by a phase shift amount corresponding to a fluctuation value (for example, fluctuation value n (k, τ)) for each unit period. For example, the amount of phase shift is set according to multiplication of a control value (for example, control value F (k, τ)) and a fluctuation value that becomes a larger numerical value as the degree to which the coefficient value of each frequency suppresses the specific component is larger. The

以上の構成では、音響信号の複数の周波数の各々の移相量が、音響信号の特定成分を抑圧する分離フィルタのうち当該周波数の係数値と単位期間毎に変動する変動値とに応じて設定される。すなわち、音響信号の各周波数の移相量に対する変動値の変動の影響（移相量の変動の度合）が分離フィルタの係数値に応じて周波数毎に制御される。したがって、例えば、音響信号のうち特定成分が優勢な周波数（例えば分離フィルタの係数値が小さい周波数）に対する移相量を、音響信号のうち特定成分以外の音響成分が優勢な周波数に対する移相量と比較して大きく変動させることが可能である。各単位期間は時間軸上で相互に重複する（すなわち、音響信号の各周波数の成分が複数の単位期間にわたる）から、音響信号のうち単位期間毎の移相量の変動が大きい周波数の成分ほど時間軸上の広範囲に分散される。特定成分のエネルギーの最大値は、各周波数成分が時間軸上の広範囲に分散されるほど小さくなる。したがって、以上の構成によれば、分離フィルタの作用の前後で各周波数の位相が維持される構成と比較して、特定成分のうち分離フィルタの係数値では完全には抑圧されない残存成分を聴感的に知覚され難くすることが可能である。 In the above configuration, the phase shift amount of each of the plurality of frequencies of the acoustic signal is set according to the coefficient value of the frequency and the fluctuation value that varies for each unit period among the separation filters that suppress the specific component of the acoustic signal. Is done. That is, the influence of the fluctuation of the fluctuation value on the phase shift amount of each frequency of the acoustic signal (the degree of fluctuation of the phase shift quantity) is controlled for each frequency according to the coefficient value of the separation filter. Therefore, for example, the phase shift amount for a frequency where a specific component is dominant in the acoustic signal (for example, the frequency where the coefficient value of the separation filter is small) is the phase shift amount for a frequency where an acoustic component other than the specific component is dominant in the acoustic signal. It is possible to make a large fluctuation in comparison. Since each unit period overlaps with each other on the time axis (that is, each frequency component of the acoustic signal extends over a plurality of unit periods), the component of the acoustic signal whose frequency shift amount is large for each unit period. Widely distributed on the time axis. The maximum value of the energy of the specific component becomes smaller as each frequency component is dispersed over a wide range on the time axis. Therefore, according to the above configuration, compared to the configuration in which the phase of each frequency is maintained before and after the operation of the separation filter, the remaining components that are not completely suppressed by the coefficient value of the separation filter among the specific components are audibly heard. It is possible to make it difficult to perceive.

本発明の好適な態様において、変動値は、複数の周波数の各々について個別に設定される。以上の構成によれば、複数の周波数にわたる残存成分を効果的に知覚され難くすることが可能である。変動値の好適例は、単位期間毎に生成された乱数（例えば一様乱数）である。以上の構成によれば、乱数を発生させる簡便な構成により、残存成分を時間軸上に効果的に分散させることが可能である。 In a preferred aspect of the present invention, the variation value is individually set for each of the plurality of frequencies. According to the above configuration, it is possible to make it difficult to effectively perceive residual components over a plurality of frequencies. A suitable example of the fluctuation value is a random number (for example, a uniform random number) generated for each unit period. According to the above configuration, it is possible to effectively disperse the remaining components on the time axis with a simple configuration for generating random numbers.

本発明の好適な態様に係る音響処理装置は、分離フィルタと移相フィルタとを音響信号に作用させる信号処理手段を具備する。以上の構成によれば、分離フィルタと移相フィルタとを作用させた音響信号を生成することが可能である。 The acoustic processing apparatus according to a preferred aspect of the present invention includes signal processing means for causing the separation filter and the phase shift filter to act on the acoustic signal. According to the above configuration, it is possible to generate an acoustic signal in which the separation filter and the phase shift filter are operated.

以上の各態様に係る音響処理装置は、音響信号の処理に専用されるＤＳＰ（Digital Signal Processor）などのハードウェア（電子回路）によって実現されるほか、ＣＰＵ（Central Processing Unit）等の汎用の演算処理装置とプログラムとの協働によっても実現される。具体的には、本発明のプログラムは、音響信号の特定成分が抑圧されるように複数の周波数の各々の係数値が設定された分離フィルタを、時間軸上で相互に重複する単位期間毎に生成する分離フィルタ生成処理と、複数の周波数の各々における位相を、分離フィルタのうち当該周波数の係数値と単位期間毎に変動する変動値とに応じた移相量だけ変化させる移相フィルタを、単位期間毎に生成する移相フィルタ生成処理とをコンピュータに実行させる。本発明のプログラムは、コンピュータが読取可能な記録媒体に格納された形態で提供されてコンピュータにインストールされ得る。記録媒体は、例えば非一過性（non-transitory）の記録媒体であり、ＣＤ-ＲＯＭ等の光学式記録媒体（光ディスク）が好例であるが、半導体記録媒体や磁気記録媒体等の公知の任意の形式の記録媒体を包含し得る。また、例えば、本発明のプログラムは、通信網を介した配信の形態で提供されてコンピュータにインストールされ得る。 The sound processing apparatus according to each of the above aspects is realized by hardware (electronic circuit) such as a DSP (Digital Signal Processor) dedicated to processing of an acoustic signal, or a general-purpose operation such as a CPU (Central Processing Unit). This is also realized by cooperation between the processing device and the program. Specifically, the program of the present invention performs separation filters in which coefficient values of a plurality of frequencies are set so that specific components of an acoustic signal are suppressed for each unit period that overlaps each other on the time axis. A separation filter generation process to generate, and a phase shift filter that changes a phase at each of a plurality of frequencies by a phase shift amount corresponding to a coefficient value of the frequency and a variation value that varies for each unit period of the separation filter, A computer is caused to execute a phase shift filter generation process generated for each unit period. The program of the present invention can be provided in a form stored in a computer-readable recording medium and installed in the computer. The recording medium is, for example, a non-transitory recording medium, and an optical recording medium (optical disk) such as a CD-ROM is a good example, but a known arbitrary one such as a semiconductor recording medium or a magnetic recording medium This type of recording medium can be included. For example, the program of the present invention can be provided in the form of distribution via a communication network and installed in a computer.

本発明の第１実施形態に係る音響処理装置のブロック図である。1 is a block diagram of a sound processing apparatus according to a first embodiment of the present invention. フィルタ生成部の具体的な構成のブロック図である。It is a block diagram of the concrete structure of a filter production | generation part.

＜第１実施形態＞
図１は、本発明の第１実施形態に係る音響処理装置１００のブロック図である。図１に示すように、音響処理装置１００には信号供給装置１２と放音装置１４とが接続される。信号供給装置１２は、音響信号ＳXを音響処理装置１００に供給する。音響信号ＳXは、複数の音響成分（例えば楽音や音声）の混合音の波形を示す時間領域信号である。例えば、複数種の楽器の演奏音（歌唱音等の音声も含意する）の混合音を示す音響信号ＳXが音響処理装置１００に供給される。周囲の音響を収音して音響信号ＳXを生成する収音機器や、可搬型または内蔵型の記録媒体から音響信号ＳXを取得して音響処理装置１００に供給する再生装置や、通信網から音響信号ＳXを受信して音響処理装置１００に供給する通信装置が信号供給装置１２として採用され得る。 <First Embodiment>
FIG. 1 is a block diagram of a sound processing apparatus 100 according to the first embodiment of the present invention. As shown in FIG. 1, a signal supply device 12 and a sound emitting device 14 are connected to the sound processing device 100. The signal supply device 12 supplies the acoustic signal SX to the acoustic processing device 100. The acoustic signal SX is a time domain signal indicating a waveform of a mixed sound of a plurality of acoustic components (for example, musical sound and voice). For example, an acoustic signal SX indicating a mixed sound of performance sounds of multiple types of musical instruments (which also imply voices such as singing sounds) is supplied to the acoustic processing device 100. A sound collecting device that collects ambient sounds to generate an acoustic signal SX, a playback device that acquires the acoustic signal SX from a portable or built-in recording medium and supplies the acoustic signal SX to the acoustic processing device 100, or an acoustic from a communication network A communication device that receives the signal SX and supplies the signal SX to the sound processing device 100 may be employed as the signal supply device 12.

第１実施形態の音響処理装置１００は、信号供給装置１２から供給される音響信号ＳXに対する音響処理で音響信号ＳYを生成する信号処理装置である。音響信号ＳYは、音響信号ＳXに包含される複数の音響成分のうち１以上の特定の音響成分（以下「第１音響成分」という）を抽出した音響の波形を示す時間領域信号である。すなわち、音響処理装置１００は、音響信号ＳXのうち第１音響成分以外の１以上の音響成分（以下「第２音響成分」という）を抑圧することで音響信号ＳYを生成する。第１音響成分は、音響信号ＳXのうち強調対象となる目的音成分とも換言され、第２音響成分は、音響信号ＳXのうち抑圧対象となる非目的音成分とも換言される。放音装置１４（例えばスピーカやヘッドホン）は、音響処理装置１００から供給される音響信号ＳYに応じた音波を放射する。なお、音響信号ＳYをデジタルからアナログに変換するＤ/Ａ変換器の図示は便宜的に省略した。 The acoustic processing device 100 according to the first embodiment is a signal processing device that generates an acoustic signal SY by acoustic processing on the acoustic signal SX supplied from the signal supply device 12. The acoustic signal SY is a time domain signal indicating an acoustic waveform obtained by extracting one or more specific acoustic components (hereinafter referred to as “first acoustic component”) from among a plurality of acoustic components included in the acoustic signal SX. That is, the sound processing apparatus 100 generates the sound signal SY by suppressing one or more sound components (hereinafter referred to as “second sound component”) other than the first sound component in the sound signal SX. The first acoustic component is also referred to as a target sound component to be emphasized in the acoustic signal SX, and the second acoustic component is also referred to as a non-target sound component to be suppressed in the acoustic signal SX. The sound emitting device 14 (for example, a speaker or a headphone) emits a sound wave corresponding to the acoustic signal SY supplied from the acoustic processing device 100. The D / A converter that converts the acoustic signal SY from digital to analog is not shown for convenience.

図１に示すように、音響処理装置１００は、演算処理装置２２と記憶装置２４とを具備するコンピュータシステムで実現される。記憶装置２４は、演算処理装置２２が実行するプログラムや演算処理装置２２が使用する各種のデータを記憶する。半導体記録媒体や磁気記録媒体等の公知の記録媒体や複数種の記録媒体の組合せが記憶装置２４として任意に採用され得る。音響信号ＳXを記憶装置２４に記憶した構成（したがって信号供給装置１２は省略され得る）も好適である。 As shown in FIG. 1, the sound processing device 100 is realized by a computer system including an arithmetic processing device 22 and a storage device 24. The storage device 24 stores a program executed by the arithmetic processing device 22 and various data used by the arithmetic processing device 22. A known recording medium such as a semiconductor recording medium or a magnetic recording medium or a combination of a plurality of types of recording media can be arbitrarily employed as the storage device 24. A configuration in which the acoustic signal SX is stored in the storage device 24 (therefore, the signal supply device 12 can be omitted) is also suitable.

図１の演算処理装置２２は、記憶装置２４に記憶されたプログラムを実行することで、音響信号ＳXから音響信号ＳYを生成するための複数の機能（周波数分析部２０，フィルタ生成部３０，信号処理部４０，波形生成部５０）を実現する。なお、演算処理装置２２の各機能を複数の集積回路に分散した構成や、専用の電子回路（例えばＤＳＰ）が一部の機能を実現する構成も採用され得る。 The arithmetic processing unit 22 in FIG. 1 executes a program stored in the storage device 24 to thereby generate a plurality of functions (frequency analysis unit 20, filter generation unit 30, signal) for generating the acoustic signal SY from the acoustic signal SX. A processing unit 40 and a waveform generation unit 50) are realized. A configuration in which each function of the arithmetic processing unit 22 is distributed over a plurality of integrated circuits, or a configuration in which a dedicated electronic circuit (for example, a DSP) realizes a part of the functions may be employed.

図１の周波数分析部２０は、音響信号ＳXを時間軸上で区分した複数の単位期間（フレーム）の各々についてスペクトル（複素スペクトル）ＱX(τ)を算定する。任意の１個の単位期間のスペクトルＱX(τ)は、周波数軸上の相異なる周波数に対応する複数の周波数成分（複素数）Ｘ(k,τ)の系列である。記号kは、周波数軸上の任意の１個の周波数（周波数帯域）を指示する変数であり、記号τは、時間軸上の複数の単位期間の何れかを指示する変数である。相前後する各単位期間は時間軸上で相互に部分的に重複する。スペクトルＱX(τ)の生成には例えば短時間フーリエ変換等の公知の周波数分析が任意に利用される。なお、通過帯域が相違する複数の帯域通過フィルタで構成されるフィルタバンクを周波数分析部２０として採用することも可能である。また、音響信号ＳXを記憶装置２４に記憶した構成では、記憶装置２４から周波数分析部２０に音響信号ＳXが供給される。 The frequency analysis unit 20 in FIG. 1 calculates a spectrum (complex spectrum) QX (τ) for each of a plurality of unit periods (frames) obtained by dividing the acoustic signal SX on the time axis. The spectrum QX (τ) of any one unit period is a series of a plurality of frequency components (complex numbers) X (k, τ) corresponding to different frequencies on the frequency axis. The symbol k is a variable indicating any one frequency (frequency band) on the frequency axis, and the symbol τ is a variable indicating any of a plurality of unit periods on the time axis. Each successive unit period partially overlaps with each other on the time axis. For the generation of the spectrum QX (τ), a known frequency analysis such as a short-time Fourier transform is arbitrarily used. Note that a filter bank composed of a plurality of bandpass filters having different passbands may be employed as the frequency analysis unit 20. In the configuration in which the acoustic signal SX is stored in the storage device 24, the acoustic signal SX is supplied from the storage device 24 to the frequency analysis unit 20.

図１のフィルタ生成部３０は、処理フィルタＭ(τ)を単位期間毎に順次に生成する。本実施形態の処理フィルタＭ(τ)は、音響信号ＳX（スペクトルＱX(τ)）の第２音響成分を抑圧するためのフィルタであり、相異なる周波数（または周波数帯域）に対応する複数の係数値ｍ(k,τ)の系列である。 The filter generation unit 30 in FIG. 1 sequentially generates the processing filter M (τ) for each unit period. The processing filter M (τ) of this embodiment is a filter for suppressing the second acoustic component of the acoustic signal SX (spectrum QX (τ)), and has a plurality of factors corresponding to different frequencies (or frequency bands). It is a series of numerical values m (k, τ).

信号処理部４０は、音響信号ＳXの各単位期間のスペクトルＱX(τ)に当該単位期間の処理フィルタＭ(τ)を作用させることで、音響信号ＳYのスペクトル（複素スペクトル）ＱY(τ)を単位期間毎に順次に生成する。具体的には、信号処理部４０は、以下の数式(1)で表現される通り、音響信号ＳXの各単位期間の周波数成分Ｘ(k,τ)に対し、当該単位期間についてフィルタ生成部３０が生成した処理フィルタＭ(τ)のうち当該周波数成分Ｘ(k,τ)と共通の周波数に対応する係数値ｍ(k,τ)を乗算することで、音響信号ＳYのスペクトルＱY(τ)の周波数成分Ｙ(k,τ)を算定する。

The signal processing unit 40 applies the processing filter M (τ) of the unit period to the spectrum QX (τ) of each unit period of the acoustic signal SX, thereby obtaining the spectrum (complex spectrum) QY (τ) of the acoustic signal SY. Generated sequentially for each unit period. Specifically, as represented by the following formula (1), the signal processing unit 40 performs the filter generation unit 30 for the unit period for the frequency component X (k, τ) of each unit period of the acoustic signal SX. Is multiplied by a coefficient value m (k, τ) corresponding to a common frequency with the frequency component X (k, τ) of the processing filter M (τ) generated by the signal SY. The frequency component Y (k, τ) of is calculated.

波形生成部５０は、信号処理部４０が単位期間毎に生成するスペクトルＱY(τ)から時間領域の音響信号ＳYを生成する。具体的には、波形生成部５０は、各単位期間のスペクトルＱY(τ)を例えば短時間逆フーリエ変換で時間領域の信号に変換し、相前後する各単位期間の信号を時間軸上で相互に重複させた状態で加算することにより音響信号ＳYを生成する。波形生成部５０が生成した音響信号ＳYが放音装置１４に供給されることで音響として放射される。 The waveform generator 50 generates a time domain acoustic signal SY from the spectrum QY (τ) generated by the signal processor 40 for each unit period. Specifically, the waveform generation unit 50 converts the spectrum QY (τ) of each unit period into a time domain signal by, for example, short-time inverse Fourier transform, and mutually converts the signal of each unit period on the time axis. The sound signal SY is generated by adding the signals in a state where they are overlapped with each other. The acoustic signal SY generated by the waveform generation unit 50 is supplied to the sound emitting device 14 and is emitted as sound.

第１実施形態のフィルタ生成部３０が生成する処理フィルタＭ(τ)は、分離フィルタＧ(τ)と移相フィルタＰ(τ)とで構成される。分離フィルタＧ(τ)は、音響信号ＳXの各周波数成分Ｘ(k,τ)の強度を変調するフィルタであり、相異なる周波数に対応する複数の係数値ｇ(k,τ)の系列である。また、移相フィルタＰ(τ)は、音響信号ＳXの各周波数成分Ｘ(k,τ)の位相を変調するフィルタであり、相異なる周波数に対応する複数の係数値ｐ(k,τ)の系列である。処理フィルタＭ(τ)の各係数値ｍ(k,τ)は、以下の数式(2)で表現される通り、分離フィルタＧ(τ)の係数値ｇ(k,τ)と移相フィルタＰ(τ)の係数値ｐ(k,τ)との乗算値に相当する。

The processing filter M (τ) generated by the filter generation unit 30 of the first embodiment includes a separation filter G (τ) and a phase shift filter P (τ). The separation filter G (τ) is a filter that modulates the intensity of each frequency component X (k, τ) of the acoustic signal SX, and is a series of a plurality of coefficient values g (k, τ) corresponding to different frequencies. . The phase shift filter P (τ) is a filter that modulates the phase of each frequency component X (k, τ) of the acoustic signal SX, and has a plurality of coefficient values p (k, τ) corresponding to different frequencies. It is a series. Each coefficient value m (k, τ) of the processing filter M (τ) is expressed by the following equation (2), and the coefficient value g (k, τ) of the separation filter G (τ) and the phase shift filter P This corresponds to a multiplication value of the coefficient value p (k, τ) of (τ).

＜フィルタ生成部３０＞
図２は、フィルタ生成部３０の具体的な構成のブロック図である。図２に示すように、フィルタ生成部３０は、分離フィルタ生成部３２と移相フィルタ生成部３４と変動値生成部３６とを含んで構成される。分離フィルタ生成部３２は分離フィルタＧ(τ)を生成し、移相フィルタ生成部３４は移相フィルタＰ(τ)を生成する。数式(2)を参照して前述した通り、分離フィルタ生成部３２が生成した分離フィルタＧ(τ)と移相フィルタ生成部３４が生成した移相フィルタＰ(τ)とに応じた処理フィルタＭ(τ)が信号処理部４０による第２音響成分の抑圧（第１音響成分の強調）に適用される。 <Filter generation unit 30>
FIG. 2 is a block diagram of a specific configuration of the filter generation unit 30. As shown in FIG. 2, the filter generation unit 30 includes a separation filter generation unit 32, a phase shift filter generation unit 34, and a fluctuation value generation unit 36. The separation filter generation unit 32 generates a separation filter G (τ), and the phase shift filter generation unit 34 generates a phase shift filter P (τ). As described above with reference to Equation (2), the processing filter M according to the separation filter G (τ) generated by the separation filter generation unit 32 and the phase shift filter P (τ) generated by the phase shift filter generation unit 34. (τ) is applied to suppression of the second acoustic component (emphasis of the first acoustic component) by the signal processing unit 40.

分離フィルタ生成部３２は、分離フィルタＧ(τ)を単位期間毎に順次に生成する。本実施形態の分離フィルタ生成部３２は、音響信号ＳXの第２音響成分が抑圧されるように分離フィルタＧ(τ)の各係数値ｇ(k,τ)を設定する。具体的には、分離フィルタＧ(τ)は、各係数値ｇ(k,τ)が以下の数式(3)で表現されるウィナー（Wiener）フィルタである。

数式(3)の記号|Ｘ(k,τ)|²は周波数成分Ｘ(k,τ)のパワーであり、記号|Ｅ(k,τ)|²は、音響信号ＳXから推定される第１音響成分のパワーである。第１音響成分のパワー|Ｅ(k,τ)|²の推定には公知の技術が任意に採用される。数式(1)および数式(2)から理解される通り、係数値ｇ(k,τ)は、周波数成分Ｘ(k,τ)の強度に作用し、周波数成分Ｘ(k,τ)の利得（スペクトルゲイン）を意味する。数式(3)から理解される通り、第２音響成分が第１音響成分に対して優勢な周波数（例えば第２音響成分の強度が第１音響成分の強度を上回る周波数）の係数値ｇ(k,τ)ほど小さい数値となるように、各係数値ｇ(k,τ)は０以上かつ１以下の範囲内で音響信号ＳXの音響特性に応じて可変に設定される。 The separation filter generation unit 32 sequentially generates the separation filter G (τ) for each unit period. The separation filter generation unit 32 of the present embodiment sets each coefficient value g (k, τ) of the separation filter G (τ) so that the second acoustic component of the acoustic signal SX is suppressed. Specifically, the separation filter G (τ) is a Wiener filter in which each coefficient value g (k, τ) is expressed by the following formula (3).

The symbol | X (k, τ) | ^{2 in} Equation (3) is the power of the frequency component X (k, τ), and the symbol | E (k, τ) | ² is the first estimated from the acoustic signal SX. It is the power of the acoustic component. A known technique is arbitrarily employed for estimating the power | E (k, τ) | ² of the first acoustic component. As understood from the equations (1) and (2), the coefficient value g (k, τ) acts on the intensity of the frequency component X (k, τ) and the gain of the frequency component X (k, τ) ( Spectrum gain). As understood from Equation (3), the coefficient value g (k) of the frequency at which the second acoustic component is dominant over the first acoustic component (for example, the frequency at which the intensity of the second acoustic component exceeds the intensity of the first acoustic component). , τ), the coefficient values g (k, τ) are variably set in the range of 0 or more and 1 or less according to the acoustic characteristics of the acoustic signal SX.

ところで、分離フィルタＧ(τ)は以上のように生成されるから、音響信号ＳXのスペクトルＱX(τ)に分離フィルタＧ(τ)を作用させるだけの構成（以下「対比例」という）でも、音響信号ＳXから第２音響成分を抑圧することは可能である（Ｙ(k,τ)＝ｇ(k,τ)Ｘ(k,τ)）。対比例では、音響信号ＳXのスペクトルＱX(τ)の位相が分離フィルタＧ(τ)の作用後にも維持される。しかし、第１音響成分のパワー|Ｅ(k,τ)|²を音響信号ＳXから完全に推定することは現実には困難であるから、対比例のもとで音響信号ＳXに分離フィルタＧ(τ)を作用させた音響には、第２音響成分の一部（残存成分）が残存し得る。第１実施形態の移相フィルタＰ(τ)は、分離フィルタＧ(τ)では抑圧し切れない残存成分を聴感的に知覚され難くするためのフィルタである。 By the way, since the separation filter G (τ) is generated as described above, even in a configuration in which the separation filter G (τ) is simply applied to the spectrum QX (τ) of the acoustic signal SX (hereinafter referred to as “proportional”), It is possible to suppress the second acoustic component from the acoustic signal SX (Y (k, τ) = g (k, τ) X (k, τ)). In contrast, the phase of the spectrum QX (τ) of the acoustic signal SX is maintained even after the action of the separation filter G (τ). However, since it is actually difficult to completely estimate the power | E (k, τ) | ² of the first acoustic component from the acoustic signal SX, the separation filter G ( A part (residual component) of the second acoustic component may remain in the sound on which τ) is applied. The phase shift filter P (τ) of the first embodiment is a filter for making it difficult to perceptually perceive residual components that cannot be suppressed by the separation filter G (τ).

図２の移相フィルタ生成部３４は、音響信号ＳXに作用させる移相フィルタＰ(τ)を単位期間毎に順次に生成する。移相フィルタＰ(τ)は、前述の通り、相異なる周波数（または周波数帯域）に対応する複数の係数値ｐ(k,τ)の系列である。具体的には、移相フィルタＰ(τ)の各係数値ｐ(k,τ)は、以下の数式(4)で表現される通り、スペクトルＱX(τ)における各周波数成分Ｘ(k,τ)の位相を移相量θ(k,τ)だけ変化させるように設定される。

The phase shift filter generation unit 34 in FIG. 2 sequentially generates the phase shift filter P (τ) that acts on the acoustic signal SX for each unit period. As described above, the phase shift filter P (τ) is a series of a plurality of coefficient values p (k, τ) corresponding to different frequencies (or frequency bands). Specifically, each coefficient value p (k, τ) of the phase shift filter P (τ) is expressed by the following formula (4), and each frequency component X (k, τ) in the spectrum QX (τ) is expressed. ) Is changed by a phase shift amount θ (k, τ).

数式(4)の記号ｊは虚数単位を意味する。第１実施形態の移相量θ(k,τ)は、制御値Ｆ(k,τ)と変動値ｎ(k,τ)とに応じて可変に設定される。具体的には、移相フィルタ生成部３４は、以下の数式(5)で表現される通り、制御値Ｆ(k,τ)と変動値ｎ(k,τ)とを乗算することで移相量θ(k,τ)を算定する。

The symbol j in the formula (4) means an imaginary unit. The phase shift amount θ (k, τ) of the first embodiment is variably set according to the control value F (k, τ) and the fluctuation value n (k, τ). Specifically, the phase shift filter generation unit 34 multiplies the control value F (k, τ) and the fluctuation value n (k, τ) by the phase shift as expressed by the following formula (5). The quantity θ (k, τ) is calculated.

図２の変動値生成部３６は、数式(5)の変動値ｎ(k,τ)を複数の周波数の各々について単位期間毎に算定する。変動値ｎ(k,τ)は、単位期間毎に刻々と変動する数値であり、例えば単位期間毎に発生する乱数である。例えば、変動値生成部３６は、所定範囲(-π≦ｎ(k,τ)≦π)内の各数値が略一様な確率で発生する一様乱数を変動値ｎ(k,τ)として算定する。したがって、変動値ｎ(k,τ)は正数または負数に設定され得る。 The fluctuation value generation unit 36 in FIG. 2 calculates the fluctuation value n (k, τ) of Equation (5) for each of a plurality of frequencies for each unit period. The variation value n (k, τ) is a numerical value that varies every unit period, for example, a random number generated every unit period. For example, the fluctuation value generation unit 36 uses, as the fluctuation value n (k, τ), a uniform random number in which each numerical value within a predetermined range (−π ≦ n (k, τ) ≦ π) is generated with a substantially uniform probability. Calculate. Therefore, the variation value n (k, τ) can be set to a positive number or a negative number.

数式(5)の制御値Ｆ(k,τ)は、複数の周波数の各々について単位期間毎に設定される。具体的には、任意の１個の単位期間における任意の１個の周波数の制御値Ｆ(k,τ)は、当該単位期間の分離フィルタＧ(τ)のうち当該周波数の係数値ｇ(k,τ)に応じて設定される。具体的には、移相フィルタ生成部３４は、係数値ｇ(k,τ)により音響信号ＳXが抑圧される度合が大きい（係数値ｇ(k,τ)が小さい）ほど制御値Ｆ(k,τ)を大きい数値に設定する。例えば、前掲の数式(6)から理解される通り、所定値（数式(6)の例示では１）から係数値ｇ(k,τ)を減算した数値が制御値Ｆ(k,τ)として好適である。 The control value F (k, τ) in Expression (5) is set for each unit period for each of a plurality of frequencies. Specifically, the control value F (k, τ) of any one frequency in any one unit period is the coefficient value g (k) of the frequency in the separation filter G (τ) of the unit period. , τ). Specifically, the phase shift filter generation unit 34 increases the control value F (k) as the degree to which the acoustic signal SX is suppressed by the coefficient value g (k, τ) increases (the coefficient value g (k, τ) decreases). , τ) is set to a large value. For example, as understood from Equation (6) above, a value obtained by subtracting the coefficient value g (k, τ) from a predetermined value (1 in the example of Equation (6)) is suitable as the control value F (k, τ). It is.

以上の説明から理解される通り、本実施形態では、各周波数の移相量θ(k,τ)に対する変動値ｎ(k,τ)の変動の影響（移相量θ(k,τ)の変動の度合）が、分離フィルタＧ(τ)の係数値ｇ(k,τ)（制御値Ｆ(k,τ)）に応じて単位期間毎および周波数毎に制御される。係数値ｇ(k,τ)が小さい（第２音響成分が優勢である）ほど制御値Ｆ(k,τ)は大きな数値に設定されるから、音響信号ＳXのうち第２音響成分が優勢な周波数の移相量θ(k,τ)は、第１音響成分が優勢な周波数の移相量θ(k,τ)と比較して大きく変動するという傾向がある。 As understood from the above description, in the present embodiment, the influence of the fluctuation of the fluctuation value n (k, τ) on the phase shift quantity θ (k, τ) of each frequency (the phase shift quantity θ (k, τ) The degree of variation is controlled for each unit period and for each frequency according to the coefficient value g (k, τ) (control value F (k, τ)) of the separation filter G (τ). Since the control value F (k, τ) is set to a larger value as the coefficient value g (k, τ) is smaller (the second acoustic component is more dominant), the second acoustic component is dominant in the acoustic signal SX. The frequency phase shift amount θ (k, τ) tends to vary greatly compared to the phase shift amount θ (k, τ) at which the first acoustic component is dominant.

音響信号ＳXのうち時間軸上の特定の時点に存在する特定の周波数の残存成分に着目する。各単位期間は時間軸上で相互に重複する（すなわち、音響信号の各周波数の成分が複数の単位期間にわたる）から、残存成分は、複数の単位期間に重複して存在する。各単位期間の残存成分は、当該単位期間の移相フィルタＰ(τ)の係数値ｐ(k,τ)の作用で移相量θ(k,τ)だけ位相が変化する。移相量θ(k,τ)は、単位期間毎に変動する変動値ｎ(k,τ)に応じて設定されるから、各単位期間の残存成分の位相は単位期間毎に相異なる移相量θ(k,τ)だけ変動する。すなわち、各単位期間の残存成分は、時間軸上の相異なる時点に移動する。波形生成部５０では、各単位期間が相互に重複した状態で加算されるから、移相フィルタＰ(τ)の作用前に時間軸上の１点に存在していた残存成分は、移相フィルタＰ(τ)の作用後には時間軸上の複数の時点に分散される。すなわち、音響信号ＳXの残存成分の強度（エネルギー）が時間軸上で分散される。移相フィルタＰ(τ)は、音響信号ＳXのうち第２音響成分の残存成分に残響を付与する要素とも換言される。 Attention is paid to a residual component of a specific frequency existing at a specific time point on the time axis in the acoustic signal SX. Since each unit period overlaps with each other on the time axis (that is, the component of each frequency of the acoustic signal extends over a plurality of unit periods), the remaining component exists in a plurality of unit periods. The phase of the remaining component in each unit period changes by the phase shift amount θ (k, τ) by the action of the coefficient value p (k, τ) of the phase shift filter P (τ) in the unit period. Since the phase shift amount θ (k, τ) is set according to the fluctuation value n (k, τ) that varies for each unit period, the phase of the remaining component in each unit period is different for each unit period. It fluctuates by an amount θ (k, τ). That is, the remaining component of each unit period moves to different time points on the time axis. In the waveform generation unit 50, each unit period is added in a state of being overlapped with each other. Therefore, the remaining component existing at one point on the time axis before the action of the phase shift filter P (τ) is the phase shift filter. After the action of P (τ), it is dispersed at a plurality of time points on the time axis. That is, the intensity (energy) of the remaining component of the acoustic signal SX is dispersed on the time axis. In other words, the phase shift filter P (τ) is an element that gives reverberation to the remaining component of the second acoustic component of the acoustic signal SX.

各周波数成分Ｘ(k,τ)の移相量θ(k,τ)は、第２音響成分が優勢な周波数ほど大きいから、音響信号ＳXに移相フィルタＰ(τ)を作用させることで、音響信号ＳXの複数の周波数成分Ｘ(k,τ)のうち第２音響成分が優勢な周波数の周波数成分Ｘ(k,τ)ほど時間軸上の広範囲に分散される。したがって、第１実施形態によれば、音響信号ＳXの位相が分離フィルタＧ(τ)の作用後にも維持される対比例と比較して、第２音響成分の残存成分を聴感的に知覚され難くすることができる。 Since the phase shift amount θ (k, τ) of each frequency component X (k, τ) is larger as the frequency where the second acoustic component is dominant, by applying the phase shift filter P (τ) to the acoustic signal SX, Of the plurality of frequency components X (k, τ) of the acoustic signal SX, the frequency component X (k, τ) having the dominant frequency of the second acoustic component is dispersed over a wide range on the time axis. Therefore, according to the first embodiment, the remaining component of the second acoustic component is less likely to be perceptually perceived as compared with the contrast in which the phase of the acoustic signal SX is maintained even after the action of the separation filter G (τ). can do.

＜第２実施形態＞
本発明の第２実施形態を以下に説明する。なお、以下に例示する各構成において作用や機能が第１実施形態と同等である要素については、第１実施形態の説明で参照した符号を流用して各々の詳細な説明を適宜に省略する。 Second Embodiment
A second embodiment of the present invention will be described below. In addition, about the element in which an effect | action and a function are equivalent to 1st Embodiment in each structure illustrated below, each reference detailed in description of 1st Embodiment is diverted, and each detailed description is abbreviate | omitted suitably.

第１実施形態では、音響信号ＳXの第１音響成分を抽出した音響信号ＳYを生成した。第２実施形態の音響処理装置１００は、音響信号ＳY1および音響信号ＳY2を音響信号ＳXから選択的に生成する。音響信号ＳY1は、第１実施形態の音響信号ＳYと同様に、第１音響成分を目的音成分として音響信号ＳXから抽出（第２音響成分を非目的音成分として抑圧）した時間領域信号であり、音響信号ＳY2は、第２音響成分を目的音成分として音響信号ＳXから抽出（第１音響成分を非目的音成分として抑圧）した時間領域信号である。音響信号ＳY1または音響信号ＳY2の何れを生成するかは、例えば利用者からの指示に応じて選択される。 In the first embodiment, the acoustic signal SY obtained by extracting the first acoustic component of the acoustic signal SX is generated. The acoustic processing apparatus 100 according to the second embodiment selectively generates the acoustic signal SY1 and the acoustic signal SY2 from the acoustic signal SX. The acoustic signal SY1 is a time domain signal extracted from the acoustic signal SX with the first acoustic component as the target sound component (suppressing the second acoustic component as the non-target sound component), like the acoustic signal SY of the first embodiment. The acoustic signal SY2 is a time-domain signal extracted from the acoustic signal SX using the second acoustic component as the target sound component (suppressing the first acoustic component as the non-target sound component). Whether the acoustic signal SY1 or the acoustic signal SY2 is generated is selected in accordance with an instruction from the user, for example.

第２実施形態のフィルタ生成部３０は、音響信号ＳXの第１音響成分を抽出するための処理フィルタＭ1(τ)と音響信号ＳXの第２音響成分を抽出するための処理フィルタＭ2(τ)とを選択的に生成する。処理フィルタＭi(τ)（ｉ＝１,２）は、相異なる周波数に対応する複数の係数値ｍi(k,τ)の系列であり、分離フィルタＧi(τ)と移相フィルタＰi(τ)とを含んで構成される。分離フィルタＧi(τ)は、音響信号ＳXの各周波数成分Ｘ(k,τ)の強度を変調する複数の係数値ｇi(k,τ)で構成され、移相フィルタＰi(τ)は、各周波数成分Ｘ(k,τ)の位相を変調する複数の係数値ｐi(k,τ)で構成される。 The filter generator 30 of the second embodiment includes a processing filter M1 (τ) for extracting the first acoustic component of the acoustic signal SX and a processing filter M2 (τ) for extracting the second acoustic component of the acoustic signal SX. And are selectively generated. The processing filter Mi (τ) (i = 1, 2) is a series of a plurality of coefficient values mi (k, τ) corresponding to different frequencies, and the separation filter Gi (τ) and the phase shift filter Pi (τ). It is comprised including. The separation filter Gi (τ) is composed of a plurality of coefficient values gi (k, τ) that modulate the intensity of each frequency component X (k, τ) of the acoustic signal SX, and the phase shift filter Pi (τ) It is composed of a plurality of coefficient values pi (k, τ) for modulating the phase of the frequency component X (k, τ).

信号処理部４０は、音響信号ＳXの各単位期間のスペクトルＱX(τ)に当該単位期間の処理フィルタＭi(τ)を作用させることで、音響信号ＳYiのスペクトルＱYi(τ)を単位期間毎に順次に生成する。具体的には、信号処理部４０は、以下の数式(7)の演算で音響信号ＳYiの周波数成分Ｙi(k,τ)を算定する。

The signal processing unit 40 applies the processing filter Mi (τ) of the unit period to the spectrum QX (τ) of each unit period of the acoustic signal SX, thereby obtaining the spectrum QYi (τ) of the acoustic signal SYi for each unit period. Generate sequentially. Specifically, the signal processing unit 40 calculates the frequency component Yi (k, τ) of the acoustic signal SYi by the calculation of the following formula (7).

処理フィルタＭi(τ)の各係数値ｍi(k,τ)は、以下の数式(8)で表現される通り、係数値ｇi(k,τ)と係数値ｐi(k,τ)との乗算値に相当する。

Each coefficient value mi (k, τ) of the processing filter Mi (τ) is multiplied by the coefficient value gi (k, τ) and the coefficient value pi (k, τ) as expressed by the following equation (8). Corresponds to the value.

分離フィルタＧ1(τ)の各係数値ｇ1(k,τ)は、第１実施形態の係数値ｇ(k,τ)と同様に前掲の数式(3)で算定され、周波数成分Ｘ(k,τ)にて第２音響成分が優勢であるほど小さい数値となる。したがって、数式(7)および数式(8)から理解される通り、処理フィルタＭ1(k,τ)は、第１実施形態の処理フィルタＭ(k,τ)と同様に、音響信号ＳXの第２音響成分を抑圧するとともに、第２音響成分の残存成分（分離フィルタＧ1(τ)では抑圧し切れない成分）が移相により聴感的に知覚され難くなるように作用する。 Each coefficient value g1 (k, τ) of the separation filter G1 (τ) is calculated by the above equation (3) similarly to the coefficient value g (k, τ) of the first embodiment, and the frequency component X (k, τ) The value becomes smaller as the second acoustic component becomes dominant at τ). Therefore, as can be understood from Equation (7) and Equation (8), the processing filter M1 (k, τ) is similar to the processing filter M (k, τ) of the first embodiment. While suppressing the acoustic component, it acts so that the remaining component of the second acoustic component (a component that cannot be suppressed by the separation filter G1 (τ)) is hardly perceptually perceived by the phase shift.

他方、分離フィルタＧ2(τ)の各係数値ｇ2(k,τ)は、周波数成分Ｘ(k,τ)において第１音響成分が優勢であるほど小さい数値に設定される。例えば、係数値ｇ2(k,τ)は、以下の数式(9)の演算で係数値ｇ1(k,τ)に応じて算定される。

以上の説明から理解される通り、処理フィルタＭ2(k,τ)は、音響信号ＳXの第１音響成分を抑圧するとともに、第１音響成分の残存成分（分離フィルタＧ2(τ)では抑圧し切れない成分）が移相により聴感的に知覚され難くなるように作用する。。 On the other hand, each coefficient value g2 (k, τ) of the separation filter G2 (τ) is set to a smaller value as the first acoustic component becomes dominant in the frequency component X (k, τ). For example, the coefficient value g2 (k, τ) is calculated according to the coefficient value g1 (k, τ) by the calculation of the following formula (9).

As understood from the above description, the processing filter M2 (k, τ) suppresses the first acoustic component of the acoustic signal SX and completely suppresses the remaining component of the first acoustic component (the separation filter G2 (τ)). Not to be perceptually perceived by phase shift. .

第２実施形態においても第１実施形態と同様の効果が実現される。また、第２実施形態では、音響信号ＳXから第１音響成分を抽出した音響信号ＳY1に加え、音響信号ＳXから第２音響成分を抽出した音響信号ＳY2を生成することができる。 In the second embodiment, the same effect as in the first embodiment is realized. In the second embodiment, in addition to the acoustic signal SY1 obtained by extracting the first acoustic component from the acoustic signal SX, the acoustic signal SY2 obtained by extracting the second acoustic component from the acoustic signal SX can be generated.

＜変形例＞
以上に例示した各形態には様々な変形が加えられる。具体的な変形の態様を例示すれば以下の通りである。なお、以下の例示から２以上の態様を任意に選択して組合せてもよい。 <Modification>
Various modifications can be made to each of the forms exemplified above. An example of a specific modification is as follows. Two or more aspects may be arbitrarily selected from the following examples and combined.

（１）分離フィルタＧ(τ)の形式は前述の各形態での例示に限定されない。例えば、各係数値ｇ(k,τ)が以下の数式(10)で表現されるウィナーフィルタを分離フィルタＧ(τ)として利用することも可能である。

数式(10)の記号max{ }は、括弧内の最大値を選択する演算（すなわち差分|Ｘ(k,τ)|²−|Ｉ(k,τ)|²を非負値に制限する演算）を意味する。記号|Ｉ(k,τ)|²は、音響信号ＳXから推定される第２音響成分のパワーである。第２音響成分のパワー|Ｉ(k,τ)|²の推定には公知の技術が任意に採用される。 (1) The format of the separation filter G (τ) is not limited to the examples in the above-described embodiments. For example, a Wiener filter in which each coefficient value g (k, τ) is expressed by the following formula (10) can be used as the separation filter G (τ).

The symbol max {} in Equation (10) is an operation for selecting the maximum value in parentheses (that is, an operation for limiting the difference | X (k, τ) | ² − | I (k, τ) | ² to a non-negative value) Means. The symbol | I (k, τ) | ² is the power of the second acoustic component estimated from the acoustic signal SX. A known technique is arbitrarily employed for estimating the power | I (k, τ) | ² of the second acoustic component.

また、第２実施形態では、分離フィルタＧ1(τ)の係数値ｇ1(k,τ)から分離フィルタＧ2(τ)の各係数値ｇ2(k,τ)を算定したが、第１音響成分を抑圧するための分離フィルタＧ2(τ)の各係数値ｇ2(k,τ)を、以下に例示される数式(11)または(12)の演算で算定することも可能である。

以上の説明から理解される通り、分離フィルタＧ(τ)は、音響信号ＳXの特定成分（第１音響成分または第２音響成分）が抑圧されるように複数の周波数の各々の係数値ｇ(k,τ)が設定されたフィルタとして包括的に表現される。 In the second embodiment, each coefficient value g2 (k, τ) of the separation filter G2 (τ) is calculated from the coefficient value g1 (k, τ) of the separation filter G1 (τ). It is also possible to calculate each coefficient value g2 (k, τ) of the separation filter G2 (τ) for suppression by the calculation of the following formula (11) or (12).

As understood from the above description, the separation filter G (τ) has the coefficient values g () of each of the plurality of frequencies so that the specific component (the first acoustic component or the second acoustic component) of the acoustic signal SX is suppressed. k, τ) is comprehensively expressed as a set filter.

（２）変動値生成部３６が生成する変動値ｎ(k,τ)は一様乱数に限定されない。例えば各数値の発生確率が正規分布に従う正規乱数を変動値ｎ(k,τ)として利用することも可能である。もっとも、変動値ｎ(k,τ)は乱数に限定されない。例えば、単位期間毎に所定量ずつ増加または減少する数値を変動値ｎ(k,τ)として利用することも可能である。また、前述の各形態では、周波数毎に個別に変動値ｎ(k,τ)を設定したが、複数の周波数にわたり共通の変動値ｎ(τ)を単位期間毎に生成することも可能である。以上の説明から理解される通り、前述の各形態の変動値ｎ(k,τ)は、単位期間毎に変動する数値として包括的に表現される。 (2) The fluctuation value n (k, τ) generated by the fluctuation value generator 36 is not limited to a uniform random number. For example, a normal random number in which the occurrence probability of each numerical value follows a normal distribution can be used as the fluctuation value n (k, τ). However, the fluctuation value n (k, τ) is not limited to a random number. For example, a numerical value that increases or decreases by a predetermined amount for each unit period can be used as the fluctuation value n (k, τ). Further, in each of the above-described embodiments, the variation value n (k, τ) is individually set for each frequency. However, a common variation value n (τ) can be generated for each unit period over a plurality of frequencies. . As understood from the above description, the variation value n (k, τ) of each of the above-described forms is comprehensively expressed as a numerical value that varies for each unit period.

（３）第１実施形態では、所定値（１）から係数値ｇ(k,τ)を減算した数値を制御値Ｆ(k,τ)として算定した。以上の構成では、係数値ｇ(k,τ)が１に近い場合（第１音響成分が優勢である場合）にも制御値Ｆ(k,τ)が有意な正数に設定され、結果的に周波数成分Ｘ(k,τ)が移相される。以上の傾向を考慮すると、第１音響成分が優勢である周波数成分Ｘ(k,τ)については位相の変動を積極的に抑制する構成が好適である。例えば、係数値ｇ(k,τ)が所定の閾値ＴHを上回る程度に周波数成分Ｘ(k,τ)にて第１音響成分が優勢である場合には、移相フィルタ生成部３４は、制御値Ｆ(k,τ)を強制的に０に設定する。前掲の数式(5)から理解される通り、制御値Ｆ(k,τ)が０の周波数成分については移相量θ(k,τ)も０となる。すなわち、係数値ｇ(k,τ)が所定の閾値ＴHを上回る周波数成分Ｘ(k,τ)については位相が変更されない。したがって、以上の構成によれば、音響信号ＳXの第１音響成分の波形歪を抑制できるという利点がある。なお、第２実施形態においても同様の構成が採用される。具体的には、係数値ｇi(k,τ)が所定の閾値ＴHi（閾値ＴH1と閾値ＴH2との異同は不問）を上回る周波数の制御値Ｆi(k,τ)（移相量θi(k,τ)）が０に設定される。 (3) In the first embodiment, a numerical value obtained by subtracting the coefficient value g (k, τ) from the predetermined value (1) is calculated as the control value F (k, τ). In the above configuration, the control value F (k, τ) is set to a significant positive number even when the coefficient value g (k, τ) is close to 1 (when the first acoustic component is dominant), and as a result The frequency component X (k, τ) is phase-shifted. Considering the above tendency, it is preferable that the frequency component X (k, τ), in which the first acoustic component is dominant, be configured to positively suppress the phase variation. For example, when the first acoustic component is dominant in the frequency component X (k, τ) to such an extent that the coefficient value g (k, τ) exceeds a predetermined threshold value TH, the phase shift filter generation unit 34 performs control. The value F (k, τ) is forcibly set to 0. As understood from the above formula (5), the phase shift amount θ (k, τ) is also zero for the frequency component for which the control value F (k, τ) is zero. That is, the phase is not changed for the frequency component X (k, τ) in which the coefficient value g (k, τ) exceeds the predetermined threshold value TH. Therefore, according to the above structure, there exists an advantage that the waveform distortion of the 1st acoustic component of acoustic signal SX can be suppressed. Note that the same configuration is also adopted in the second embodiment. Specifically, the control value Fi (k, τ) (phase shift amount θi (k, τ)) at a frequency at which the coefficient value gi (k, τ) exceeds a predetermined threshold value THi (the difference between the threshold value TH1 and the threshold value TH2 does not matter). τ)) is set to zero.

（４）第２実施形態においては、音響信号ＳY1および音響信号ＳY2を選択的に生成したが、音響信号ＳY1および音響信号ＳY2を並列に生成することも可能である。以上の構成によれば、例えば音響信号ＳY1および音響信号ＳY2の各々に別個の処理(例えば残響付与等の各種の効果付与)を実行したうえで両者を混合するといった特徴的な音響処理が実現される。 (4) In the second embodiment, the acoustic signal SY1 and the acoustic signal SY2 are selectively generated, but the acoustic signal SY1 and the acoustic signal SY2 can also be generated in parallel. According to the above configuration, for example, characteristic acoustic processing is realized in which the acoustic signal SY1 and the acoustic signal SY2 are separately processed (for example, various effects such as reverberation are imparted) and then mixed. The

（５）携帯電話機等の端末装置と通信するサーバ装置で音響処理装置１００を実現することも可能である。例えば、音響処理装置１００は、端末装置から受信した音響信号ＳXから音響信号ＳYを生成して端末装置に送信する。なお、音響信号ＳXのスペクトルＱX(τ)を端末装置から受信する構成（例えば端末装置が周波数分析部２０を具備する構成）では音響処理装置１００から周波数分析部２０が省略され、スペクトルＱY(τ)を端末装置に送信する構成（例えば端末装置が波形生成部５０を具備する構成）では音響処理装置１００から波形生成部５０が省略される。また、音響処理装置１００が生成した処理フィルタＭ(τ)を端末装置等の外部装置に転送して音響信号ＳXの処理に適用する構成では、信号処理部４０も音響処理装置１００から省略され得る。 (5) The sound processing apparatus 100 can also be realized by a server device that communicates with a terminal device such as a mobile phone. For example, the acoustic processing device 100 generates an acoustic signal SY from the acoustic signal SX received from the terminal device and transmits the acoustic signal SY to the terminal device. In the configuration in which the spectrum QX (τ) of the acoustic signal SX is received from the terminal device (for example, the configuration in which the terminal device includes the frequency analysis unit 20), the frequency analysis unit 20 is omitted from the acoustic processing device 100, and the spectrum QY (τ ) To the terminal device (for example, a configuration in which the terminal device includes the waveform generation unit 50), the waveform generation unit 50 is omitted from the acoustic processing device 100. In the configuration in which the processing filter M (τ) generated by the acoustic processing device 100 is transferred to an external device such as a terminal device and applied to the processing of the acoustic signal SX, the signal processing unit 40 can also be omitted from the acoustic processing device 100. .

１００……音響処理装置、１２……信号供給装置、１４……放音装置、２０……周波数分析部、２２……演算処理装置、２４……記憶装置、３０……フィルタ生成部、３２……分離フィルタ生成部、３４……移相フィルタ生成部、３６……変動値生成部、４０……信号処理部、５０……波形生成部
DESCRIPTION OF SYMBOLS 100 ... Acoustic processing device, 12 ... Signal supply device, 14 ... Sound emission device, 20 ... Frequency analysis part, 22 ... Arithmetic processing device, 24 ... Memory | storage device, 30 ... Filter production | generation part, 32 ... ... Separation filter generator 34... Phase shift filter generator 36. Fluctuation value generator 40. Signal processor 50. Waveform generator

Claims

音響信号の特定成分が抑圧されるように複数の周波数の各々の係数値が設定された分離フィルタを、時間軸上で相互に重複する単位期間毎に生成する分離フィルタ生成手段と、
前記複数の周波数の各々における位相を、前記分離フィルタのうち当該周波数の係数値と単位期間毎に変動する変動値とに応じた移相量だけ変化させる移相フィルタを、単位期間毎に生成する移相フィルタ生成手段と
を具備する音響処理装置。 Separation filter generation means for generating a separation filter in which coefficient values of each of a plurality of frequencies are set so as to suppress a specific component of an acoustic signal for each unit period overlapping each other on a time axis;
A phase shift filter that changes the phase at each of the plurality of frequencies by a phase shift amount corresponding to a coefficient value of the frequency and a fluctuation value that varies for each unit period of the separation filter is generated for each unit period. A sound processing apparatus comprising: a phase shift filter generating unit.

前記移相量は、各周波数の前記係数値が前記特定成分を抑圧する度合が大きいほど大きい数値となる制御値と前記変動値との乗算に応じて設定される
請求項１の音響処理装置。 The acoustic processing apparatus according to claim 1, wherein the phase shift amount is set according to multiplication of a control value that becomes a larger numerical value as the degree to which the coefficient value of each frequency suppresses the specific component is larger and the fluctuation value.

前記変動値は、単位期間毎に生成された乱数である
請求項１または請求項２の音響処理装置。 The sound processing apparatus according to claim 1, wherein the fluctuation value is a random number generated for each unit period.

前記分離フィルタと前記移相フィルタとを前記音響信号に作用させる信号処理手段
を具備する
請求項１から請求項３の何れかの音響処理装置。
The sound processing device according to claim 1, further comprising: a signal processing unit that causes the separation filter and the phase shift filter to act on the sound signal.