JP2008203315A

JP2008203315A - Audio encoding/decoding device and method, and software

Info

Publication number: JP2008203315A
Application number: JP2007036077A
Authority: JP
Inventors: Junji Araki; 潤二荒木
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2007-02-16
Filing date: 2007-02-16
Publication date: 2008-09-04

Abstract

<P>PROBLEM TO BE SOLVED: To solve the problem that, in conventional MS stereo processing, compression efficiency is reduced, Middle-Side (MS) stereo processing is not performed since no processing is present for increasing deviation between Middle signals and Side signals generated from an input signal. <P>SOLUTION: In order to increase the deviation between Middle signals and Side signals, a weighting coefficient is introduced to a formula for calculating conventional Middle signals and Side signals, and the deviation between signals is intentionally made large. Thereby, opportunity for executing the MS stereo processing is increased, and as a result, a bit amount required for encoding is reduced and encoding efficiency is improved. Moreover, by reducing the bit amount required for encoding, more bits can be allocated to other encoding processing and high quality sound is attained. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、デジタル圧縮符号化方式を用いたオーディオ信号の符号化・復号化装置に関し、特にＭＳステレオ処理を有するオーディオ信号の符号化・復号化装置に関する。 The present invention relates to an audio signal encoding / decoding device using a digital compression encoding method, and more particularly to an audio signal encoding / decoding device having MS stereo processing.

従来のオーディオ信号符号化方法、および、復号化方法としては、公知なものとしてＩＳＯ／ＩＥＣの国際標準方式、通称ＭＰＥＧ方式などが挙げられる。現在、幅広い応用を持ち、低ビットレート時でも高音質な符号化方式として、非特許文献１に記載のＭＰＥＧ２ＡＡＣ（ＡｄｖａｎｃｅｄＡｕｄｉｏＣｏｄｉｎｇ）などがあげられる。この方式の拡張規格も複数規格化が現在なされている。 As a conventional audio signal encoding method and decoding method, ISO / IEC international standard system, commonly called MPEG system, and the like are known. Currently, MPEG2 AAC (Advanced Audio Coding) described in Non-Patent Document 1 can be cited as an encoding method having a wide range of applications and high sound quality even at low bit rates. A plurality of extended standards of this method are currently being standardized.

これらの符号化・復号化方式において、圧縮効率を高めるための処理の一つとしてＭＳステレオ処理がある。以下図面を参照しながら、ＭＳステレオ処理のエンコーダとデコーダについて説明する。 In these encoding / decoding methods, there is MS stereo processing as one of the processes for improving the compression efficiency. Hereinafter, an encoder and a decoder for MS stereo processing will be described with reference to the drawings.

図５はＭＳステレオ処理のエンコーダにおける構成を示す図である。図５において、１１はＬチャンネル入力信号（以下Ｌ信号と呼ぶ）とＲチャンネル入力信号（以下Ｒ信号と呼ぶ）と（以下ＬＲ信号と呼ぶ）からＭｉｄｄｌｅ信号（和信号、以下Ｍ信号と呼ぶ）とＳｉｄｅ信号（差信号、以下Ｓ信号と呼ぶ）と（以下ＭＳ信号と呼ぶ）を算出するＭＳ信号算出手段であり、１２は前記ＭＳ信号算出手段１１で算出したＭＳ信号から２乗振幅和信号（Ｘ）を算出するＭＳ２乗振幅和算出手段であり、１３はＭＳ２乗振幅和算出手段１２から算出したＭＳ信号の２乗振幅和信号Ｘをもとに、ＭＳステレオ処理を実施するかどうかの判定結果を出力するＭＳステレオ判定手段であり、１４は前記ＭＳステレオ判定手段１３で判定した判定結果を基に、ＬＲ信号もしくはＭＳ信号を選択して符号化処理を行い、さらに前記判定結果をフラグに設定する信号処理手段である。 FIG. 5 is a diagram showing a configuration of an encoder for MS stereo processing. In FIG. 5, reference numeral 11 denotes an L channel input signal (hereinafter referred to as L signal), an R channel input signal (hereinafter referred to as R signal), and (hereinafter referred to as LR signal) to a Middle signal (sum signal, hereinafter referred to as M signal). And Side signal (difference signal, hereinafter referred to as S signal) and MS signal calculation means for calculating (hereinafter referred to as MS signal), 12 is a square amplitude sum signal from the MS signal calculated by the MS signal calculation means 11 MS square amplitude sum calculating means for calculating (X). Reference numeral 13 denotes whether to perform MS stereo processing based on the square amplitude sum signal X of the MS signal calculated from the MS square amplitude sum calculating means 12. MS stereo determination means for outputting a determination result, and 14 selects an LR signal or an MS signal based on the determination result determined by the MS stereo determination means 13 and performs an encoding process. A signal processing means for setting the serial determination result flag.

以上のように構成されたＭＳステレオ処理の動作について、図６のフロー図を用いて説明する。まず、ＭＳ信号算出ステップ２０１にて、式１を用いて、ＭＳ信号算出手段１１に入力されたＬＲ信号（式中Ｌ（ｎ）、Ｒ（ｎ）、ｎはサンプル番号）から、ＭＳ信号（式中Ｍ（ｎ）、Ｓ（ｎ））を算出する。 The operation of the MS stereo processing configured as described above will be described with reference to the flowchart of FIG. First, in the MS signal calculation step 201, using the equation 1, the LR signal (L (n), R (n), where n is a sample number) input to the MS signal calculation unit 11 is used to calculate the MS signal ( In the formula, M (n), S (n)) is calculated.

次に、ＭＳ２乗振幅和算出ステップ２０２にて、ＭＳ２乗振幅和算出手段１２が、ＭＳ信号算出手段１１で算出したＭＳ信号から、式２を用いてそれぞれの振幅２乗和を表すＳＵＭ＿Ｍ、ＳＵＭ＿Ｓを算出する。ここで、Σの累積加算範囲はＭＳステレオ処理の処理単位であり、例えば１０２４個のサンプルからなる１フレームである。 Next, in the MS square amplitude sum calculation step 202, the MS square amplitude sum calculation means 12 uses the SUM_M and SUM_S representing the respective amplitude square sums using the expression 2 from the MS signal calculated by the MS signal calculation means 11. Is calculated. Here, the cumulative addition range of Σ is a processing unit of MS stereo processing, and is, for example, one frame composed of 1024 samples.

さらに式３を用いて、Ｍ信号とＳ信号のそれぞれの振幅２乗和を表すＳＵＭ＿ＭとＳＵＭ＿Ｓとの偏りを表す値Ｘを算出する。 Further, using Formula 3, a value X representing the bias between SUM_M and SUM_S representing the sum of squared amplitudes of the M signal and S signal is calculated.

次にＭＳステレオ判定ステップ２０３にて、ＭＳステレオ判定手段１３が、ＭＳ２乗振幅和算出手段１２で算出した偏りを表すＸと、ＬＲ信号を符号化するよりもＭＳステレオ処理を実行してＭＳ信号を符号化する方が圧縮効率が上がると判断できる閾値Ｔ１，Ｔ２とを比較し、ＭＳステレオ処理を実行するかどうかを判定し、その判定結果を出力する。ここでＴ１はＳＵＭ＿ＭがＳＵＭ＿Ｓに対して十分大きい場合の閾値で、Ｔ２はＳＵＭ＿ＳがＳＵＭ＿Ｍに対して十分に大きい場合の閾値である。つまり、Ｘ≧Ｔ１もしくはＸ≦Ｔ２であればＭＳステレオ処理を実行するという判定結果を出力し、ＭＳ信号選択ステップ２０４にて、信号処理手段１４がＭＳ信号算出手段１１の出力であるＭＳ信号を選択して符号化するとともに、次のＭＳステレオ実行フラグ設定ステップ２０５にて、信号処理手段１４がＭＳステレオ実行フラグを設定する。また、Ｔ２＜Ｘ＜Ｔ１であればＭＳステレオ処理を実行しないという判定結果を出力し、ＬＲ信号選択ステップ２０６にて、信号処理手段１４がＬＲ信号を選択して符号化する。 Next, in MS stereo determination step 203, the MS stereo determination means 13 performs MS stereo processing rather than encoding the LR signal with X representing the bias calculated by the MS square amplitude sum calculation means 12, and the MS signal. Are compared with threshold values T1 and T2 that can be determined that the compression efficiency is increased, and it is determined whether or not the MS stereo processing is executed, and the determination result is output. Here, T1 is a threshold when SUM_M is sufficiently larger than SUM_S, and T2 is a threshold when SUM_S is sufficiently larger than SUM_M. That is, if X ≧ T1 or X ≦ T2, a determination result that MS stereo processing is executed is output, and in the MS signal selection step 204, the signal processing means 14 determines the MS signal that is the output of the MS signal calculation means 11 At the next MS stereo execution flag setting step 205, the signal processing means 14 sets the MS stereo execution flag. If T2 <X <T1, a determination result indicating that the MS stereo processing is not executed is output, and in LR signal selection step 206, the signal processing means 14 selects and encodes the LR signal.

続いて、ＭＳステレオ処理のデコーダについて説明する。図７はＭＳステレオ処理のデコーダの構成を示す図である。１６は入力信号を復号化してＬＲ信号もしくはＭＳ信号を出力すると共に、ＭＳステレオ実行フラグの有無を判別する信号処理手段であり、１７は信号処理手段１６においてＭＳステレオ実行フラグを判別してエンコーダ側でＭＳステレオ処理が実行されたと判断した場合に、信号処理手段１６からのＭＳ信号に所定の処理を実行してＬＲ信号を復元するデコード手段である。 Next, the MS stereo processing decoder will be described. FIG. 7 is a diagram showing a configuration of a decoder for MS stereo processing. Reference numeral 16 denotes signal processing means for decoding the input signal and outputting the LR signal or the MS signal and determining the presence or absence of the MS stereo execution flag. Reference numeral 17 denotes the encoder side by determining the MS stereo execution flag in the signal processing means 16. The decoding means restores the LR signal by executing a predetermined process on the MS signal from the signal processing means 16 when it is determined that the MS stereo processing has been executed.

以上のように構成されたＭＳステレオデコード処理の動作について、以下図８のフロー図を用いて説明する。図８のＭＳステレオ判別ステップ２０７にて、信号処理手段１６が、エンコーダ側でＭＳステレオ処理されたかどうかを示すフラグを確認する事で判別し判別結果を出力する。フラグを判別してエンコーダ側でＭＳステレオ処理が実行されていないと判断した場合は、入力信号をデコードする必要がないため、ＭＳステレオデコード処理を終了する。また、フラグを判別してエンコーダ側でＭＳステレオ処理が実行されていると判断した場合は、デコードステップ２０８にて、デコード手段１７が、式４に従いＭＳ信号をデコードしてＬＲ信号を復元する。 The operation of the MS stereo decoding process configured as described above will be described below with reference to the flowchart of FIG. In the MS stereo discrimination step 207 of FIG. 8, the signal processing means 16 discriminates by checking a flag indicating whether or not MS stereo processing has been performed on the encoder side, and outputs a discrimination result. If it is determined that the MS stereo process has not been executed on the encoder side by determining the flag, the MS stereo decoding process is terminated because there is no need to decode the input signal. If it is determined that the MS stereo processing is being executed on the encoder side by determining the flag, the decoding unit 17 decodes the MS signal according to Equation 4 to restore the LR signal in decoding step 208.

ＭＳステレオ処理の上記動作により、Ｌ信号とＲ信号が同符号でかつ絶対値が類似した信号であればＳ信号が小さな値となり、またＬ信号とＲ信号が異符号でかつ信号の絶対値が類似した信号であればＭ信号が小さな値となり、Ｌ信号とＲ信号をそのまま圧縮するよりも割り当てるビット量が少なくて済むために符号化効率を向上させる事が可能となる。 By the above operation of the MS stereo processing, if the L signal and the R signal have the same sign and the absolute values are similar, the S signal becomes a small value, and the L signal and the R signal have different signs and the absolute value of the signal is If the signals are similar, the M signal has a small value, and the bit amount to be allocated is smaller than the compression of the L signal and the R signal as they are, so that the encoding efficiency can be improved.

なお、ＭＳ信号を圧縮するよりもＬＲ信号をそのまま圧縮した方が圧縮効率が良い、つまりＬ信号とＲ信号とに相関がないと判断された場合は、ＭＳステレオ処理をしない。
国際規格ＩＳＯ／ＩＥＣ１３８１８−７：１９９７ When it is determined that the compression efficiency is better when the LR signal is compressed as it is than when the MS signal is compressed, that is, when there is no correlation between the L signal and the R signal, the MS stereo processing is not performed.
International standard ISO / IEC 13818-7: 1997

しかしながら、従来のＭＳステレオ処理では、入力されたＬＲ信号に何も処理を加える事なくＭＳ信号を求めるために、入力信号に対して相関を高める処理を行う事ができず、相関がないと判断されてＭＳステレオ処理をせずに圧縮効率を下げている場合があった。例えば、Ｒ信号がＬ信号のｋ倍（ｋ：整数）で表現可能な場合、Ｒ信号をｋＬとすると、Ｍ信号（式中Ｍ（ｋ））、Ｓ信号（式中Ｓ（ｋ））は式５のように求まる。 However, in the conventional MS stereo processing, in order to obtain the MS signal without performing any processing on the input LR signal, it is not possible to perform processing for increasing the correlation with respect to the input signal, and it is determined that there is no correlation. In some cases, the compression efficiency is lowered without performing MS stereo processing. For example, when the R signal can be expressed by k times the L signal (k: integer), if the R signal is kL, the M signal (M (k) in the formula) and the S signal (S (k) in the formula) are It is obtained as shown in Equation 5.

ここで、Ｒ信号はＬ信号の整数倍であるから、両信号はどちらか一方の入力信号（本例ではＬチャンネル信号）を用いて表現できるため、圧縮効率を高める事が可能であるが、ＭＳステレオ処理では和と差による演算結果のみがＭ信号とＳ信号に反映されるため、両信号の相関が高いとは判断されない。例えば、ｋの絶対値が１より十分大きい場合はＭ信号、Ｓ信号ともに絶対値が大きいままなので、Ｌ信号とＲ信号との間に相関がないと判断されてＭＳステレオ処理をされず、圧縮効率を高める事ができない、という課題があった。 Here, since the R signal is an integral multiple of the L signal, both signals can be expressed using either one of the input signals (in this example, the L channel signal), so that the compression efficiency can be increased. In the MS stereo processing, only the calculation result based on the sum and difference is reflected in the M signal and the S signal, so that it is not determined that the correlation between the two signals is high. For example, when the absolute value of k is sufficiently larger than 1, since the absolute values of both the M signal and the S signal remain large, it is determined that there is no correlation between the L signal and the R signal, and the MS stereo processing is not performed. There was a problem that the efficiency could not be increased.

上記従来の課題を解決するため、本発明のオーディオ符号化装置は、２チャンネルのオーディオ信号から重み係数を算出する重み係数算出手段と、前記重み係数算出手段で算出した前記重み係数を用いて前記オーディオ信号に対するＭｉｄｄｌｅ信号とＳｉｄｅ信号とを算出するＭＳ信号算出手段と、前記ＭＳ信号算出手段で算出した前記Ｍｉｄｄｌｅ信号とＳｉｄｅ信号とを符号化する第１の信号処理手段とを備え、前記重み係数算出手段は、係数を前記オーディオ信号に掛けたものから算出したＭｉｄｄｌｅ信号とＳｉｄｅ信号との偏りが最大となるような係数を重み係数として算出することを特徴とするものである。 In order to solve the above-described conventional problem, the audio encoding device of the present invention uses the weighting coefficient calculation unit that calculates a weighting coefficient from a two-channel audio signal and the weighting coefficient calculated by the weighting coefficient calculation unit. MS signal calculation means for calculating a Middle signal and Side signal for an audio signal, and first signal processing means for encoding the Middle signal and Side signal calculated by the MS signal calculation means, and the weighting factor The calculating means is characterized in that a coefficient that maximizes the bias between the middle signal and the side signal calculated from the coefficient multiplied to the audio signal is calculated as a weighting coefficient.

また、前記ＭＳ信号算出手段で算出した前記Ｍｉｄｄｌｅ信号とＳｉｄｅ信号とをもとに、ＭＳステレオ処理を実施するか否かを判定するＭＳステレオ判定手段を備えた事を特徴とするものである。 Further, the present invention is characterized by comprising MS stereo determination means for determining whether or not to perform MS stereo processing based on the Middle signal and Side signal calculated by the MS signal calculation means.

また、前記重み係数算出手段が、前記２チャンネルのオーディオ信号のそれぞれの信号の２乗を累積したものの比の平方根を前記重み係数として算出することを特徴とするものである。 Further, the weighting factor calculating means calculates the square root of the ratio of the sum of the squares of the signals of the two-channel audio signals as the weighting factor.

また、前記オーディオ信号が時間領域で表現される信号である事を特徴とするものである。 The audio signal is a signal expressed in a time domain.

また、前記オーディオ信号が周波数領域で表現される信号である事を特徴とするものである。 The audio signal is a signal expressed in a frequency domain.

また、本発明のオーディオ復号化装置は、オーディオ符号化装置で符号化された信号からＭｉｄｄｌｅ信号とＳｉｄｅ信号とを復号する第２の信号処理手段と、前記オーディオ符号化装置で算出された重み係数を用いて、前記第２の信号処理手段で算出したＭｉｄｄｌｅ信号とＳｉｄｅ信号とから２チャンネルのオーディオ信号を復元するデコード手段とを備えたことを特徴とするものである。 The audio decoding device of the present invention includes a second signal processing means for decoding a Middle signal and a Side signal from a signal encoded by the audio encoding device, and a weighting factor calculated by the audio encoding device. And a decoding means for restoring a two-channel audio signal from the Middle signal and Side signal calculated by the second signal processing means.

本発明のオーディオ符号化装置によれば、２チャンネルのオーディオ信号と重み係数から算出するＭｉｄｄｌｅ信号とＳｉｄｅ信号の偏りが最大となるような重み係数を算出し、それを用いてＭｉｄｄｌｅ信号とＳｉｄｅ信号を算出することにより、２チャンネルのオーディオ信号をそのまま圧縮符号化するよりも圧縮効率を高める事ができる。 According to the audio encoding device of the present invention, a middle signal calculated from a two-channel audio signal and a weighting factor and a weighting factor that maximizes the bias of the side signal are calculated, and the middle signal and the side signal are calculated using the weighting factor. By calculating the above, it is possible to increase the compression efficiency as compared with the case where the 2-channel audio signal is directly compressed and encoded.

また、重み係数を用いてＭｉｄｄｌｅ信号とＳｉｄｅ信号を算出する事により、ＭＳ判定手段においてＭＳステレオ処理を実施するように判断する場合が増加するため、圧縮効率を高める事ができる。 Further, by calculating the Middle signal and the Side signal using the weighting factor, the number of cases in which the MS determination unit determines to perform the MS stereo processing increases, so that the compression efficiency can be improved.

また本発明のオーディオ復号化装置によれば、Ｍｉｄｄｌｅ信号とＳｉｄｅ信号とを重み係数を用いて２チャンネルの信号に復号化する事ができる。 According to the audio decoding apparatus of the present invention, the Middle signal and the Side signal can be decoded into a 2-channel signal using a weighting coefficient.

以下本発明を実施するための最良の形態について、図面を参照しながら説明する。 The best mode for carrying out the present invention will be described below with reference to the drawings.

（実施の形態１）
図１は実施の形態１におけるオーディオ符号化装置のエンコーダの構成を示す図である。図１において、１は、後述の重み計数ＧをＬチャンネル信号（Ｌ信号）又はＲチャンネル信号（Ｒ信号）に掛けた上で、Ｍｉｄｄｌｅ信号（Ｍ信号）およびＳｉｄｅ信号（Ｓ信号）を算出するＭＳ信号算出手段であり、２は、ＭＳ信号算出手段１で算出したＭ信号とＳ信号から２乗振幅和Ｘを算出するＭＳ２乗振幅和算出手段であり、３は、２乗振幅和ＸからＭＳステレオ処理を実施するかどうかを判定するＭＳステレオ判定手段であり、４は、ＭＳステレオ判定手段３で判定した判定結果を基に、ＬＲ信号又はＭＳ信号のどちらか一方を選択して符号化処理をするとともに、ＭＳステレオ処理を実施したかどうかをフラグに設定し、さらに重み計数Ｇの情報を付加する信号処理手段であり、５は、ＬＲ信号から、後述の方法で重み係数Ｇを算出する重み係数算出手段である。 (Embodiment 1)
FIG. 1 is a diagram showing a configuration of an encoder of an audio encoding device according to Embodiment 1. In FIG. In FIG. 1, 1 calculates a Middle signal (M signal) and a Side signal (S signal) after multiplying an L channel signal (L signal) or an R channel signal (R signal) by a weighting factor G, which will be described later. MS signal calculating means, 2 is an MS square amplitude sum calculating means for calculating a square amplitude sum X from the M signal and S signal calculated by the MS signal calculating means 1, and 3 is from the square amplitude sum X MS stereo determination means for determining whether or not to perform MS stereo processing. 4 is a code selected by selecting either the LR signal or the MS signal based on the determination result determined by the MS stereo determination means 3 This is a signal processing means that sets whether or not the MS stereo processing has been performed and adds information on the weighting coefficient G. 5 is a weighting factor G that is calculated from the LR signal by a method that will be described later. It is a weighting factor calculation means for output.

以下、重み計数Ｇの算出方法について説明する。重み計数Ｇは、Ｌ信号もしくはＲ信号にＧを掛けて算出したＭ信号とＳ信号との２乗振幅和の比（又はその逆数）が最大になるときのＧの値である。 Hereinafter, a method for calculating the weighting factor G will be described. The weighting factor G is the value of G when the ratio of the sum of squared amplitudes of the M signal and S signal (or the reciprocal thereof) calculated by multiplying the L signal or R signal by G is the maximum.

まず、ｎサンプル目のＲチャンネル信号（Ｒ（ｎ））を、これにＧをかけたものＧ・Ｒ（ｎ）に置き換えると、ｎサンプル目のＭ信号（Ｍ（ｎ））とｎサンプル目のＳ信号Ｓ（ｎ）とは、式６で表される。 First, when the R channel signal (R (n)) of the nth sample is replaced with G · R (n) obtained by multiplying it by the G, the M signal (M (n)) of the nth sample and the nth sample The S signal S (n) is expressed by Expression 6.

次に、これらの２乗振幅和ＳＵＭ＿Ｍ、ＳＵＭ＿Ｓは、式７で表される。ここで、Σの累積加算範囲はＭＳステレオ処理の処理単位であり、例えば１０２４個のサンプルからなる１フレームである。 Next, these squared amplitude sums SUM_M and SUM_S are expressed by Expression 7. Here, the cumulative addition range of Σ is a processing unit of MS stereo processing, and is, for example, one frame composed of 1024 samples.

次に、ＳＵＭ＿ＭとＳＵＭ＿Ｓの比Ｘは式８で表される。ただし、Ｘ≧１とする。つまり、ＳＵＭ＿Ｍ≧ＳＵＭ＿Ｓであれば式８のように立式し、ＳＵＭ＿Ｍ＜ＳＵＭ＿Ｓであれば、式８の右辺の分母と分子を逆にする。本実施の形態では、以下ＳＵＭ＿Ｍ≧ＳＵＭ＿Ｓとする。 Next, the ratio X of SUM_M and SUM_S is expressed by Expression 8. However, X ≧ 1. That is, if SUM_M ≧ SUM_S, the expression is as shown in Expression 8, and if SUM_M <SUM_S, the denominator on the right side of Expression 8 is reversed. In the present embodiment, it is assumed that SUM_M ≧ SUM_S hereinafter.

Ｘは値が大きいほど前記Ｍ信号と前記Ｓ信号の信号間の偏りが大きい事を意味している。従って、Ｘが最大になるようなＧ（Ｇ＞０）を算出するために、ＸをＧで偏微分したものがゼロとなるときのＧの値を算出する。ＸをＧで偏微分したものは式９で表わされる。 X means that the larger the value, the greater the deviation between the M signal and the S signal. Therefore, in order to calculate G (G> 0) that maximizes X, the value of G is calculated when X is partially differentiated by G and becomes zero. A result of partial differentiation of X by G is expressed by Equation 9.

式９は次のように変形できる（以下、Ｌ（ｎ）をＬに、Ｒ（ｎ）をＲに省略して表記する。）。 Equation 9 can be modified as follows (hereinafter, L (n) is abbreviated as L and R (n) is abbreviated as R).

Ｘが最大となるのは、式９の値がゼロになるとき、すなわち上式の分子がゼロになるときである。上式の分子は下記のように変形できる。 X is maximized when the value of Equation 9 becomes zero, that is, when the numerator in the above equation becomes zero. The molecule of the above formula can be transformed as follows.

上式の値がゼロになるのは、下記のときである（以下ＬをＬ（ｎ）に、ＲをＲ（ｎ）に戻して表記する。）。 The value of the above equation becomes zero in the following cases (hereinafter, L is returned to L (n) and R is returned to R (n)).

従って、Ｘが最大となるときのＧの値は式１０で表わされる。 Therefore, the value of G when X is maximum is expressed by Equation 10.

このように、重み係数Ｇは、重み係数算出手段５で、Ｌ（ｎ）の２乗を１フレーム期間で累積積算したものと、Ｒ（ｎ）の２乗を１フレーム期間で累積積算したものとの比の平方根を計算する事により算出される。 As described above, the weighting factor G is obtained by accumulating and summing the square of L (n) in one frame period and the sum of squares of R (n) in one frame period by the weighting factor calculation means 5. It is calculated by calculating the square root of the ratio.

以上のように構成されたオーディオ符号化装置のエンコーダの動作について、図２のフロー図を用いて説明する。 The operation of the encoder of the audio encoding apparatus configured as described above will be described with reference to the flowchart of FIG.

まず、重み係数算出ステップ１００にて、重み係数算出手段５が、式１０に基づいて重み係数Ｇを算出する。 First, in the weighting factor calculating step 100, the weighting factor calculating means 5 calculates the weighting factor G based on the equation (10).

次に、ＭＳ信号算出ステップ１０１にて、ＭＳ信号算出手段１が、重み係数算出手段５で算出した重み係数Ｇを式６に代入して、Ｍ信号とＳ信号とを算出する。 Next, in the MS signal calculation step 101, the MS signal calculation unit 1 calculates the M signal and the S signal by substituting the weighting factor G calculated by the weighting factor calculation unit 5 into Equation 6.

次に、ＭＳ２乗振幅和算出ステップ１０２にて、ＭＳ２乗振幅和算出手段２が、ＭＳ信号算出手段１で算出したＭ信号とＳ信号とを式７に代入することにより、ＳＵＭ＿ＭとＳＵＭ＿Ｓを算出し、それを式８に代入して、２チャンネルの入力信号間の偏りを意味するＸを算出する。 Next, in MS square amplitude sum calculation step 102, MS square amplitude sum calculation means 2 calculates SUM_M and SUM_S by substituting the M signal and S signal calculated by MS signal calculation means 1 into Equation 7. Then, by substituting it into Expression 8, X is calculated, which means the bias between the input signals of the two channels.

次に、ＭＳステレオ判定ステップ１０３にて、ＭＳステレオ判定手段３が、ＭＳ２乗振幅和算出手段２で算出したＸと、ＬＲ信号を符号化するよりもＭＳステレオ処理を実行して前ＭＳ信号を符号化する方が圧縮効率が上がると判断可能となる閾値Ｔ１、Ｔ２とを比較し、ＭＳステレオ処理を実行するかどうかを判定する。ここでＴ１はＳＵＭ＿ＭがＳＵＭ＿Ｓに対して十分大きい場合の閾値で、Ｔ２はＳＵＭ＿ＳがＳＵＭ＿Ｍに対して十分に大きい場合の閾値である。すなわち、Ｘ≧Ｔ１もしくはＸ≦Ｔ２であればＭＳステレオ処理を実行し、Ｔ２＜Ｘ＜Ｔ１であればＭＳステレオ処理を実行しないと判定して、その判定結果を出力する。 Next, in the MS stereo determination step 103, the MS stereo determination means 3 executes MS stereo processing rather than encoding the X and LR signals calculated by the MS square amplitude sum calculation means 2 to obtain the previous MS signal. The thresholds T1 and T2 that can be determined that the compression efficiency increases when encoding is performed, and it is determined whether or not the MS stereo processing is executed. Here, T1 is a threshold when SUM_M is sufficiently larger than SUM_S, and T2 is a threshold when SUM_S is sufficiently larger than SUM_M. That is, if X ≧ T1 or X ≦ T2, MS stereo processing is executed, and if T2 <X <T1, it is determined not to execute MS stereo processing, and the determination result is output.

ＭＳステレオ判定手段３がＭＳステレオ処理を実行すると判定した場合は、ＭＳ信号選択ステップ１０４にて、信号処理手段４がＭＳ信号算出手段１で算出したＭＳ信号を選択して符号化処理を行うとともに、ＭＳステレオ実行フラグ設定ステップ１０５にて、信号処理手段４がＭＳステレオ実行フラグを設定する。 If the MS stereo determination means 3 determines that the MS stereo processing is to be executed, the MS signal selection step 104 selects the MS signal calculated by the MS signal calculation means 1 and performs the encoding process. In the MS stereo execution flag setting step 105, the signal processing means 4 sets the MS stereo execution flag.

また、ＭＳステレオ判定手段３がＭＳステレオ処理を実行しないと判定した場合は、ＬＲ信号選択ステップ１０６にて、信号処理手段４がＬＲ信号を選択して符号化処理を行う。 If the MS stereo determination means 3 determines not to execute the MS stereo processing, the signal processing means 4 selects the LR signal and performs the encoding process in the LR signal selection step 106.

続いて、ＭＳステレオ処理のデコーダについて説明する。図３は実施の形態１におけるオーディオ符号化装置のデコーダの構成を示す図である。図３において、６は入力信号を復号化してＬＲ信号もしくはＭＳ信号を出力すると共に、ＭＳステレオ実行フラグの有無を判別し、さらにエンコーダ側で付加された重み係数Ｇを検出する信号処理手段であり、７は信号処理手段６においてＭＳステレオ実行フラグを判別してエンコーダ側でＭＳステレオ処理が実施されたと判断した場合に、信号処理手段６からのＭＳ信号に所定の処理を実行してＬＲ信号を復元するデコード手段である。 Next, the MS stereo processing decoder will be described. FIG. 3 is a diagram showing the configuration of the decoder of the audio encoding device according to the first embodiment. In FIG. 3, 6 is a signal processing means for decoding an input signal and outputting an LR signal or an MS signal, determining the presence or absence of an MS stereo execution flag, and detecting a weighting factor G added on the encoder side. , 7 when the MS stereo execution flag is discriminated in the signal processing means 6 and it is determined that the MS stereo processing has been carried out on the encoder side, the MS signal from the signal processing means 6 is subjected to predetermined processing and the LR signal is converted. Decoding means to restore.

以上のように構成された復号化装置のデコーダの動作について以下図４のフロー図を用いて説明する。図４のＭＳステレオ判別ステップ１０７にて、信号処理手段６が、エンコーダ側でＭＳステレオ処理されたかどうかを示すフラグを確認する事で判別し判別結果を出力する。フラグを判別してエンコーダ側でＭＳステレオ処理が実行されていないと判断した場合は、入力信号をデコードする必要がないため、ＭＳステレオデコード処理を終了する。また、フラグを判別してエンコーダ側でＭＳステレオ処理を実行されていると判断した場合は、デコードステップ１０８にて、デコード手段７が、式１１に従い、信号処理手段６が検出した重み係数Ｇを用いてＭＳ信号をデコードして２チャンネルのＬＲ信号を復元する。 The operation of the decoder of the decoding apparatus configured as described above will be described below with reference to the flowchart of FIG. In the MS stereo discrimination step 107 in FIG. 4, the signal processing means 6 discriminates by checking a flag indicating whether or not MS stereo processing has been performed on the encoder side, and outputs a discrimination result. If it is determined that the MS stereo process has not been executed on the encoder side by determining the flag, the MS stereo decoding process is terminated because there is no need to decode the input signal. If it is determined that the MS stereo processing is being executed on the encoder side by determining the flag, the decoding means 7 uses the weighting coefficient G detected by the signal processing means 6 according to Equation 11 in the decoding step 108. Use to decode the MS signal to restore the 2-channel LR signal.

以上のように、本実施の形態によれば、Ｍ信号とＳ信号間の偏りを大きくするために、従来のＭＳ信号を算出する算出式に重み係数Ｇを導入し、意図的にＭＳ信号間の偏りを大きくすることにより、ＭＳステレオ処理を実行するように判断させて、ＭＳステレオ処理を実行する場合を増加させる事により、結果として符号化に要するビット量を削減する事ができるため、符号化効率を向上させる事が可能となる。また、符号化に要するビット量を削減する事によって他の符号化処理でより多くのビットを割り当てる事が可能となるために高音質化が可能となる。 As described above, according to the present embodiment, in order to increase the bias between the M signal and the S signal, the weighting factor G is introduced into the calculation formula for calculating the conventional MS signal, and the MS signal is intentionally inserted. By increasing the bias of, it is possible to reduce the bit amount required for encoding as a result by increasing the number of cases where MS stereo processing is executed by determining to execute MS stereo processing. It is possible to improve the efficiency. In addition, by reducing the amount of bits required for encoding, it is possible to allocate more bits in other encoding processes, so that high sound quality can be achieved.

なお、本実施の形態ではエンコーダへの入力信号はＬ信号、Ｒ信号と表現したが、２チャンネルの信号であれば、ＬＲ信号に限らない、また、それらは時間領域で表現される信号でも良く、また周波数領域で表現される信号でも良い。 In this embodiment, the input signal to the encoder is expressed as an L signal and an R signal. However, as long as the signal is a two-channel signal, the signal is not limited to the LR signal, and may be a signal expressed in the time domain. Also, a signal expressed in the frequency domain may be used.

本発明は、音楽信号を符号化する機能を含む機器、例えばＰＣ、携帯電話、ポータブルオーディオ機器、ミニコンポに有用である。将来、放送、通信、インターネットなどで符号化を施されたデータが流通する際にも有益である。 The present invention is useful for a device including a function for encoding a music signal, such as a PC, a mobile phone, a portable audio device, and a minicomponent. In the future, it is also useful when data encoded by broadcasting, communication, the Internet, etc. is distributed.

本発明の実施の形態１におけるオーディオ符号化装置の構成を示すブロック図1 is a block diagram showing the configuration of an audio encoding device according to Embodiment 1 of the present invention. 本発明の実施の形態１におけるオーディオ符号化装置の動作を示すフロー図FIG. 3 is a flowchart showing the operation of the audio encoding device according to the first embodiment of the present invention. 本発明の実施の形態１におけるオーディオ復号化装置の構成を示すブロック図1 is a block diagram showing a configuration of an audio decoding device according to Embodiment 1 of the present invention. 本発明の実施の形態１におけるオーディオ復号化装置の動作を示すフロー図FIG. 3 is a flowchart showing the operation of the audio decoding apparatus according to Embodiment 1 of the present invention. 従来のオーディオ符号化装置の構成を示すブロック図Block diagram showing the configuration of a conventional audio encoding device 従来のオーディオ符号化装置の動作を示すフロー図Flow diagram showing the operation of a conventional audio encoding device 従来のオーディオ復号化装置の構成を示すブロック図The block diagram which shows the structure of the conventional audio decoding apparatus. 従来のオーディオ復号化装置の動作を示すフロー図Flow diagram showing the operation of a conventional audio decoding device

符号の説明Explanation of symbols

１ＭＳ信号算出手段
２ＭＳ２乗振幅和算出手段
３ＭＳステレオ判定手段
４信号処理手段
５重み係数算出手段
６信号処理手段
７デコード手段
１００重み係数算出ステップ
１０１ＭＳ信号算出ステップ
１０２ＭＳ２乗振幅和算出ステップ
１０３ＭＳステレオ判定ステップ
１０４ＭＳ信号選択ステップ
１０５ＭＳステレオ実行フラグ設定ステップ
１０６ＬＲ信号選択ステップ
１０７ＭＳステレオ判別ステップ
１０８デコードステップ
ＬＬチャンネル信号
ＲＲチャンネル信号
ＭＭｉｄｄｌｅ信号
ＳＳｉｄｅ信号
Ｇ重み係数 DESCRIPTION OF SYMBOLS 1 MS signal calculation means 2 MS square amplitude sum calculation means 3 MS stereo determination means 4 Signal processing means 5 Weight coefficient calculation means 6 Signal processing means 7 Decoding means 100 Weight coefficient calculation step 101 MS signal calculation step 102 MS square amplitude sum calculation step 103 MS stereo determination step 104 MS signal selection step 105 MS stereo execution flag setting step 106 LR signal selection step 107 MS stereo discrimination step 108 decoding step L L channel signal R R channel signal M Middle signal S Side signal G Weight coefficient

Claims

２チャンネルのオーディオ信号から重み係数を算出する重み係数算出手段と、
前記重み係数算出手段で算出した前記重み係数を用いて前記オーディオ信号に対するＭｉｄｄｌｅ信号とＳｉｄｅ信号とを算出するＭＳ信号算出手段と、
前記ＭＳ信号算出手段で算出した前記Ｍｉｄｄｌｅ信号とＳｉｄｅ信号とを符号化する第１の信号処理手段とを備え、
前記重み係数算出手段は、係数を前記オーディオ信号に掛けたものから算出したＭｉｄｄｌｅ信号とＳｉｄｅ信号との偏りが最大となるような係数を重み係数として算出することを特徴とするオーディオ符号化装置。 Weighting factor calculating means for calculating a weighting factor from the two-channel audio signal;
MS signal calculating means for calculating a Middle signal and a Side signal for the audio signal using the weighting coefficient calculated by the weighting coefficient calculating means;
First signal processing means for encoding the Middle signal and Side signal calculated by the MS signal calculating means;
The audio coding apparatus according to claim 1, wherein the weighting coefficient calculating means calculates a weighting coefficient that maximizes a deviation between the middle signal and the side signal, which is calculated by multiplying the audio signal by the coefficient.

前記ＭＳ信号算出手段で算出した前記Ｍｉｄｄｌｅ信号とＳｉｄｅ信号とをもとに、ＭＳステレオ処理を実施するか否かを判定するＭＳステレオ判定手段を備えた事を特徴とする請求項１記載のオーディオ符号化装置。 2. The audio according to claim 1, further comprising MS stereo determination means for determining whether or not to perform MS stereo processing based on the Middle signal and Side signal calculated by the MS signal calculation means. Encoding device.

前記重み係数算出手段が、前記２チャンネルのオーディオ信号のそれぞれの信号の２乗を累積したものの比の平方根を前記重み係数として算出することを特徴とする請求項１記載のオーディオ符号化装置。 2. The audio encoding device according to claim 1, wherein the weighting factor calculating means calculates the square root of the ratio of the sum of the squares of the two-channel audio signals as the weighting factor.

前記オーディオ信号が時間領域で表現される信号である事を特徴とする請求項１記載のオーディオ符号化装置。 The audio encoding apparatus according to claim 1, wherein the audio signal is a signal expressed in a time domain.

前記オーディオ信号が周波数領域で表現される信号である事を特徴とする請求項１記載のオーディオ符号化装置。 2. The audio encoding device according to claim 1, wherein the audio signal is a signal expressed in a frequency domain.

請求項１記載のオーディオ符号化装置で符号化された信号からＭｉｄｄｌｅ信号とＳｉｄｅ信号とを復号する第２の信号処理手段と、
前記オーディオ符号化装置で算出された重み係数を用いて、前記第２の信号処理手段で算出したＭｉｄｄｌｅ信号とＳｉｄｅ信号とから２チャンネルのオーディオ信号を復元するデコード手段とを備えたことを特徴とするオーディオ復号化装置。 A second signal processing means for decoding a Middle signal and a Side signal from the signal encoded by the audio encoding device according to claim 1;
Decoding means for restoring a 2-channel audio signal from the Middle signal and Side signal calculated by the second signal processing means using the weighting coefficient calculated by the audio encoding device. An audio decoding device.

２チャンネルのオーディオ信号から重み係数を算出する重み係数算出ステップと、
前記重み係数算出ステップで算出した前記重み係数を用いて前記オーディオ信号に対するＭｉｄｄｌｅ信号とＳｉｄｅ信号とを算出するＭＳ信号算出ステップと、
前記ＭＳ信号算出ステップで算出した前記Ｍｉｄｄｌｅ信号とＳｉｄｅ信号とを符号化する第１の信号処理ステップとを有し、
前記重み係数算出ステップでは、係数を前記オーディオ信号に掛けたものから算出したＭｉｄｄｌｅ信号とＳｉｄｅ信号との偏りが最大となるような係数を重み係数として算出することを特徴とするオーディオ符号化方法。 A weighting factor calculating step for calculating a weighting factor from the two-channel audio signal;
An MS signal calculating step of calculating a Middle signal and a Side signal for the audio signal using the weighting factor calculated in the weighting factor calculating step;
A first signal processing step of encoding the Middle signal and Side signal calculated in the MS signal calculation step;
In the weighting coefficient calculation step, a coefficient that maximizes the bias between the Middle signal and the Side signal calculated from the coefficient multiplied to the audio signal is calculated as a weighting coefficient.

請求項７記載のオーディオ符号化方法で符号化された信号からＭｉｄｄｌｅ信号とＳｉｄｅ信号とを復号する第２の信号処理ステップと、
前記オーディオ符号化方法で算出された重み係数を用いて、前記第２の信号処理方法で算出したＭｉｄｄｌｅ信号とＳｉｄｅ信号とから２チャンネルのオーディオ信号を復元するデコードステップとを備えたことを特徴とするオーディオ復号化方法。 A second signal processing step of decoding a Middle signal and a Side signal from the signal encoded by the audio encoding method according to claim 7;
And a decoding step of restoring a two-channel audio signal from the middle signal and the side signal calculated by the second signal processing method using the weighting coefficient calculated by the audio encoding method. Audio decoding method.

請求項７または請求項８のいずれかに記載のオーディオ復号化方法の各ステップをコンピュータに実行させるためのプログラム。 The program for making a computer perform each step of the audio decoding method in any one of Claim 7 or Claim 8.

請求項９記載のプログラムを格納した記録媒体。 A recording medium storing the program according to claim 9.