JP2001154698A

JP2001154698A - Audio encoding device and its method

Info

Publication number: JP2001154698A
Application number: JP33746699A
Authority: JP
Inventors: Katsuyoshi Nishitani; 勝義西谷
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 1999-11-29
Filing date: 1999-11-29
Publication date: 2001-06-08

Abstract

PROBLEM TO BE SOLVED: To solve a problem that in processes of a quantization and an encod ing, it is necessary to execute comparison judgements of scale factors by fre quency equivalent to the number of repetitive controls in order to perform grouping for every quantization of a short block and a calculating and processing time of these judgements is required in a MPEG2(moving picture expert group 2)-AAC audio encoding system. SOLUTION: When short blocks are selected, grouping is previously judged from the maximum number of quantizing steps and the statical amount of scale factor bands in insides of respective short blocks, in a stage prior to performing quantization.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】ＭＰＥＧ２ＡＡＣオーディ
オ符号化方式におけるショートブロックのグルーピング
の前処理に係り、特にグルーピング処理時間が短縮可能
な符号化方式に関するものである。[0001] 1. Field of the Invention [0002] The present invention relates to preprocessing of grouping of short blocks in the MPEG2 AAC audio coding system, and more particularly to a coding system capable of shortening the grouping processing time.

【０００２】[0002]

【従来の技術】ＭＰＥＧ２ＡＡＣオーディオ符号化方
式では、入力時間信号を聴覚心理分析器１でＦＦＴを施
し、周波数スペクトルを求め、それを基にマスキングを
計算し、予め設定された周波数帯域毎の許容量子化雑音
電力を算出する。一方、ＭＤＣＴ器２では、入力時間信
号をＭＤＣＴ（ＭｏｄｉｆｉｅｄＤｉｓｃｒｅｔｅＣ
ｏｓｉｎｅＴｒａｎｓｆｏｒｍ）によりスペクトルデ
ータ（ＭＤＣＴ係数）に変換する。変換は演算ブロック
長を５０％ずつオーバーラップして実行し、例えば、２
０４８サンプルを１０２４本のＭＤＣＴ係数に変換す
る。2. Description of the Related Art In the MPEG2 AAC audio coding system, an input time signal is subjected to FFT by an psychoacoustic analyzer 1 to obtain a frequency spectrum, and a masking is calculated based on the frequency spectrum. Calculate the quantization noise power. On the other hand, the MDCT unit 2 converts the input time signal into an MDCT (Modified Discrete C
The image data is converted into spectral data (MDCT coefficients) by an Osine Transform. The conversion is performed by overlapping the operation block length by 50%.
The 048 samples are converted into 1024 MDCT coefficients.

【０００３】また、ＭＰＥＧ２ＡＡＣオーディオ符号
化方式は、入力信号の特性に応じてＭＤＣＴの演算ブロ
ック長を、長い演算ブロック（ロングブロック）か短い
演算ブロック（ショートブロック）かに切り替えるブロ
ックスイッチングという機能を採用している。一般には
長い演算ブロック長を用いる方がスペクトルの集中度が
高まるので効率的なビット配分を行なえるが、周波数領
域での量子化雑音は、時間領域に戻された時に演算ブロ
ック長全体に広がるため、静寂部の後で急峻な立ち上が
り（アタック部）を有するような波形を長い演算ブロッ
ク長で変換して量子化すると、その量子化雑音が静寂部
まで広がり、極めて耳障りなものとなる。Also, the MPEG2 AAC audio encoding system has a function called block switching for switching the MDCT operation block length between a long operation block (long block) and a short operation block (short block) according to the characteristics of an input signal. Has adopted. In general, using a long operation block length increases the degree of spectral concentration, so efficient bit allocation can be performed.However, quantization noise in the frequency domain spreads over the entire operation block length when returned to the time domain. When a waveform having a steep rising edge (attack portion) after a silence portion is converted and quantized with a long operation block length, the quantization noise spreads to the silence portion and becomes extremely annoying.

【０００４】これを抑圧するために、アタック部の前後
では通常時の長い演算ブロックから複数個の短い演算ブ
ロックに切り替える。これをブロックスイッチングと呼
び、聴覚心理分析器１は入力信号の特性に応じて、フレ
ーム毎にロング・ショートのいづれかのブロックとして
切り替えるように、ブロック情報をＭＤＣＴ器２に伝送
し、ＭＤＣＴ器２はその情報に基づいて演算ブロック長
を選択する。In order to suppress this, before and after the attack section, the normal long operation block is switched to a plurality of short operation blocks. This is called block switching, and the psychoacoustic analyzer 1 transmits block information to the MDCT unit 2 so as to switch as either a long or short block for each frame according to the characteristics of the input signal. The operation block length is selected based on the information.

【０００５】定常的な信号の場合、ＭＤＣＴの演算長を
２０４８サンプルのロングのブロックとし１０２４本の
ＭＤＣＴ係数に変換する。一方、過渡的な信号の場合に
は２５６サンプルのショートのブロックとし１２８本の
ＭＤＣＴ係数に変換する。ショートブロックでは８個連
続で短い変換長を用いて変換することにより、出力のＭ
ＤＣＴ係数の本数を１０２４本とし、ロングブロックと
一致するようにしている。In the case of a stationary signal, the operation length of the MDCT is set to a long block of 2048 samples and converted into 1024 MDCT coefficients. On the other hand, in the case of a transient signal, the block is a short block of 256 samples and converted into 128 MDCT coefficients. In the short block, the output M
The number of DCT coefficients is set to 1024 so as to match the long block.

【０００６】量子化器３では、人間の聴覚特性を基にし
た周波数帯域毎に１０２４本のＭＤＣＴ係数を複数のス
ケールファクタバンド（ｓｆｂ）に分け、スケールファ
クタバンド毎に正規化を行ない、スケールファクタと正
規化されたスペクトルデータを一単位として量子化を行
なう。ショートブロックの場合は１２８本のＭＤＣＴ係
数を複数のスケールファクタバンド（ｓｆｂ）に分け
る。The quantizer 3 divides 1024 MDCT coefficients into a plurality of scale factor bands (sfb) for each frequency band based on human auditory characteristics, normalizes each of the scale factor bands, and Quantization is performed using the normalized spectral data as one unit. In the case of a short block, 128 MDCT coefficients are divided into a plurality of scale factor bands (sfb).

【０００７】各スケールファクタバンド（ｓｆｂ）で計
算された量子化雑音が聴覚心理分析器１で算出された許
容量子化雑音電力よりも大きくならないように各スケー
ルファクタバンド（ｓｆｂ）の量子化ステップ数（スケ
ールファクタ）を制御し、且つ量子化に必要なビット数
が使用可能なビット数以内に収まるように全体の量子化
ステップ数を制御して、量子化を実行する。[0007] The number of quantization steps of each scale factor band (sfb) so that the quantization noise calculated in each scale factor band (sfb) does not become larger than the allowable quantization noise power calculated by the psychoacoustic analyzer 1. The quantization is controlled by controlling the (scale factor) and controlling the total number of quantization steps so that the number of bits necessary for quantization falls within the number of usable bits.

【０００８】更にショートブロックの場合、より高い符
号化効率が得られるグルーピングという機能を量子化器
３で量子化を行なった信号に対して適用して、グルーピ
ング処理を行なっている。Further, in the case of a short block, the grouping process is performed by applying a function of grouping to obtain a higher encoding efficiency to a signal quantized by the quantizer 3.

【０００９】図３乃至図５にグルーピングの一例を示
す。図４にグルーピング前のスケールファクタバンド
（ｓｆｂ）のオーダーの一例を、図５にグルーピング後
のスケールファクタバンドのオーダーの一例を示す。隣
接したショートブロックの相応するスケールファクタバ
ンド（ｓｆｂ）のスケールファクタが全て同じ場合、こ
れらショートブロックを１つのショートブロックと見な
し、スケールファクタを共有するように各ブロック内の
スケールファクタバンド（ｓｆｂ）をインターリーブし
て並替えを行ない、グループ化する（図５）。これによ
り、スケールファクタ分の符号ビットが削減可能とな
る。FIGS. 3 to 5 show examples of grouping. FIG. 4 shows an example of the order of the scale factor band (sfb) before grouping, and FIG. 5 shows an example of the order of the scale factor band after grouping. If the scale factors of the corresponding scale factor bands (sfb) of adjacent short blocks are all the same, these short blocks are regarded as one short block, and the scale factor bands (sfb) in each block are shared so as to share the scale factor. Interleaving, sorting, and grouping (FIG. 5). As a result, the number of code bits corresponding to the scale factor can be reduced.

【００１０】可変符号化器４では、量子化器３で出力さ
れるＭＤＣＴ係数の量子化値とスケールファクタ等の符
号化パラメータに可変長符号化を施し、冗長度を削減す
る。[0010] The variable encoder 4 performs variable length encoding on the quantized value of the MDCT coefficient and the encoding parameter such as the scale factor output from the quantizer 3 to reduce the redundancy.

【００１１】ビット数判定器５では、符号化されたビッ
ト数が使用可能なビット数以内に収まっているかを判定
し、使用可能なビット数を越えている場合は再度量子化
を行ない、生成されるビット数が使用可能なビット数を
下回るまで繰り返される。The bit number determiner 5 determines whether or not the number of encoded bits is within the number of usable bits. If the number of bits exceeds the number of usable bits, the quantization is performed again to generate the number of bits. Until the number of available bits falls below the number of available bits.

【００１２】使用可能ビット数を満足して出力された符
号化データは、ビットストリーム生成器６において、ブ
ロック情報等の符号化パラメータと共に多重化され、ビ
ットストリームとして伝送される。The encoded data output satisfying the number of usable bits is multiplexed together with encoding parameters such as block information in a bit stream generator 6 and transmitted as a bit stream.

【００１３】[0013]

【発明が解決しようとする課題】上記のようなＭＰＥＧ
２ＡＡＣオーディオ符号化方式において、量子化・符
号化の工程は、符号化後のビット数が使用可能なビット
数以内に収まるように判定しながら反復制御を行なうた
め、ショートブロックを選択した場合、量子化の度にグ
ルーピング処理を行なう必要があった。SUMMARY OF THE INVENTION MPEG as described above
2 In the AAC audio encoding method, the quantization / encoding process performs repetitive control while determining that the number of bits after encoding falls within the number of usable bits. It was necessary to perform a grouping process each time quantization was performed.

【００１４】上記符号化ビット数の反復制御では、量子
化毎に各スケールファクタバンド（ｓｆｂ）のスケール
ファクタ値が変化するので、その都度グルーピングを行
なうために、隣接したショートブロックの相応するスケ
ールファクタバンド（ｓｆｂ）のスケールファクタの比
較判定を、反復制御の回数分実行する必要があり、その
判定の計算処理時間がかなりかかってしまうことが問題
となっていた。In the repetitive control of the number of coded bits, the scale factor value of each scale factor band (sfb) changes for each quantization. Therefore, in order to perform grouping each time, a corresponding scale factor of an adjacent short block is used. It is necessary to execute the comparison determination of the scale factor of the band (sfb) by the number of times of the repetitive control, and there has been a problem that the calculation processing time of the determination takes a considerable time.

【００１５】[0015]

【課題を解決するための手段】そこで上記問題点を解決
するために、請求項１の発明は、オーディオ信号が供給
されて符号化が行ななわれるオーディオ符号化装置にお
いて、前記オーディオ信号が供給されて、人間の聴覚特
性に基づいた聴覚心理パラメータを算出する聴覚心理分
析手段１と、前記オーディオ信号を周波数スペクトルに
変換するＭＤＣＴ変換手段２と、前記ＭＤＣＴ変換手段
にて変換された前記周波数スペクトルを、前記聴覚心理
分析手段から伝送される前記聴覚心理パラメータにより
ロングブロック、ショートブロックのいずれかに区分
し、前記ショートブロックの場合はグルーピングのため
の隣接ブロックの判定を行ない、前記ロングブロックの
場合には隣接ブロックの判定は行なわないグループ判定
手段７と、前記グループ判定手段７の出力信号が供給さ
れて、前記周波数スペクトルを前記聴覚心理分析手段に
て算出された前記聴覚心理パラメータの許容量子化雑音
電力よりも前記周波数スペクトルから計算される量子化
雑音が大きくならないように量子化ステップ数を制御し
て量子化を行なう量子化手段３と、前記量子化手段より
の量子化値を可変長符号化する可変長符号化手段４と、
前記可変長符号化手段で符号されたビット数が使用可能
なビット数に収まっているかどうかを判定するビット数
判定手段５と、前記ビット数判定手段の出力と聴覚心理
分析手段の出力とが供給され符号化データをまとめてビ
ットストリームを生成するビットストリーム生成手６段
とから構成したことを特徴とするオーディオ符号化装置
を提供し、請求項２の発明は、請求項１に記載されたオ
ーディオ符号化装置において、前記グループ判定手段
は、ショートブロックが選択された場合、前記ショート
ブロック毎に内部を複数のスケールファクタバンドに分
け、前記スケールファクタバンドの最大量子化ステップ
数、及び統計量を計算し、隣接するショートブロックと
前記最大量子化ステップ数、及び統計量とを比較してグ
ルーピングの判定を行なうことを特徴としたオーディオ
符号化装置を提供し、請求項３の発明は、オーディオ信
号が供給されて符号化が行なわれるオーディオ符号化方
法において、前記オーディオ信号が供給されて、人間の
聴覚特性に基づいた聴覚心理パラメータを算出する聴覚
心理分析ステップ１と、前記オーディオ信号を周波数ス
ペクトルに変換するＭＤＣＴ変換ステップ２と、前記Ｍ
ＤＣＴ変換ステップにて変換された前記周波数スペクト
ルを前記聴覚心理分析ステップから伝送される前記聴覚
心理パラメータによりロングブロック、ショートブロッ
クのいずれかに区分し、前記ショートブロックの場合は
グルーピングのための隣接ブロックの判定を行ない、前
記ロングブロックの場合には隣接ブロックの判定は行な
わないグループ判定ステップ７と、前記グループ判定ス
テップ７の出力信号が供給されて、前記周波数スペクト
ルを前記聴覚心理分析ステップにて算出された前記聴覚
心理パラメータの許容量子化雑音電力よりも前記周波数
スペクトルから計算される量子化雑音が大きくならない
ように量子化ステップ数を制御して量子化を行なう量子
化ステップ３と、前記量子化ステップよりの量子化値を
可変長符号化する可変長符号化ステップ４と、前記可変
長符号化ステップで符号されたビット数が使用可能なビ
ット数に収まっているかどうかを判定するビット数判定
ステップ５と、前記ビット数判定ステップの出力と聴覚
心理分析ステップの出力とが供給され符号化データをま
とめてビットストリームを生成するビットストリーム生
成ステップ６とから構成したことを特徴とするオーディ
オ符号化方法を提供し、請求項４の発明は、請求項３に
記載されたオーディオ符号化方法において、前記グルー
プ判定ステップは、ショートブロックが選択された場
合、前記ショートブロック毎に内部を複数のスケールフ
ァクタバンドに分け、前記スケールファクタバンドの最
大量子化ステップ数、及び統計量を計算し、隣接するシ
ョートブロックと前記最大量子化ステップ数、及び統計
量とを比較してグルーピングの判定を行なうことを特徴
としたオーディオ符号化方法を提供するものである。SUMMARY OF THE INVENTION In order to solve the above-mentioned problems, a first aspect of the present invention is an audio encoding apparatus in which an audio signal is supplied and encoding is performed. Psychoacoustic analysis means 1 for calculating psychoacoustic parameters based on human auditory characteristics, MDCT conversion means 2 for converting the audio signal into a frequency spectrum, and the frequency spectrum converted by the MDCT conversion means Is divided into a long block and a short block according to the psychoacoustic parameter transmitted from the psychoacoustic analysis means, and in the case of the short block, an adjacent block for grouping is determined. Group determining means 7 which does not determine adjacent blocks; The output signal of the step determination means 7 is supplied, and the quantization noise calculated from the frequency spectrum is larger than the permissible quantization noise power of the psychoacoustic parameter calculated by the psychoacoustic analysis means. A quantizing means 3 for performing quantization by controlling the number of quantization steps so as not to make a difference, a variable-length coding means 4 for performing variable-length coding on a quantization value from the quantizing means,
A bit number determination unit 5 for determining whether the number of bits encoded by the variable length encoding unit is within the number of usable bits, and an output of the bit number determination unit and an output of the psychoacoustic analysis unit are supplied. And a bit stream generating means for generating a bit stream by combining the encoded data and generating a bit stream. The invention according to claim 2 provides the audio encoding apparatus according to claim 1. In the encoding device, when a short block is selected, the group determination unit divides the inside into a plurality of scale factor bands for each short block, and calculates a maximum quantization step number and a statistic of the scale factor band. Then, the adjacent short block is compared with the maximum number of quantization steps and the statistic to determine the grouping. A third aspect of the present invention is an audio encoding method in which an audio signal is supplied and encoding is performed, wherein the audio signal is supplied and a human auditory sense is supplied. Psychoacoustic analysis step 1 for calculating psychoacoustic parameters based on characteristics, MDCT conversion step 2 for converting the audio signal into a frequency spectrum, and M
The frequency spectrum transformed in the DCT transformation step is divided into a long block or a short block according to the psychoacoustic parameter transmitted from the psychoacoustic analysis step, and in the case of the short block, an adjacent block for grouping. And the output signal of the group determination step 7 is supplied, and the frequency spectrum is calculated in the psychoacoustic analysis step. A quantization step 3 for performing quantization by controlling the number of quantization steps so that the quantization noise calculated from the frequency spectrum does not become larger than the permissible quantization noise power of the psychoacoustic parameter obtained; Variable length coding of the quantization value from the step A variable length encoding step 4, a bit number determining step 5 for determining whether or not the number of bits encoded in the variable length encoding step is within the usable bit number; And a bit stream generating step of generating a bit stream by combining the output of the psychological analysis step and the coded data to provide a bit stream. Item 3. In the audio coding method according to Item 3, when the short block is selected, the group is divided into a plurality of scale factor bands for each short block, and the maximum quantization step of the scale factor band is performed. Calculate the number and statistics of the adjacent short blocks and the maximum quantization step. Number flops, and by comparing the statistic is to provide an audio encoding method and performs determination of the grouping.

【００１６】[0016]

【発明の実施の形態】本発明のオーディオ符号化装置の
一実施例について、以下に図と共に説明する。図１は、
本発明のオーディオ符号化装置の一実施例を示すブロッ
ク図である。本実施例は、聴覚心理分析器１、ＭＤＣＴ
器２、、可変長符号器４、ビット数判定器５、ビットス
トリーム生成器６、及びグループ判定器７より構成され
ている。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of an audio encoding apparatus according to the present invention will be described below with reference to the drawings. FIG.
FIG. 1 is a block diagram illustrating an embodiment of an audio encoding device according to the present invention. In this embodiment, the psychoacoustic analyzer 1, MDCT
1, a variable length encoder 4, a bit number determiner 5, a bit stream generator 6, and a group determiner 7.

【００１７】図１に示すオーディオ符号化装置において
は、図２に示した従来の符号化装置と比較すると分かる
ように、グループ判定器７が、ＭＤＣＴ器２と量子化器
３の間に設けられている。グループ判定器７では、聴覚
心理分析器１から伝送されるブロック情報に基づき、ロ
ングブロックの場合は何の処理も行なわず、ショートブ
ロックの場合にのみ処理を行なう。In the audio encoding apparatus shown in FIG. 1, a group judging unit 7 is provided between the MDCT unit 2 and the quantizing unit 3, as can be seen from the comparison with the conventional encoding unit shown in FIG. ing. The group determiner 7 does not perform any processing for long blocks, but performs processing only for short blocks based on the block information transmitted from the psychoacoustic analyzer 1.

【００１８】グループ判定器７は、ＭＤＣＴ器２から出
力される８個のショートブロックの１２８本のＭＤＣＴ
係数を複数のスケールファクタバンド（ｓｆｂ）に分け
る。量子化はスケールファクタバンド単位で処理される
ため、スケールファクタバンド（ｓｆｂ）の量子化ステ
ップ数はバンド内部のＭＤＣＴ係数の最大絶対値に依存
する。よって、ショートブロック内部の各スケールファ
クタバンド（ｓｆｂ）のＭＤＣＴ係数の最大絶対値を検
出して、次式（数１）から最大量子化ステップ数を算出
する。The group decision unit 7 has 128 MDCTs of eight short blocks output from the MDCT unit 2.
Divide the coefficients into multiple scale factor bands (sfb). Since the quantization is performed in units of the scale factor band, the number of quantization steps of the scale factor band (sfb) depends on the maximum absolute value of the MDCT coefficient inside the band. Therefore, the maximum absolute value of the MDCT coefficient of each scale factor band (sfb) inside the short block is detected, and the maximum quantization step number is calculated from the following equation (Equation 1).

【００１９】[0019]

【数１】 (Equation 1)

【００２０】ここで、b はブロック番号(b=0,1,...7)、
sfb は b 番目のブロックのスケールファクタバンドの
番号、x は sfb 番目のスケールファクタバンド内部で
の最大絶対値となるＭＤＣＴ係数を表わすものとする。Here, b is a block number (b = 0, 1,... 7),
Let sfb be the number of the scale factor band of the b-th block, and x represent the MDCT coefficient that is the maximum absolute value inside the sfb-th scale factor band.

【００２１】量子化値は最大で８１９１の値を取るの
で、前記の式（１）から各スケールファクタバンドの最
大量子化ステップ数が算出される。Since the quantization value takes a maximum value of 8191, the maximum number of quantization steps for each scale factor band is calculated from the above equation (1).

【００２２】各ショートブロック内部のスケールファク
タバンド（ｓｆｂ）の最大量子化ステップ数を算出し、
その次に隣接するショートブロックの対応するスケール
ファクタバンドの最大量子化ステップ数の絶対差分値の
総和 δij を、つぎの（数２）に従って算出する。Calculating the maximum number of quantization steps of the scale factor band (sfb) inside each short block,
The total sum δij of the absolute difference values of the maximum quantization step numbers of the corresponding scale factor band of the next adjacent short block is calculated according to the following (Equation 2).

【００２３】[0023]

【数２】 (Equation 2)

【００２４】ここで、i は i 番目のショートブロッ
ク、j は j=i+1 番目のショートブロック、k はスケー
ルファクタバンドの番号、N はショートブロック内部の
スケールファクタバンド（ｓｆｂ）の総数を表わすもの
とする。Where i is the ith short block, j is the j = i + 1th short block, k is the scale factor band number, and N is the total number of scale factor bands (sfb) inside the short block. Shall be.

【００２５】総和 δij の値が小さい程、隣接するショ
ートブロック間のスケールファクタバンド（ｓｆｂ）の
最大量子化ステップ数の差がなくなり、量子化において
も、隣接するショートブロックが同じ量子化ステップ数
で量子化される確率が高くなる。そこで、総和 δij が
定数 Δ に対して、δij ≦ Δ という条件を満たして
いれば、i 番目のショートブロックとつぎのj 番目の
ショートブロックは同一グループとして判定する。The smaller the value of the sum δij, the smaller the difference between the maximum number of quantization steps of the scale factor band (sfb) between adjacent short blocks, and the adjacent short blocks have the same number of quantization steps even in quantization. The probability of being quantized increases. Therefore, if the sum δij satisfies the condition Δij ≦ Δ with respect to the constant Δ, the i-th short block and the next j-th short block are determined as the same group.

【００２６】更に、グループ判定の条件を厳密にするた
め、スケールファクタバンド内部で算出される統計量を
用いる。最大量子化ステップ数が同じであっても、スケ
ールファクタバンド内部のＭＤＣＴ係数の形状が異なる
と、グルーピングを行なっても符号化効率が上がらない
ので、統計量に基づいた判定を加えることにより、判定
の精度の向上を期している。Further, in order to make the conditions for group determination strict, a statistic calculated inside the scale factor band is used. Even if the maximum quantization step number is the same, if the shape of the MDCT coefficients inside the scale factor band is different, the coding efficiency will not increase even if grouping is performed. It is expected to improve the accuracy of.

【００２７】ここでは一実施例をとして、各スケールフ
ァクタバンドのＭＤＣＴ係数が最大量子化値（８１９
１）に量子化される場合の量子化ステップ数を計算し、
その総和値を σ とすると、σ は次式（数３）で表わ
される。Here, as an example, the MDCT coefficient of each scale factor band is set to the maximum quantization value (819).
Calculate the number of quantization steps when quantizing to 1),
Assuming that the sum is σ, σ is represented by the following equation (Equation 3).

【００２８】[0028]

【数３】 (Equation 3)

【００２９】ここで、b はブロック番号(b=0,1,...7)、
sfb は b 番目のブロックのスケールファクタバンドの
番号、xk は sfb 番目のスケールファクタバンド内部の
k番目のＭＤＣＴ係数、M はスケールファクタバンド内
部のＭＤＣＴ係数の総数を表わすものとする。Here, b is a block number (b = 0, 1,... 7),
sfb is the scale factor band number of the b-th block, and xk is the scale factor band inside the sfb-th scale factor band
Let the k-th MDCT coefficient, M, represent the total number of MDCT coefficients inside the scale factor band.

【００３０】前記の絶対差分値の総和 δij と同様にし
て、隣接するショートブロックの対応するスケールファ
クタバンドの量子化ステップ数の総和値の絶対差分値の
総和σij を次式（数４）で算出する。Similarly to the sum δij of the absolute difference values, the sum σij of the absolute difference values of the sum of the quantization step numbers of the corresponding scale factor bands of the adjacent short blocks is calculated by the following equation (Equation 4). I do.

【００３１】[0031]

【数４】 (Equation 4)

【００３２】前記総和 δij と同様に、総和 σij が定
数 Δσに対して、σij ≦ Δσ という条件を満たして
いれば、i 番目のショートブロックとつぎの j 番目の
ショートブロックは同一グループとして判定する。Similarly to the sum δij, if the sum σij satisfies the condition of σij ≦ Δσ with respect to the constant Δσ, the ith short block and the next j-th short block are determined to be the same group.

【００３３】なお、定数 Δ 及び Δσ は予め固定値を
設定してもよいし、入力信号の特性に応じてそれらの値
を可変にしてもよい。The constants Δ and Δσ may be set to fixed values in advance, or their values may be varied according to the characteristics of the input signal.

【００３４】以上のようにして、同一グループとして判
定されたショートブロックは、各ブロック内のスケール
ファクタバンドをインターリーブして並替えを行ない、
グループ化を施し（図５）、量子化器３に出力する。As described above, the short blocks determined as the same group are rearranged by interleaving the scale factor bands in each block,
Grouping is performed (FIG. 5), and the result is output to the quantizer 3.

【００３５】量子化器３では、ロングブロックの場合、
グループ判定器７では何の処理も行なわず、ＭＤＣＴ器
２から出力されるＭＤＣＴ係数がそのまま入力される。
ショートブロックの場合、グループ判定器７にてグルー
ピング処理により，インターリーブして並べ替えが行な
われたＭＤＣＴ係数が入力される。In the quantizer 3, in the case of a long block,
The group determination unit 7 does not perform any processing, and the MDCT coefficient output from the MDCT unit 2 is directly input.
In the case of a short block, MDCT coefficients that have been interleaved and rearranged by grouping processing in the group determiner 7 are input.

【００３６】量子化器３は、従来例と同様にして、許容
量子化雑音電力よりも大きくならないように各スケール
ファクタバンドの量子化ステップ数（スケールファク
タ）を制御し、且つ量子化に必要なビット数が使用可能
なビット数以内に収まるように全体の量子化ステップ数
を制御して、量子化を実行する。The quantizer 3 controls the number of quantization steps (scale factor) of each scale factor band so as not to exceed the allowable quantization noise power, as in the conventional example, and is necessary for quantization. The quantization is executed by controlling the total number of quantization steps so that the number of bits falls within the number of usable bits.

【００３７】可変符号化器４では、量子化器３で出力さ
れるＭＤＣＴ係数の量子化値とスケールファクタ等の符
号化パラメータに可変長符号化を施し、冗長度を削減す
る。The variable encoder 4 performs variable length encoding on the quantized values of the MDCT coefficients output from the quantizer 3 and on encoding parameters such as scale factors to reduce redundancy.

【００３８】ビット数判定器５では、符号化されたビッ
ト数が使用可能なビット数以内に収まっているかどうか
を判定し、使用可能なビット数を越えている場合には再
度量子化を行ない、生成されるビット数が使用可能なビ
ット数を下回るまで繰り返される。The bit number determiner 5 determines whether or not the number of coded bits falls within the number of usable bits. If the number of bits exceeds the number of usable bits, quantization is performed again. Repeat until the number of bits generated is less than the number of available bits.

【００３９】使用可能ビット数を満足して出力された符
号化データは、ビットストリーム生成器６において、ブ
ロック情報等の符号化パラメータと共に多重化され、ビ
ットストリームとして伝送される。The encoded data output satisfying the number of usable bits is multiplexed in the bit stream generator 6 together with encoding parameters such as block information and transmitted as a bit stream.

【００４０】[0040]

【発明の効果】以上のように、本発明のオーディオ符号
化装置によれば、ショートブロックが選択された場合
に、量子化する以前の段階で予め各ショートブロック内
部のスケールファクタバンドの最大量子化ステップ数及
び統計量からグルーピングを判定することで、グルーピ
ング処理にかかる時間をかなり短縮化出来た。As described above, according to the audio encoding apparatus of the present invention, when a short block is selected, the maximum quantization of the scale factor band inside each short block is performed in advance before quantization. By determining the grouping based on the number of steps and the statistic, the time required for the grouping process could be considerably reduced.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明のオーディオ符号化装置の一実施例のブ
ロック構成を示した図である。FIG. 1 is a diagram showing a block configuration of an embodiment of an audio encoding device according to the present invention.

【図２】従来のオーディオ符号化装置の一例のブロック
構成を示した図である。FIG. 2 is a diagram showing a block configuration of an example of a conventional audio encoding device.

【図３】ショートブロックのグルーピングの一実施例を
示すブロック図である。FIG. 3 is a block diagram showing one embodiment of grouping of short blocks.

【図４】グルーピング前のスケールファクタバンドのオ
ーダーの一実施例を示すブロック図である。FIG. 4 is a block diagram showing one embodiment of an order of a scale factor band before grouping.

【図５】グルーピング後のスケールファクタバンドのオ
ーダーの一実施例を示すブロック図である。FIG. 5 is a block diagram showing an embodiment of the order of the scale factor band after grouping.

【符号の説明】[Explanation of symbols]

１聴覚心理分析器２ＭＤＣＴ器３量子化器４可変長符号器５ビット数判定器６ビットストリーム生成器７グループ判定器 block0, block1, block2,… ショートブロック group0, group1, group2, … スケールファクタが略同
じスケールファクタバンド幾つかを集めたもの（グルー
プ） sfb0 ,sfb1, sfb2,… スケールファクタバンドDESCRIPTION OF SYMBOLS 1 Psychoacoustic analyzer 2 MDCT unit 3 Quantizer 4 Variable length encoder 5 Bit number judging unit 6 Bit stream generator 7 Group judging unit block0, block1, block2, ... Short block group0, group1, group2, ... Scale factor is A collection of almost the same scale factor bands (group) sfb0, sfb1, sfb2, ... scale factor bands

Claims

【特許請求の範囲】[Claims]

【請求項１】オーディオ信号が供給されて符号化が行な
なわれるオーディオ符号化装置において、前記オーディオ信号が供給されて、人間の聴覚特性に基
づいた聴覚心理パラメータを算出する聴覚心理分析手段
と、前記オーディオ信号を周波数スペクトルに変換するＭＤ
ＣＴ変換手段と、前記ＭＤＣＴ変換手段にて変換された前記周波数スペク
トルを、前記聴覚心理分析手段から伝送される前記聴覚
心理パラメータによりロングブロック、ショートブロッ
クのいずれかに区分し、前記ショートブロックの場合は
グルーピングのための隣接ブロックの判定を行ない、前
記ロングブロックの場合には隣接ブロックの判定は行な
わないグループ判定手段と、前記グループ判定手段の出力信号が供給されて、前記周
波数スペクトルを前記聴覚心理分析手段にて算出された
前記聴覚心理パラメータの許容量子化雑音電力よりも前
記周波数スペクトルから計算される量子化雑音が大きく
ならないように量子化ステップ数を制御して量子化を行
なう量子化手段と、前記量子化手段よりの量子化値を可変長符号化する可変
長符号化手段と、前記可変長符号化手段で符号されたビット数が使用可能
なビット数に収まっているかどうかを判定するビット数
判定手段と、前記ビット数判定手段の出力と聴覚心理分析手段の出力
とが供給され符号化データをまとめてビットストリーム
を生成するビットストリーム生成手段とから構成したこ
とを特徴とするオーディオ符号化装置。1. An audio encoding apparatus to which an audio signal is supplied and encoding is performed, wherein the audio signal is supplied and a psychoacoustic analysis means for calculating a psychoacoustic parameter based on human auditory characteristics. MD for converting the audio signal into a frequency spectrum
CT conversion means, The frequency spectrum converted by the MDCT conversion means, is divided into either a long block or a short block according to the psychoacoustic parameters transmitted from the psychoacoustic analysis means, in the case of the short block Performs a determination of an adjacent block for grouping, and does not determine an adjacent block in the case of the long block. An output signal of the group determination unit is supplied, and the frequency spectrum is converted to the psychoacoustic. Quantization means for performing quantization by controlling the number of quantization steps so that the quantization noise calculated from the frequency spectrum is not larger than the permissible quantization noise power of the psychoacoustic parameter calculated by the analysis means. A variable-length code for performing variable-length encoding on a quantization value from the quantization means. Bit number determining means for determining whether or not the number of bits coded by the variable length coding means is within the number of usable bits; output of the bit number determining means and output of psychoacoustic analysis means And a bit stream generating means for generating a bit stream by combining the coded data and the encoded data.

【請求項２】請求項１に記載されたオーディオ符号化装
置において、前記グループ判定手段は、ショートブロックが選択された場合、前記ショートブロ
ック毎に内部を複数のスケールファクタバンドに分け、前記スケールファクタバンドの最大量子化ステップ数、
及び統計量を計算し、隣接するショートブロックと前記最大量子化ステップ
数、及び統計量とを比較してグルーピングの判定を行な
うことを特徴としたオーディオ符号化装置。2. The audio encoding apparatus according to claim 1, wherein, when a short block is selected, the group determining unit divides the inside into a plurality of scale factor bands for each of the short blocks, The maximum number of quantization steps for the band,
An audio encoding apparatus comprising: calculating an adjacent short block; a maximum number of quantization steps; and a statistic, and determining a grouping.

【請求項３】オーディオ信号が供給されて符号化が行な
われるオーディオ符号化方法において、前記オーディオ信号が供給されて、人間の聴覚特性に基
づいた聴覚心理パラメータを算出する聴覚心理分析ステ
ップと、前記オーディオ信号を周波数スペクトルに変換するＭＤ
ＣＴ変換ステップと、前記ＭＤＣＴ変換ステップにて変換された前記周波数ス
ペクトルを前記聴覚心理分析ステップから伝送される前
記聴覚心理パラメータによりロングブロック、ショート
ブロックのいずれかに区分し、前記ショートブロックの
場合はグルーピングのための隣接ブロックの判定を行な
い、前記ロングブロックの場合には隣接ブロックの判定
は行なわないグループ判定ステップと、前記グループ判定ステップの出力信号が供給されて、前
記周波数スペクトルを前記聴覚心理分析ステップにて算
出された前記聴覚心理パラメータの許容量子化雑音電力
よりも前記周波数スペクトルから計算される量子化雑音
が大きくならないように量子化ステップ数を制御して量
子化を行なう量子化ステップと、前記量子化ステップよりの量子化値を可変長符号化する
可変長符号化ステップと、前記可変長符号化ステップで符号されたビット数が使用
可能なビット数に収まっているかどうかを判定するビッ
ト数判定ステップと、前記ビット数判定ステップの出力と聴覚心理分析ステッ
プの出力とが供給され符号化データをまとめてビットス
トリームを生成するビットストリーム生成ステップとか
ら構成したことを特徴とするオーディオ符号化方法。3. An audio encoding method in which an audio signal is supplied to perform encoding, wherein the audio signal is supplied, and a psychoacoustic analysis step is performed to calculate a psychoacoustic parameter based on a human auditory characteristic. MD to convert audio signal to frequency spectrum
CT transforming step, the frequency spectrum converted in the MDCT transforming step is divided into a long block or a short block according to the psychoacoustic parameter transmitted from the psychoacoustic analysis step, and in the case of the short block, A group determination step of determining an adjacent block for grouping and not determining an adjacent block in the case of the long block; and an output signal of the group determination step is supplied to analyze the frequency spectrum by the psychoacoustic analysis. A quantization step of performing quantization by controlling the number of quantization steps so that quantization noise calculated from the frequency spectrum is not greater than the permissible quantization noise power of the psychoacoustic parameter calculated in step, Quantum from the quantization step A variable length encoding step of performing variable length encoding of the value; a bit number determining step of determining whether or not the number of bits encoded in the variable length encoding step is within a usable bit number; and the bit number determination. An audio encoding method, comprising the steps of: providing a step output and an output of an psychoacoustic analysis step; and supplying a coded data to generate a bit stream by combining the coded data.

【請求項４】請求項３に記載されたオーディオ符号化方
法において、前記グループ判定ステップは、ショートブロックが選択された場合、前記ショートブロ
ック毎に内部を複数のスケールファクタバンドに分け、前記スケールファクタバンドの最大量子化ステップ数、
及び統計量を計算し、隣接するショートブロックと前記最大量子化ステップ
数、及び統計量とを比較してグルーピングの判定を行な
うことを特徴としたオーディオ符号化方法。4. The audio encoding method according to claim 3, wherein, when a short block is selected, the inside is divided into a plurality of scale factor bands for each of the short blocks. The maximum number of quantization steps for the band,
And calculating a statistic, and comparing adjacent short blocks with the maximum number of quantization steps and the statistic to determine grouping.