KR100682966B1

KR100682966B1 - Method and apparatus for quantizing/dequantizing frequency amplitude, and method and apparatus for encoding/decoding audio signal using it

Info

Publication number: KR100682966B1
Application number: KR1020060015940A
Authority: KR
Inventors: 성호상; 김상욱; 라케쉬 타오리; 이강은; 이시화
Original assignee: 삼성전자주식회사
Priority date: 2005-07-13
Filing date: 2006-02-18
Publication date: 2007-02-15
Also published as: KR20070008377A

Abstract

주파수 크기데이터 양자화/역양자화 방법 및 장치와 이를 이용한 오디오 부호화/복호화 방법 및 장치가 개시된다. 그 주파수 크기데이터 양자화 방법은, 주파수 크기 데이터를 양자화하는 방법에 있어서, 임의의 오디오 프레임을 구성하는 복수의 대역에 대해, 각 대역별로 주파수 크기들의 파워를 구하여 양자화하는 단계; 양자화된 값을 이용하여 대역별로 주파수 크기 데이터를 정규화하는 단계; 및 그 정규화된 주파수 크기데이터의 짝수 또는 홀수번째 데이터들을 양자화하는 단계를 포함함을 특징으로 하고, 그 주파수 크기데이터 양자화 방법은, 그 정규화된 주파수 크기 데이터들 중 양자화되지 않은 데이터를, 양자화된 데이터를 이용한 보간에 의해 생성하는 단계; 및 그 정규화된 주파수 크기 데이터들 중 양자화되지 않은 데이터와 상기 보간에 의해 생성된 데이터의 차에 얻어진 보간에러를 양자화하는 단계를 더 포함함이 바람직하다.Disclosed are a frequency quantization / dequantization method and apparatus, and an audio encoding / decoding method and apparatus using the same. The frequency magnitude data quantization method includes: quantizing frequency magnitude data, comprising: obtaining and quantizing power of frequency magnitudes for each band for a plurality of bands constituting an arbitrary audio frame; Normalizing the frequency magnitude data for each band using the quantized value; And quantizing the even or odd-numbered data of the normalized frequency magnitude data, wherein the frequency magnitude data quantization method includes quantized data of unquantized data among the normalized frequency magnitude data. Generating by interpolation using; And quantizing the interpolation error obtained in the difference between the unquantized data among the normalized frequency magnitude data and the data generated by the interpolation.

본 발명에 의하면, 광대역 에러 오디오 신호의 주파수 크기 정보와 위상 정보를 사용하여 다수의 계층을 두고 확장성을 지원할 수 있으며, 기본 오디오의 질을 유지할 수 있다. 상술한 주파수 크기정보를 이용하면 넓은 주파수 대역을 적은 비트로 양자화할 수 있으며 전체적으로 음질에 대한 scalability를 줄 수 있다.According to the present invention, it is possible to support scalability over a plurality of layers by using frequency magnitude information and phase information of a wideband error audio signal, and maintain basic audio quality. By using the above-described frequency size information, a wide frequency band can be quantized with fewer bits, and overall scalability of sound quality can be given.

Description

주파수 크기데이터 양자화/역양자화 방법 및 장치와 이를 이용한 오디오 부호화/복호화 방법 및 장치{Method and apparatus for quantizing/dequantizing frequency amplitude, and method and apparatus for encoding/decoding audio signal using it}Method and apparatus for quantizing / dequantizing frequency magnitude data and audio encoding / decoding method using same {method and apparatus for quantizing / dequantizing frequency amplitude, and method and apparatus for encoding / decoding audio signal using it}

도 1은 MLT 방식을 이용하는 고대역 오디오 신호 부호화 장치의 구성을 블록도로 도시한 것이다.1 is a block diagram illustrating a configuration of a high-band audio signal encoding apparatus using an MLT scheme.

도 2에 도시된 바와 같은 하모닉 코더(Harmonic coder)를 이용한 고대역 오디오 부호화 장치를 도시한 것이다.2 illustrates a high band audio encoding apparatus using a harmonic coder as shown in FIG. 2.

도 3은 MDCT 방식을 이용하는 광대역 에러 오디오 신호 부호화 장치의 구성을 블록도로 도시한 것이다.3 is a block diagram illustrating a configuration of a wideband error audio signal encoding apparatus using an MDCT scheme.

도 4는 본 발명에 의한 주파수 크기데이터 양자화 장치를 이용한 오디오 부호화 장치의 구성을 블록도로 도시한 것이다.4 is a block diagram illustrating a configuration of an audio encoding apparatus using a frequency quantization apparatus according to the present invention.

도 5는 상기 주파수 크기 부호화부(440)의 보다 세부적인 구성을 블록도로 도시한 것이다.5 is a block diagram illustrating a more detailed configuration of the frequency magnitude encoder 440.

도 6은 본 발명에 의한 본 발명에 의한 주파수 크기데이터 양자화 방법을 이용한 오디오 부호화 방법을 흐름도로 도시한 것이다.6 is a flowchart illustrating an audio encoding method using the frequency magnitude data quantization method according to the present invention.

도 7은 상기 640단계에서 주파수 크기를 부호화를 위한 주파수 크기 양자화 과정을 흐름도로 도시한 것이다. FIG. 7 is a flowchart illustrating a frequency magnitude quantization process for encoding the frequency magnitude in step 640.

도 8은 본 발명의 바람직한 실시 예에 따른 오디오 부호화 장치를 블록도로 도시한 것이다.8 is a block diagram illustrating an audio encoding apparatus according to an embodiment of the present invention.

도 9는 본 발명에 의한 오디오 신호의 주파수 크기 양자화방법에 대한 일예의 개념도이다.9 is a conceptual diagram of an example of a frequency magnitude quantization method of an audio signal according to the present invention.

도 10은 본 발명에 의한 주파수 크기 양자화 방법 및 장치에 의해 제공되는 비트스트림의 구성을 나타낸다.10 shows a configuration of a bitstream provided by a frequency magnitude quantization method and apparatus according to the present invention.

도 11은 본 발명에 의한 주파수 크기데이터 역양자화부에 대한 일 실시예를 도시한 것이다.11 illustrates an embodiment of a frequency magnitude data dequantization unit according to the present invention.

도 12는 본 발명에 의한 주파수 크기데이터 역양자화부에 대한 다른 실시예를 도시한 것이다.12 shows another embodiment of the frequency quantization dequantization unit according to the present invention.

도 13은 본 발명에 의한 주파수 크기데이터 역양자화부에 대한 또 다른 실시예를 도시한 것이다.13 shows another embodiment of the frequency quantization dequantization unit according to the present invention.

도 14는 본 발명에 의한 주파수 크기 데이터 역양자화장치를 이용한 오디오 신호 복호화 장치의 구성을 블록도로 도시한 것이다.14 is a block diagram illustrating a configuration of an audio signal decoding apparatus using the frequency quantization dequantizer according to the present invention.

도 15는 본 발명에 의한 주파수 크기 데이터 역양자화 방법의 일 실시예를 흐름도로 도시한 것이다.15 is a flowchart illustrating an embodiment of a method of dequantizing frequency magnitude data according to the present invention.

도 16은 본 발명에 의한 주파수 크기 데이터 역양자화 방법의 다른 실시예를 흐름도로 도시한 것이다.16 is a flowchart illustrating another embodiment of the method of frequency quantizing data inverse quantization according to the present invention.

도 17은 본 발명에 의한 주파수 크기 데이터 역양자화 방법의 다른 실시예를 흐름도로 도시한 것이다.17 is a flowchart illustrating another embodiment of a method of dequantizing frequency magnitude data according to the present invention.

도 18은 본 발명에 의한 오디오 주파수 크기 데이터 역양자화 방법을 이용한 오디오 복호화 방법을 흐름도로 도시한 것이다.18 is a flowchart illustrating an audio decoding method using an audio frequency magnitude data dequantization method according to the present invention.

도 19는 하나의 대역에 대한 주파수 크기의 역양자화 장치에 대한 일 실시예의 구성을 블록도로 도시한 것이다.FIG. 19 is a block diagram illustrating a configuration of an embodiment of an inverse quantization apparatus having a frequency magnitude for one band.

도 20은 하나의 대역에 대한 주파수 크기의 역양자화 장치에 대한 다른 실시예의 구성을 블록도로 도시한 것이다.20 is a block diagram showing the configuration of another embodiment of an inverse quantization apparatus of frequency magnitude for one band.

도 21은 본 발명에 의한 오디오 복호화장치의 또 다른 실시예를 나타내고 있다.21 shows another embodiment of an audio decoding apparatus according to the present invention.

본 발명은 오디오 부호화 및 복호화에 관한 것으로서, 특히 주파수 크기데이터 양자화/역양자화 방법 및 장치와 이를 이용한 오디오 부호화/복호화 방법 및 장치에 관한 것이다.BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to audio encoding and decoding, and more particularly, to a method and apparatus for quantizing / dequantizing frequency size data, and an audio encoding and decoding method and apparatus using the same.

오디오 통신의 응용 분야가 다양해지고 네트워크의 전송속도가 향상됨으로 인해 고품질의 오디오 통신에 대한 필요성이 부각되고 있다. 이에 따라 기존의 오디오 통신 대역인 0.3kHz∼3.4kHz에 비해 자연성과 명료도 등 다양한 측면에서 우수한 성능을 갖는 0.3kHz∼7kHz의 대역폭을 갖는 광대역(wideband) 오디오 신호의 전달이 요구되고 있다.As the application fields of audio communication are diversified and the network transmission speed is increased, the need for high quality audio communication is emerging. Accordingly, there is a demand for transmission of a wideband audio signal having a bandwidth of 0.3 kHz to 7 kHz, which has excellent performance in various aspects such as naturalness and clarity, compared to 0.3 kHz to 3.4 kHz, which is an existing audio communication band.

또한 네트워크 측면에서, 데이터를 패킷단위로 전송하는 패킷 스위칭 네트워크(packet switching network)는 채널의 정체 현상을 초래할 수 있고, 이로 인한 패킷 손실과 오디오의 질 저하가 발생될 수 있다. 이를 해결하기 위하여 손상된 패킷을 은닉하는 기술이 사용되고 있지만 이는 근본적인 처방이 될 수 없다. 따라서 상기 광대역 오디오 신호를 효과적으로 압축하면서 채널의 정체 현상을 해결할 수 있는 광대역 오디오 부호화 및 복호화 기술이 제안되고 있다.In addition, on the network side, a packet switching network that transmits data in packet units may cause channel congestion, resulting in packet loss and audio quality degradation. To solve this problem, a technique for concealing corrupted packets is used, but this cannot be a fundamental prescription. Therefore, a wideband audio encoding and decoding technique has been proposed to solve the channel congestion while effectively compressing the wideband audio signal.

현재 제안되고 있는 광대역 오디오 부호화 및 복호화는 0.3kHz∼7kHz 대역의 오디오 신호를 한꺼번에 압축하고 이를 복원하는 방식과 0.3kHz∼4kHz 대역과 4kHz∼7kHz 대역으로 나누어 계층적으로 압축하고, 이를 복원하는 방식, 그리고 0.3∼3.4kHz 대역을 압축한 후 이를 복원하여 다시 광대역으로 오버샘플링(over-sampling)한 후 원래의 광대역 오디오 신호와의 광대역 에러 신호를 구하여 이를 압축하는 방식으로 구분될 수 있다.Widely proposed audio encoding and decoding is a method of compressing and restoring audio signals of 0.3 kHz to 7 kHz at once, and hierarchically compressing and restoring them into 0.3 kHz to 4 kHz band and 4 kHz to 7 kHz band, After compression of the 0.3 to 3.4 kHz band, it is recovered, over-sampled back to the wide band, and then a wideband error signal with the original wideband audio signal is obtained and compressed.

상기 두 번째 방식과 세 번째 방식은 데이터 정체 현상에 따라 네트워크가 복호화기에 전달하는 계층 또는 데이터의 양을 조절하여 주어진 환경에서 최적의 통신이 가능하도록 하는 대역폭 확장(band width scalability) 기능을 이용한 광대역 오디오 부호화 및 복호화 방식이다.The second and third methods are broadband audio using a bandwidth width scalability function that enables optimal communication in a given environment by controlling the amount of layers or data transmitted to the decoder according to data congestion. Encoding and decoding method.

상기 0.3kHz∼4kHz 대역과 4kHz∼7kHz 대역으로 나누어 계층적으로 압축하는 광대역 오디오 부호화에서 4kHz∼7kHz 대역의 고대역 오디오 신호는 MLT(Modulated Lapped Transform, 이하 MLT라 함) 방식에 의해 부호화된다. 도 1은 MLT 방식을 이용하는 고대역 오디오 신호 부호화 장치의 구성을 블록도로 도시한 것이다.In the wideband audio coding that is hierarchically compressed into the 0.3 kHz to 4 kHz band and the 4 kHz to 7 kHz band, the high band audio signal of the 4 kHz to 7 kHz band is encoded by the MLT (Modulated Lapped Transform, MLT) method. 1 is a block diagram illustrating a configuration of a high-band audio signal encoding apparatus using an MLT scheme.

도 1을 참조하면, 상기 고대역 오디오 부호화 장치는, 고대역 오디오 신호가 입력되면, MLT부(100)에서 입력되는 고대역 오디오 신호를 MLT하여 MLT 계수를 추출한다. 추출된 MLT 계수의 크기(Magnitude)는 2D-DCT(2 Dimension - Discrete Cosine Transform)부(110)로 출력하고, 추출된 MLT 계수의 부호(sign)는 부호 양자화기(120)로 출력한다.Referring to FIG. 1, when a high band audio signal is input, the high band audio encoding apparatus MLTs a high band audio signal input from the MLT unit 100 to extract MLT coefficients. The magnitude (Mgnitude) of the extracted MLT coefficients is output to the 2D-Discrete Cosine Transform (2D-DCT) unit 110, and the sign of the extracted MLT coefficients is output to the sign quantizer 120.

2D-DCT부(110)는 MLT 계수의 크기로부터 2D-DCT 계수를 추출하고, 상기 추출된 2D-DCT 계수를 DCT 계수 양자화기(130)로 출력한다. DCT 계수 양자화기(130)는 2차원 구조를 갖는 2D-DCT 계수에서 통계적으로 그 크기가 큰 순서대로 나열하고, 나열된 벡터를 양자화한 후, 그 코드북 인덱스를 출력한다. 부호 양자화기(120)는 MLT 계수의 크기가 큰 계수에 해당되는 부호를 양자화하여 출력한다. 출력된 코드북 인덱스 및 양자화된 부호는 고대역 오디오 복호화 장치(미도시)로 제공된다.The 2D-DCT unit 110 extracts the 2D-DCT coefficients from the magnitude of the MLT coefficients, and outputs the extracted 2D-DCT coefficients to the DCT coefficient quantizer 130. The DCT coefficient quantizer 130 arranges the 2D-DCT coefficients having a two-dimensional structure in order of statistical magnitude, quantizes the listed vectors, and outputs the codebook index. The code quantizer 120 quantizes and outputs a code corresponding to a coefficient having a large MLT coefficient. The output codebook index and the quantized code are provided to a high band audio decoding apparatus (not shown).

그러나, 상기 MLT 방식에 의해 고대역 오디오 신호를 부호화하는 것은 낮은 비트율로 오디오 신호를 전송할 때, 고음질의 복원이 어렵고, 비트율이 낮아질수록 오디오 복원 성능의 저하가 두드러진다.However, the encoding of the high-band audio signal by the MLT method is difficult to restore high sound quality when transmitting the audio signal at a low bit rate, and the lower the bit rate, the lower the audio reconstruction performance.

따라서, 이를 개선하기 위하여 도 2에 도시된 바와 같은 하모닉 코더(Harmonic coder)를 이용한 고대역 오디오 부호화 장치가 제안된 바 있다.Therefore, in order to improve this, a high-band audio encoding apparatus using a harmonic coder as shown in FIG. 2 has been proposed.

도 2를 참조하면, 하모닉 피크(harmonic peak) 검출부(200)는 고대역 오디오 신호가 입력되면, 상기 고대역 오디오 신호의 하모닉 피크를 검출하고, 상기 검출된 하모닉 피크를 토대로 한 고대역 오디오 신호의 진폭(amplitude)과 위상(phase)을 출력한다.Referring to FIG. 2, when a high-band audio signal is input, the harmonic peak detector 200 detects a harmonic peak of the high-band audio signal, and detects a harmonic peak of the high-band audio signal based on the detected harmonic peak. Outputs amplitude and phase.

진폭 양자화기(210)는 입력된 고대역 오디오 신호의 진폭을 양자화하여 출력한다. 위상 양자화기(203)는 입력되는 고대역 오디오 신호의 위상을 양자화하여 출력한다. 출력된 양자화된 진폭 및 양자화된 위상은 고대역 오디오 복호화 장치(미도시)로 제공된다.The amplitude quantizer 210 quantizes and outputs an amplitude of an input high band audio signal. The phase quantizer 203 quantizes and outputs a phase of an input high band audio signal. The output quantized amplitude and quantized phase are provided to a high band audio decoding device (not shown).

그러나, 도 2와 같은 하모닉 코더를 이용한 고대역 오디오 신호 부호화로 적은 비트율과 낮은 복잡도로 고음질을 재생할 수 있으나, 입력되는 고대역 오디오 신호에 대한 확장성(scalability)을 지원하는데 한계가 있다.However, although high-quality audio can be reproduced with a low bit rate and low complexity by encoding a high-band audio signal using a harmonic coder as shown in FIG. 2, there is a limit in supporting scalability of an input high-band audio signal.

또한, 광대역 에러 오디오 부호화는 상기 대역폭 확장 기능을 갖는 0.3∼3.4kHz 대역을 압축한 후 이를 복원하여 다시 광대역으로 오버샘플링(over-sampling)한 후, 원래의 광대역 오디오 신호와의 광대역 에러 신호를 구하여 이를 압축하는 방식을 이용한다. 상기 광대역 에러 오디오 부호화 시, 0.05kHz∼7kHz 대역의 광대역 에러 오디오 신호는 MDCT(Modified Discrete Cosine Transform, 이하 MDCT라고 약함) 방식에 의해 부호화된다. 도 3은 MDCT 방식을 이용하는 광대역 에러 오디오 신호 부호화 장치의 구성을 블록도로 도시한 것이다.In addition, broadband error audio coding compresses the 0.3 to 3.4 kHz band having the bandwidth extension function, recovers it, over-samples it to wide bandwidth, and then obtains a wideband error signal from the original wideband audio signal. Compress it. In the wideband error audio encoding, a wideband error audio signal in a 0.05 kHz to 7 kHz band is encoded by a modified discrete cosine transform (MDCT) method. 3 is a block diagram illustrating a configuration of a wideband error audio signal encoding apparatus using an MDCT scheme.

도 3을 참조하면, 상기 광대역 에러 오디오 부호화 장치는, 광대역 오디오 신호가 입력되면, 다운 샘플링부(300)를 통해 저대역으로 다운 샘플링된 신호를 얻고, 상기 저대역으로 다운 샘플링된 신호를 저대역 오디오 부호화기(310)에서 부호화를 한다. 부호화된 오디오 신호를 업 샘플링부(320)에 의해 광대역 신호를 복원하고, 감산기(330)에 의해 원신호에서 복원된 광대역 신호를 감산하여 광대역 에러 신호를 생성한다. 생성된 광대역 에러 신호는 MDCT부(340)로 입력되고, MDCT부 (340)는 입력된 광대역 에러 신호의 MDCT 계수를 추출한다. 추출된 MDCT 계수는 대역 분할부(350)에 의해 대역별로 분할되고, 정규화(normalization)부(360)에서 대역별로 분할된 MDCT 계수는 정규화된다. 정규화된 MDCT계수는 양자화기(370)에서 양자화되어 코드북 인덱스를 출력한다. 출력된 코드북 인덱스는 고대역 오디오 복호화 장치(미도시)로 제공된다.Referring to FIG. 3, when a wideband audio signal is input, the wideband error audio encoding apparatus obtains a downsampled signal into a low band through a down sampling unit 300, and then stores the downsampled signal into a low band into a low band. The audio encoder 310 performs encoding. The wideband signal is reconstructed by the up-sampling unit 320 by the encoded audio signal, and the wideband signal reconstructed from the original signal is subtracted by the subtractor 330 to generate a wideband error signal. The generated broadband error signal is input to the MDCT unit 340, and the MDCT unit 340 extracts the MDCT coefficients of the input broadband error signal. The extracted MDCT coefficients are divided by band by the band dividing unit 350, and the MDCT coefficients divided by band by the normalization unit 360 are normalized. The normalized MDCT coefficient is quantized by the quantizer 370 to output a codebook index. The output codebook index is provided to a high band audio decoding apparatus (not shown).

그러나, 상기 MDCT 방식에 의한 광대역 에러 오디오 신호를 부호화하는 방식 역시 낮은 비트율로 오디오 신호를 전송할 때, 고음질의 복원이 어려운 단점이 있다. However, the method of encoding the wideband error audio signal by the MDCT method also has a disadvantage in that high quality sound is difficult to recover when the audio signal is transmitted at a low bit rate.

본 발명이 이루고자 하는 기술적 과제는 광대역 오디오 부호화 및 복호화 시, 광대역 오디오 신호의 선형 예측 잔차를 주파수 도메인으로 변환 한 뒤, 그 크기 정보를 양자화 함에 있어서 scalability를 지원하여 계층적으로 오디오 부호화 및 복호화 를 할 수 있는, 주파수 크기데이터 양자화/역양자화 방법 및 장치와 이를 이용한 오디오 부호화/복호화 방법 및 장치를 제공하는 것이다.The technical problem to be achieved in the present invention is to perform the audio encoding and decoding hierarchically by converting the linear prediction residual of the wideband audio signal into the frequency domain and then supporting scalability in quantizing the size information. The present invention provides a method and apparatus for quantizing / dequantizing frequency magnitude data and an audio coding / decoding method using the same.

상기 기술적 과제들을 달성하기 위한 본 발명에 의한, 주파수 크기데이터 양자화 방법은, (a) 오디오신호의 주파수 크기들의 파워를 구하여 양자화하는 단계; (b) 상기 주파수 크기데이터를 이용하여 상기 양자화된 값을 정규화하는 단계; 및 (c) 상기 정규화된 주파수 크기데이터들 중 짝수 또는 홀수번째 데이터들을 양자화하는 단계를 포함함을 특징으로 한다.In accordance with an aspect of the present invention, there is provided a frequency quantization method comprising: (a) obtaining and quantizing power of frequency magnitudes of an audio signal; (b) normalizing the quantized value using the frequency magnitude data; And (c) quantizing even or odd data among the normalized frequency magnitude data.

상기 주파수 크기데이터 양자화 방법은, 상기 (b)단계의 정규화된 주파수 크기 데이터들 중 상기 (c)단계에서 양자화되지 않은 데이터를, 상기 (c)단계에서 양자화된 데이터를 이용한 보간에 의해 생성하는 단계; 및 상기 (b)단계의 정규화된 주파수 크기 데이터들 중 상기 (c)단계에서 양자화되지 않은 데이터와 상기 보간에 의해 생성된 데이터의 차에 얻어진 보간에러를 양자화하는 단계를 더 구비함이 바람직하다.The frequency magnitude data quantization method may include generating data that is not quantized in step (c) of the normalized frequency magnitude data of step (b) by interpolation using the quantized data in step (c). ; And quantizing the interpolation error obtained from the difference between the data not quantized in step (c) and the data generated by the interpolation among the normalized frequency magnitude data of step (b).

상기 기술적 과제들을 달성하기 위한 본 발명에 의한, 주파수 크기데이터 양자화 방법은, 주파수 크기 데이터를 양자화하는 방법에 있어서, (a) 임의의 오디오 프레임을 구성하는 복수의 대역에 대해, 각 대역별로 주파수 크기들의 파워를 구하여 양자화하는 단계; (b) 상기 양자화된 값을 이용하여 대역별로 주파수 크기 데이터를 정규화하는 단계; 및 (c) 상기 정규화된 주파수 크기데이터의 짝수 또는 홀수번째 데이터들을 양자화하는 단계를 포함함을 특징으로 한다.According to the present invention for achieving the above technical problem, the method for quantizing the frequency size data, in the method for quantizing the frequency size data, (a) for a plurality of bands constituting an arbitrary audio frame, the frequency size for each band Obtaining and quantizing their power; (b) normalizing frequency magnitude data for each band using the quantized value; And (c) quantizing the even or odd-numbered data of the normalized frequency magnitude data.

상기 주파수 크기데이터 양자화 방법은, (d) 상기 (a)단계의 정규화된 주파수 크기 데이터들 중 상기 (c)단계에서 양자화되지 않은 데이터를, 상기 (b)단계에서 양자화된 데이터를 이용한 보간에 의해 생성하는 단계; 및 (e) 상기 (a)단계의 정규화된 주파수 크기 데이터들 중 상기 (b)단계에서 양자화되지 않은 데이터와 상기 보간에 의해 생성된 데이터의 차에 얻어진 보간에러를 양자화하는 단계를 더 포함함이 바람직하다.The frequency magnitude data quantization method may include: (d) interpolating data that is not quantized in step (c) of the normalized frequency magnitude data of step (a) by using quantized data in step (b). Generating; And (e) quantizing the interpolation error obtained from the difference between the data that is not quantized in step (b) and the data generated by the interpolation among the normalized frequency magnitude data of step (a). desirable.

상기 기술적 과제들을 달성하기 위한 본 발명에 의한, 오디오 부호화 방법은, 오디오 신호의 광대역 에러 오디오 신호의 주파수 포락선을 구하는 단계; 상기 검출된 주파수 포락선을 상기 광대역 에러 오디호 신호에서 제거하고 주파수 크기와 위상을 구하는 단계; 상기 검출된 주파수 크기와 위상을 부호화하는 단계를 포함함을 특징으로 하고, 상기 주파수 크기 부호화는 (a) 임의의 오디오 프레임을 구성하는 복수의 대역에 대해, 각 대역별로 주파수 크기들의 파워를 구하여 양자화하는 단계; (b) 상기 양자화된 값을 이용하여 대역별로 주파수 크기 데이터를 정규화하는 단계; 및 (c) 상기 정규화된 주파수 크기데이터의 짝수 또는 홀수번째 데이터들을 양자화하는 단계를 포함한다.According to an aspect of the present invention, an audio encoding method includes: obtaining a frequency envelope of a wideband error audio signal of an audio signal; Removing the detected frequency envelope from the wideband error audio signal and obtaining a frequency magnitude and a phase; And encoding the detected frequency magnitude and phase, wherein the frequency magnitude coding comprises (a) obtaining power of frequency magnitudes for each band and quantizing the plurality of bands constituting an arbitrary audio frame. Making; (b) normalizing frequency magnitude data for each band using the quantized value; And (c) quantizing the even or odd data of the normalized frequency magnitude data.

상기 기술적 과제들을 달성하기 위한 본 발명에 의한, 주파수 크기데이터 양자화 장치는, 주파수 크기 데이터를 양자화하는 장치에 있어서, 임의의 오디오 프레임을 구성하는 대역별로 주파수 크기들의 파워를 계산하는 파워계산부; 상기 계산된 파워를 양자화하는 파워양자화부; 상기 주파수 크기데이터를 이용하여 상기 양자화된 값을 정규화하는 크기 정규화부; 및 상기 정규화된 주파수 크기데이터의 짝수 또는 홀수번째 데이터들을 양자화하는 정규화데이터 양자화부를 포함함을 특징으로 한다.According to an aspect of the present invention, there is provided a frequency magnitude data quantization apparatus, comprising: a power calculator configured to calculate power of frequency magnitudes for each band constituting an arbitrary audio frame; A power quantizer for quantizing the calculated power; A magnitude normalizer for normalizing the quantized value using the frequency magnitude data; And a normalized data quantizer for quantizing even or odd-numbered data of the normalized frequency magnitude data.

상기 주파수 크기데이터 양자화 장치는, 상기 크기 정규화부에서 정규화된 주파수 크기 데이터들 중 상기 정규화데이터 양자화부에서 양자화되지 않은 데이터를 상기 정규화데이터 중 양자화된 데이터를 이용한 보간에 의해 생성하는 보간부; 및 상기 정규화된 주파수 크기 데이터들 중 상기 양자화되지 않은 데이터와 상기 보간에 의해 생성된 데이터의 차에 얻어진 보간에러를 양자화하는 보간에러양자화부를 더 포함함이 바람직하다.The frequency magnitude data quantization apparatus may include: an interpolation unit configured to generate data that is not quantized by the normalization data quantization unit among frequency magnitude data normalized by the magnitude normalization unit by interpolation using quantized data among the normalization data; And an interpolation error quantization unit that quantizes an interpolation error obtained by a difference between the unquantized data and the data generated by the interpolation among the normalized frequency magnitude data.

상기 기술적 과제들을 달성하기 위한 본 발명에 의한, 오디오 부호화 장치는, 오디오 신호의 광대역 에러 오디오 신호의 주파수 포락선을 구하는 포락선검출부; 상기 검출된 주파수 포락선을 상기 광대역 에러 오디호 신호에서 제거하고 주파수 크기와 위상을 구하는 주파수크기/위상 획득부; 상기 검출된 주파수 크기를 부호화 하는 주파수크기 부호화부; 및 상기 검출된 주파수 위상을 부호화하는 주파수 위상 부호화부를 포함함을 특징으로 하고, 상기 주파수 크기 부호화부는 임의의 오디오 프레임을 구성하는 대역들에 대해 상기 대역별로 주파수 크기들의 파워를 계산하는 파워계산부; 상기 계산된 파워를 양자화하는 파워양자화부; 상기 주파수 크기데이터를 이용하여 상기 양자화된 값을 정규화하는 크기 정규화부; 및 상기 정규화된 주파수 크기데이터의 짝수 또는 홀수번째 데이터들을 양자화하는 정규화데이터 양자화부를 포함한다.According to an aspect of the present invention, an audio encoding apparatus includes: an envelope detector configured to obtain a frequency envelope of a wideband error audio signal of an audio signal; A frequency magnitude / phase acquisition unit which removes the detected frequency envelope from the wideband error audio signal and obtains a frequency magnitude and a phase; A frequency size encoder which encodes the detected frequency magnitude; And a frequency phase encoder which encodes the detected frequency phase, wherein the frequency magnitude encoder comprises: a power calculator that calculates power of frequency magnitudes for each band for bands constituting an arbitrary audio frame; A power quantizer for quantizing the calculated power; A magnitude normalizer for normalizing the quantized value using the frequency magnitude data; And a normalized data quantizer for quantizing even or odd-numbered data of the normalized frequency magnitude data.

상기 기술적 과제들을 달성하기 위한 본 발명에 의한, 주파수 크기데이터 역양자화 방법은, 비트스트림에 포함된 주파수크기의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 주파수 크기들의 파워를 복원하는 단계; 및 복원하고자 하는 주파수 크기의 개수에 해당하는 임펄스들과 상기 복원된 주파수 크기 파워를 곱하여 주파수 크기들을 복원하는 단계를 포함함을 특징으로 한다.In accordance with an aspect of the present invention, there is provided a frequency quantization dequantization method comprising: dequantizing a value (RMS index) of quantizing a power of a frequency size included in a bitstream to restore power of frequency magnitudes; And restoring frequency magnitudes by multiplying impulses corresponding to the number of frequency magnitudes to be restored by the restored frequency magnitude power.

상기 기술적 과제들을 달성하기 위한 본 발명에 의한, 주파수 크기데이터 역양자화 방법은, 비트스트림에 포함된 주파수 크기들의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 상기 주파수 크기들의 파워를 복원하는 단계; 비트스트림에 포함된 양자화된 짝수번째 또는 홀수번째의 정규화된 주파수 크기데이터들을 분리 하여 역양자화함으로써 상기 짝수번째 또는 홀수번째의 정규화된 주파수 크기 데이터들을 복원하는 단계; 상기 복원된 정규화된 주파수 크기 데이터들을 보간하여, 정규화된 주파수 크기 데이터들 중 복원되지 않은 주파수 크기 데이터들을 생성하는 단계; 및 상기 정규화된 주파수 크기데이터들과 상기 보간에 의해 생성된 주파수 크기 데이터들을 상기 복원된 주파수 크기들의 파워를 이용하여 역정규화하여 주파수 크기 데이터들을 복원하는 단계를 포함함을 특징으로 한다.In accordance with an aspect of the present invention, there is provided a frequency quantization dequantization method comprising: dequantizing a value (RMS index) of quantizing power of frequency magnitudes included in a bitstream to restore power of the frequency magnitudes. ; Restoring the even or odd normalized frequency magnitude data by separating and inverse quantizing the quantized even or odd normalized frequency magnitude data included in the bitstream; Interpolating the reconstructed normalized frequency magnitude data to generate unreconstructed frequency magnitude data among the normalized frequency magnitude data; And denormalizing the normalized frequency magnitude data and the frequency magnitude data generated by the interpolation using power of the restored frequency magnitudes to restore frequency magnitude data.

상기 기술적 과제들을 달성하기 위한 본 발명에 의한, 주파수 크기데이터 역양자화 방법은, (a) 비트스트림에 포함된 주파수 크기들의 파워를 양자화한 값에 해당하는 RMS 인덱스를 역양자화하여 상기 주파수 크기들의 파워를 복원하는 단계; (b) 비트스트림에 포함된 양자화된 짝수번째 또는 홀수번째의 정규화된 주파수 크기데이터들을 역양자화하여 상기 짝수번째 또는 홀수번째의 정규화된 주파수 크기 데이터들을 복원하는 단계; (c) 상기 복원된 정규화된 주파수 크기 데이터들을 보간하여, 정규화된 주파수 크기 데이터들 중 복원되지 않은 주파수 크기 데이터들을 생성하는 단계; (d) 비트스트림에 포함된 양자화된 보간에러 데이터들을 역양자화하여, 보간에러 데이터들을 복원하는 단계; 및 (e) 상기 (b)단계에서 복원된 주파수 크기데이터들과 상기 (c)단계에서 복원된 보간에 의해 생성된 주파수 크기데이터들과 상기 (d)단계에서 복원된 보간에러데이터들을 상기 (a)단계에서 복원된 주파수 크기들의 파워를 이용하여 역정규화하여 주파수 크기 데이터들을 복원하는 단계를 포함함을 특징으로 한다.In accordance with an aspect of the present invention, there is provided a frequency magnitude data dequantization method according to the present invention, wherein (a) inverse quantizes an RMS index corresponding to a quantized value of powers of frequency magnitudes included in a bitstream. Restoring; (b) dequantizing the quantized even or odd normalized frequency magnitude data included in the bitstream to restore the even or odd normalized frequency magnitude data; (c) interpolating the reconstructed normalized frequency magnitude data to generate unreconstructed frequency magnitude data among the normalized frequency magnitude data; (d) inverse quantization of quantized interpolation error data included in the bitstream to recover interpolation error data; And (e) the frequency magnitude data recovered in step (b), the frequency magnitude data generated by interpolation recovered in step (c) and the interpolation error data recovered in step (d). And restoring the frequency magnitude data by denormalizing the power of the recovered frequency magnitudes in step).

상기 주파수 크기 데이터 역양자화방법을 구성하는 각 단계는 주파수 영역으 로 변환된 오디오 신호의 프레임을 구성하는 대역별로 수행됨이 바람직하다.Each step of configuring the frequency magnitude data dequantization method is preferably performed for each band constituting a frame of the audio signal converted into the frequency domain.

상기 기술적 과제들을 달성하기 위한 본 발명에 의한, 오디오 복호화 방법은 주파수 크기를 복호화하는 단계; 주파수 위상을 복호화하는 단계; 및 상기 복호화된 주파수 크기 및 위상 데이터를 이용하여 광대역 에러 오디오 신호의 주파수 포락선을 복원하는 단계를 포함함을 특징으로 하고, 상기 주파수 크기 복호화 단계는 비트스트림에 포함된 주파수크기의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 주파수 파워를 복원하는 단계; 복원하고자 하는 주파수 크기의 개수에 해당하는 임펄스들을 생성하는 단계; 및 상기 복원된 주파수 크기 파워를 곱하여 주파수 크기들을 복원하는 단계를 포함한다.According to an embodiment of the present invention, an audio decoding method includes: decoding a frequency magnitude; Decoding the frequency phase; And restoring a frequency envelope of a wideband error audio signal using the decoded frequency magnitude and phase data, wherein the frequency magnitude decoding step is a quantized value of power of a frequency magnitude included in a bitstream. Dequantizing (RMS index) to restore frequency power; Generating impulses corresponding to the number of frequency magnitudes to be restored; And restoring frequency magnitudes by multiplying the restored frequency magnitude power.

상기 기술적 과제들을 달성하기 위한 본 발명에 의한, 주파수 크기데이터 역양자화 장치는, 비트스트림에 포함된 주파수크기의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 주파수 파워를 복원하는 주파수파워 복원부; 복원하고자 하는 주파수 크기의 개수에 해당하는 임펄스들을 생성하는 임펄스열생성부; 및 상기 임펄스열에 상기 복원된 주파수 크기 파워를 곱하여 주파수 크기들을 복원하는 제1주파수크기 복원부를 포함함을 특징으로 한다.In accordance with an aspect of the present invention, a frequency magnitude data dequantization device includes: a frequency power recovery unit for restoring frequency power by dequantizing a value (RMS index) of quantizing a power of a frequency size included in a bitstream; ; An impulse heat generator for generating impulses corresponding to the number of frequency magnitudes to be restored; And a first frequency magnitude recovery unit multiplying the impulse string by the restored frequency magnitude power to restore frequency magnitudes.

상기 기술적 과제들을 달성하기 위한 본 발명에 의한, 주파수 크기데이터 역양자화 장치는, 비트스트림에 포함된 주파수 크기들의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 상기 주파수 크기들의 파워를 복원하는 주파수파워 복원부; 비트스트림에 포함된 양자화된 짝수번째 또는 홀수번째의 정규화된 주파수 크기데 이터들을 역양자화하여 상기 짝수번째 또는 홀수번째의 정규화된 주파수 크기 데이터들을 복원하는 정규화 데이터 복원부; 상기 복원된 정규화된 주파수 크기 데이터들을 보간하여, 정규화된 주파수 크기 데이터들 중 복원되지 않은 주파수 크기 데이터들을 생성하는 정규화데이터 보간부; 및 상기 정규화된 주파수 크기데이터들과 상기 보간에 의해 생성된 주파수 크기 데이터들을 상기 복원된 주파수 크기들의 파워를 이용하여 역정규화하여 주파수 크기 데이터들을 복원하는 제2주파수 크기 복원부를 포함함을 특징으로 한다.In accordance with an aspect of the present invention, there is provided a frequency quantization data dequantization apparatus, which inversely quantizes a value (RMS index) of quantizing power of frequency magnitudes included in a bitstream to restore power of the frequency magnitudes. A power recovery unit; A normalized data reconstruction unit for dequantizing the quantized even-numbered or odd-numbered normalized frequency magnitude data included in a bitstream to restore the even-numbered or odd-numbered normalized frequency magnitude data; A normalized data interpolation unit interpolating the restored normalized frequency magnitude data to generate unreconstructed frequency magnitude data among the normalized frequency magnitude data; And a second frequency magnitude recovery unit for denormalizing the normalized frequency magnitude data and the frequency magnitude data generated by the interpolation using power of the restored frequency magnitudes to restore frequency magnitude data. .

상기 기술적 과제들을 달성하기 위한 본 발명에 의한, 주파수 크기데이터 역양자화 장치는, 비트스트림에 포함된 주파수 크기들의 파워를 양자화한 값에 해당하는 RMS 인덱스를 역양자화하여 상기 주파수 크기들의 파워를 복원하는 주파수파워 복원부; 비트스트림에 포함된 양자화된 짝수번째 또는 홀수번째의 정규화된 주파수 크기데이터들을 역양자화하여 상기 짝수번째 또는 홀수번째의 정규화된 주파수 크기 데이터들을 복원하는 정규화데이터 복원부; 상기 복원된 정규화된 주파수 크기 데이터들을 보간하여, 정규화된 주파수 크기 데이터들 중 복원되지 않은 주파수 크기 데이터들을 생성하는 정규화데이터 보간부; 비트스트림에 포함된 양자화된 보간에러 데이터들을 역양자화하여, 보간에러 데이터들을 복원하는 보간에러 복원부; 및 상기 정규화데이터 복원부에서 복원된 주파수 크기데이터들과, 상기 정규화데이터 복원부에서 복원된 보간에 의해 생성된 주파수 크기데이터들과, 상기 보간에러 복원부에서 복원된 보간에러데이터들을 상기 주파수파워 복원부에서 복원된 주파수 크기들의 파워를 이용하여 역정규화하여, 주파수 크기 데이터들을 복원하는 제3주파수 크기 복원부를 포함함을 특징으로 한다.In accordance with an aspect of the present invention, an apparatus for dequantizing frequency magnitude data, which dequantizes an RMS index corresponding to a quantized value of power of frequency magnitudes included in a bitstream, restores power of the frequency magnitudes. Frequency power recovery unit; A normalized data recovery unit for inversely quantizing quantized even-numbered or odd-numbered normalized frequency magnitude data included in a bitstream to restore the even-numbered or odd-numbered normalized frequency magnitude data; A normalized data interpolation unit interpolating the restored normalized frequency magnitude data to generate unreconstructed frequency magnitude data among the normalized frequency magnitude data; An interpolation error recovery unit that inversely quantizes quantized interpolation error data included in the bitstream and restores interpolation error data; And frequency frequency data restored by the normalized data recovery unit, frequency size data generated by interpolation restored by the normalized data recovery unit, and interpolation error data restored by the interpolation error recovery unit. And a third frequency magnitude reconstruction unit for denormalizing the power of the frequency magnitudes restored by the reconstruction to restore frequency magnitude data.

상기 기술적 과제들을 달성하기 위한 본 발명에 의한, 오디오 복호화 장치는 주파수 크기를 복호화하는 주파수크기 복호화부; 주파수 위상을 복호화하는 주파수위상 복호화부; 및 상기 복호화된 주파수 크기 및 위상 데이터를 이용하여 광대역 에러 오디오 신호의 주파수 포락선을 복원하는 주파수 포락선 복원부를 포함함을 특징으로 하고, 상기 주파수 크기 복호화부는 비트스트림에 포함된 주파수크기의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 주파수 파워를 복원하는 주파수파워 복원부; 복원하고자 하는 주파수 크기의 개수에 해당하는 임펄스들을 생성하는 임펄스생성부; 및 상기 복원된 주파수 크기 파워를 곱하여 주파수 크기들을 복원하는 주파수 크기 복원부를 포함한다.In accordance with an aspect of the present invention, an audio decoding apparatus includes: a frequency size decoder to decode a frequency magnitude; A frequency phase decoder for decoding the frequency phase; And a frequency envelope reconstructing unit for reconstructing a frequency envelope of a wideband error audio signal using the decoded frequency magnitude and phase data, wherein the frequency magnitude decoding unit quantizes power of a frequency size included in a bitstream. A frequency power restorer for inversely quantizing a value (RMS index) to restore frequency power; An impulse generator for generating impulses corresponding to the number of frequency magnitudes to be restored; And a frequency magnitude restoring unit for restoring frequency magnitudes by multiplying the restored frequency magnitude power.

그리고 상기 기재된 발명을 컴퓨터에서 실행시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체를 제공한다.A computer readable recording medium having recorded thereon a program for executing the invention described above is provided.

이하, 첨부된 도면을 참조하여 본 발명에 의한 계층적 변환계수 양자화 및 역양자화를 이용한 광대역 오디오 부호화/복호화 장치 및 방법에 대해 상세히 설명하기로 한다. 도 4는 본 발명에 의한 주파수 크기데이터 양자화 장치를 이용한 오디오 부호화 장치의 구성을 블록도로 도시한 것으로서, 포락선 검출부(400), 주파수 크기/위상 획득부(420), 주파수크기 부호화부(440) 및 주파수 위상 부호화부(460)를 포함하여 이루어진다. 상기 포락선 검출부(400)는 오디오 신호의 광대역 에러 오디오 신호의 주파수 포락선을 구한다.Hereinafter, an apparatus and method for wideband audio encoding / decoding using hierarchical transform coefficient quantization and inverse quantization according to the present invention will be described in detail with reference to the accompanying drawings. 4 is a block diagram showing the configuration of an audio encoding apparatus using a frequency quantization apparatus according to the present invention, including an envelope detector 400, a frequency magnitude / phase acquisition unit 420, a frequency magnitude encoding unit 440, and A frequency phase encoder 460 is included. The envelope detector 400 obtains a frequency envelope of a wideband error audio signal of the audio signal.

상기 주파수 크기/위상 획득부(420)는 상기 검출된 주파수 포락선을 상기 광 대역 에러 오디호 신호에서 제거하고 주파수 크기와 위상을 구한다. 상기 주파수 크기 부호화부(440)는 상기 검출된 주파수 크기를 부호화한다. 상기 주파수 위상 부호화부(460)는 상기 검출된 주파수 위상을 부호화한다.The frequency magnitude / phase acquisition unit 420 removes the detected frequency envelope from the wideband error audio signal to obtain a frequency magnitude and phase. The frequency magnitude encoder 440 encodes the detected frequency magnitude. The frequency phase encoder 460 encodes the detected frequency phase.

도 5는 상기 주파수 크기 부호화부(440)의 보다 세부적인 구성을 블록도로 도시한 것으로서, 파워계산부(505), 파워양자화부(510), 정규화부(520), 정규화데이터 양자화부(530), 보간데이터 양자화부(540) 및 차분데이터 양자화부(550)를 포함하여 이루어진다. 상기 주파수 크기 부호화부(440)는 대역분리부(500)를 더 구비할 수 있다.5 is a block diagram illustrating a more detailed configuration of the frequency magnitude encoder 440. The power calculator 505, the power quantizer 510, the normalizer 520, and the normalized data quantizer 530 are shown in FIG. And an interpolation data quantization unit 540 and a differential data quantization unit 550. The frequency magnitude encoder 440 may further include a band separator 500.

상기 주파수 크기 부호화부(440)는 본 발명에 의한 주파수 크기 양자화장치의 일예가 될 수 있다. 본 발명에 의한 주파수 크기 양자화장치는 파워양자화부(510), 정규화부(520) 및 정규화데이터 양자화부(530)를 구비하며, 대역분리부(500), 보간부(540) 및 보간에러 양자화부(550)를 더 구비함이 바람직하다.The frequency magnitude coding unit 440 may be an example of a frequency magnitude quantization apparatus according to the present invention. Frequency magnitude quantization apparatus according to the present invention includes a power quantization unit 510, a normalization unit 520 and the normalized data quantization unit 530, the band separation unit 500, interpolation unit 540 and interpolation error quantization unit It is preferable to further provide 550.

상기 대역분리부(500)는 임의의 오디오 프레임을 복수의 대역으로 분리한다.The band separator 500 separates an arbitrary audio frame into a plurality of bands.

상기 파워계산부(505)는 각 대역별로 상기 분리된 각 대역을 구성하는 주파수 크기들의 파워를 계산한다. 상기 파워양자화부(510)는 상기 계산된 파워를 양자화한다. 상기 정규화부(520)는 상기 양자화된 값을 이용하여 대역별로 주파수 크기 데이터를 정규화한다. 상기 정규화데이터 양자화부(530)는 상기 정규화된 주파수 크기데이터의 짝수 또는 홀수번째 데이터들을 양자화한다. 상기 보간부(540)는 상기 정규화부에서 정규화된 주파수 크기 데이터들 중 상기 정규화데이터 양자화부에서 양자화되지 않은 데이터를, 상기 양자화된 주파수 크기 데이터를 이용하 여 보간에 의해 생성한다. 상기 보간에러 양자화부(550)는 상기 정규화된 주파수 크기 데이터들 중 양자화되지 않은 데이터와 상기 보간에 의해 생성된 데이터의 차에 해당하는 보간에러를 구하여 양자화한다.The power calculator 505 calculates power of frequency magnitudes constituting the separated bands for each band. The power quantization unit 510 quantizes the calculated power. The normalizer 520 normalizes frequency size data for each band by using the quantized value. The normalized data quantizer 530 quantizes even or odd-numbered data of the normalized frequency magnitude data. The interpolator 540 generates data that is not quantized by the normalized data quantizer among the frequency size data normalized by the normalizer by interpolation using the quantized frequency magnitude data. The interpolation error quantizer 550 obtains and quantizes an interpolation error corresponding to a difference between unquantized data among the normalized frequency magnitude data and data generated by the interpolation.

도 6은 본 발명에 의한 본 발명에 의한 주파수 크기데이터 양자화 방법을 이용한 오디오 부호화 방법을 흐름도로 도시한 것이다. 도 6을 참조하면 먼저 오디오 신호의 광대역 에러 오디오 신호의 주파수 포락선을 구한다.(600단계) 상기 검출된 주파수 포락선을 상기 광대역 에러 오디호 신호에서 제거하고 주파수 크기와 위상을 구한다.(620단계) 상기 검출된 주파수 크기와 위상을 부호화한다.(640단계)6 is a flowchart illustrating an audio encoding method using the frequency magnitude data quantization method according to the present invention. Referring to FIG. 6, first, a frequency envelope of a wideband error audio signal of an audio signal is obtained (step 600). The detected frequency envelope is removed from the wideband error audio signal and a frequency magnitude and phase are obtained (step 620). The detected frequency magnitude and phase are encoded (step 640).

도 7은 상기 640단계에서 주파수 크기를 부호화를 위한 주파수 크기 양자화과정을 흐름도로 도시한 것이다. 먼저, 주파수 영역으로 변환된 오디오 신호의 프레임을 복수의 대역으로 분리한다.(700단계) 각 대역별로 상기 분리된 각 대역을 구성하는 주파수 크기들의 파워를 계산하여 양자화한다.(710단계) 상기 양자화된 값을 이용하여 대역별로 주파수 크기 데이터를 정규화한다.(720단계) 그리고 나서 상기 정규화된 주파수 크기데이터의 짝수 또는 홀수번째 데이터들을 양자화한다.(730단계) 상기 정규화된 주파수 크기 데이터들 중 상기 730단계에서 양자화되지 않은 데이터는 상기 730단계에서 양자화된 데이터를 이용하여 보간에 의해 생성한다.(740단계) 상기 정규화된 주파수 크기 데이터들 중 양자화되지 않은 주파수 크기 데이터들과 상기 보간에 의해 생성된 데이터들의 차에 해당하는 보간에러를 구하여 양자화한다.(750단계)FIG. 7 is a flowchart illustrating a frequency magnitude quantization process for encoding the frequency magnitude in step 640. First, a frame of an audio signal converted into a frequency domain is separated into a plurality of bands (step 700). The power of frequency magnitudes constituting the separated bands is calculated and quantized for each band (step 710). The frequency magnitude data is normalized for each band using the determined values (step 720). Then, the even or odd-numbered data of the normalized frequency magnitude data are quantized (step 730). Data that is not quantized in step 420 is generated by interpolation using the data quantized in step 730. In step 740, unquantized frequency size data and data generated by interpolation among the normalized frequency size data are generated. Find and quantize the interpolation error corresponding to the difference between them (step 750).

도 8은 본 발명의 바람직한 실시 예에 따른 오디오 부호화 장치를 블록도로 도시한 것이다. 도 8을 참조하여 본 발명에 의한 오디오 부호화 장치의 바람직한 동작 과정을 살펴 보면, 먼저 16kHz 원신호는 제1다운 샘플링부(800)로 입력된 후 8kHz 신호로 변환된 후 협대역 코어 코덱(810)으로 입력된다. 상기 다운 샘플링 된 원 신호는 협대역 코어 코덱(810)에서 합성된 후 제1오버 샘플링부(820)를 통하여 16kHz의 신호로 변환된다. 이 때 상기 합성된 16kHz의 신호는 협대역만 합성되었기 때문에 고대역 주파수는 합성되지 않음을 알 수 있다. 따라서 고대역 신호와 협대역 코어 코덱에서 합성하지 못한 신호를 합성해 주기 위하여 감산기(830)를 통하여 상기 광대역 원신호와 상기 합성된 16kHz의 신호의 에러를 추출한다. 상기 추출된 16kHz의 에러신호는 제2다운샘플링부(840)를 통하여 12.8kHz의 신호로 다운 샘플링 된다. 상기 12.8kHz로 다운 샘플링 된 에러신호는 선형예측&양자화부(850)로 입력된다. 상기 선형예측&양자화부(850)에서는 먼저 상기 12.8kHz 신호의 주파수 포락선을 분석하기 위하여 Auto-correlation method와 Levinson Durbin 알고리즘을 이용하여 선형예측 계수(linear prediction coefficient)를 구한다. 상기 추출된 선형예측 계수의 저대역 성분은 협대역 코어 코덱(810)에서 생성한 선형예측 계수로 대체되고 고대역 성분만이 벡터 양자화기(880)에 의하여 양자화된다. 이는 복호화기에서도 선형예측 계수를 알기 위함이다. 8 is a block diagram illustrating an audio encoding apparatus according to an embodiment of the present invention. Referring to FIG. 8, a preferred operation process of the audio encoding apparatus according to the present invention will be described. First, a 16 kHz original signal is input to the first down sampling unit 800 and then converted into an 8 kHz signal, followed by a narrowband core codec 810. Is entered. The down-sampled original signal is synthesized by the narrowband core codec 810 and then converted into a 16 kHz signal through the first over sampling unit 820. In this case, since the synthesized 16 kHz signal is synthesized only in the narrow band, it can be seen that the high band frequency is not synthesized. Accordingly, in order to synthesize a high band signal and a signal not synthesized by a narrow band core codec, an error of the wide band original signal and the synthesized 16 kHz signal is extracted through a subtractor 830. The extracted 16 kHz error signal is down-sampled into a 12.8 kHz signal through the second down sampling unit 840. The error signal downsampled at 12.8 kHz is input to the linear prediction & quantization unit 850. In order to analyze the frequency envelope of the 12.8 kHz signal, the linear prediction & quantization unit 850 first obtains a linear prediction coefficient using an auto-correlation method and a Levinson Durbin algorithm. The low band components of the extracted linear prediction coefficients are replaced with the linear prediction coefficients generated by the narrowband core codec 810 and only the high band components are quantized by the vector quantizer 880. This is to know the linear prediction coefficients in the decoder.

상기 과정까지 입력 샘플에 대한 처리 단위를 프레임이라고 할 때, 하나의 프레임은 두 개의 하위 프레임으로 나뉘어 지고, 이 후 과정은 각각의 하위 프레임에 대해 부호화 과정이 수행된다. 이 때 첫 번째 하위 프레임을 제1하위 프레임이 라 정의하고, 두 번째 하위 프레임을 제2하위 프레임이라고 정의하며, L번째 하위프레임은 제L하위 프레임이라고 정의한다.When the processing unit for the input sample up to the above process is called a frame, one frame is divided into two subframes, and then the encoding process is performed on each subframe. In this case, the first subframe is defined as a first subframe, the second subframe is defined as a second subframe, and the Lth subframe is defined as an Lth subframe.

구해진 선형예측 계수를 통하여 상기 12.8kHz 에러 신호의 선형예측 분석이 수행된다. 이 과정을 주파수 도메인에서 해석한다면 신호의 주파수 포락선을 제거하여 주파수 도메인 특성을 평평하게 만드는 효과를 가진다. 상기 선형예측 분석 및 양자화 블록을 통하여 선형예측 잔차 신호를 생성할 수 있으며, 상기 선형예측 잔차 신호는 시간-주파수 매핑부(860)로 입력되어 주파수 도메인으로 변환된다. 상기 주파수 변환 과정을 수행하기 위하여 일 예로 FFT(Fast Fourier Transform)를 사용하였으며, 다른 주파수 변환 장치를 사용할 수도 있다.The linear prediction analysis of the 12.8 kHz error signal is performed through the obtained linear prediction coefficients. If this process is interpreted in the frequency domain, it has the effect of flattening the frequency domain characteristics by removing the frequency envelope of the signal. The linear prediction residual signal may be generated through the linear prediction analysis and quantization block, and the linear prediction residual signal is input to the time-frequency mapping unit 860 and converted into a frequency domain. In order to perform the frequency conversion process, an FFT (Fast Fourier Transform) is used as an example, and another frequency conversion device may be used.

상기 과정에서 FFT를 사용할 경우, N 개의 시간 도메인 값이 주파수 변환된다면 복소수 형태의 2N 개의 주파수 성분이 출력되며, 2N개의 성분 중 0 번째와 N 번째를 제외하고는 모두 대칭형으로 존재한다. 또한 나이키스트(Nyquist) 주파수 성분인 N 번째 데이터는 0으로 처리하여 총 2N 개의 복소수 값 중 N개의 복소수 값을 부호화하여 신호를 표현할 수 있다. 상기 대역별 분할에 대한 설명은 도 9를 참조하면 된다.When the FFT is used in the above process, if the N time domain values are frequency-converted, 2N frequency components in complex form are output, and all of them except for the 0th and Nth components are symmetrical. In addition, the N-th data, which is a Nyquist frequency component, may be processed as 0 to encode a N complex value out of a total of 2N complex values to express a signal. The division of the band is described with reference to FIG. 9.

상기 복소수값은 변환계수 양자화부(870)에서 양자화된다. 상기 복소수 값은 주파수 크기와 위상정보로 분리하여 양자화된다. 여기서 주파수 위상정보는 VQ(vector quantization), SQ(Scalar quantization), SVQ(Split VQ), MSVQ(Multi-stage Split VQ)등 전송률, 메모리, 복잡도 등의 제한에 따라 다양한 양자화 방식이 사용될 수 있다.The complex value is quantized by the transform coefficient quantization unit 870. The complex value is quantized by separating the frequency magnitude and the phase information. In this case, various quantization schemes may be used for the frequency phase information according to limitations of transmission rate, memory, and complexity such as vector quantization (VQ), scale quantization (SQ), split VQ (SVQ), and multi-stage split VQ (MSVQ).

여기서 주파수 크기정보는 도 9와 같은 방식으로 계층적으로 양자화된다. 도 9는 본 발명에 의한 오디오 신호의 주파수 크기 양자화방법에 대한 일예의 개념도이다. 도 9를 참조하면, 먼저 짝수번째 하위 프레임에 대해서 주파수 크기는 K 개의 대역으로 분리되고, 상기 분리된 각 대역에 해당하는 주파수 크기들은 파워계산부(900)로 입력되어 주파수 파워 크기가 계산된다. 주파수 파워 p는 수학식 1에 의하여 계산된다.In this case, the frequency magnitude information is quantized hierarchically in the same manner as in FIG. 9. 9 is a conceptual diagram of an example of a frequency magnitude quantization method of an audio signal according to the present invention. Referring to FIG. 9, first, frequency magnitudes are divided into K bands for even-numbered lower frames, and frequency magnitudes corresponding to each of the separated bands are input to the power calculator 900 to calculate frequency power magnitudes. The frequency power p is calculated by equation (1).

여기서 s와 e는 해당 대역의 첫 번째 주파수 인덱스와 마지막 주파수 인덱스를 나타내며, m_n은 짝수 번째 하위 프레임에서 n 번째 주파수 크기를 나타낸다. 따라서 K개의 대역으로 나눴다면 K개의 주파수 파워정보가 생성되며, 주파수 파워정보는 파워양자화부(905)에서 양자화된다. 각각 대역의 주파수 파워정보는 상호간에 강한 상관관계를 갖고 있기 때문에 K개의 벡터로 묶인 뒤 벡터양자화된다. 양자화된 파워정보는 복호화기로 전송되며, 복호화시 스케일러블(scalable)한 구조를 지원하다 보면 각 계층들이 정확한 에너지를 복원하기 위해서 각 계층별로 추가적인 이득(gain)이 필요한데, 이 방식을 이용하면 항상 최종 크기가 정해져 있으므로 추가적인 이득(gain)이 필요 없어진다.Where s and e represent the first frequency index and the last frequency index of the corresponding band, and m _n represents the nth frequency magnitude in the even subframe. Therefore, if divided into K bands, K frequency power information is generated, and the frequency power information is quantized by the power quantization unit 905. Since the frequency power information of each band has a strong correlation with each other, they are grouped into K vectors and then quantized. The quantized power information is transmitted to the decoder, and when decoding supports a scalable structure, each layer needs an additional gain for each layer in order to recover accurate energy. The size is fixed so no additional gain is needed.

이 후, 주파수 크기는 크기 정규화부(910)에서 각각의 대역에 해당하는 양자화된 주파수 파워값으로 나눠 정규화된다. 정규화된 대역별 주파수 크기벡터는 동 일하게 다음과 같이 양자화된다. 이 중에서 하나의 대역에 대한 양자화 방식은 다음과 같다. 한 대역의 주파수 크기 벡터는 짝수위치 양자화부(915)에서 짝수 위치가 먼저 양자화된다. 본 짝수 위치의 양자화에는 VQ(vector quantization), SQ(Scalar quantization), SVQ (Split VQ), MSVQ(Multi-stage Split VQ)등 전송률, 메모리, 복잡도 등의 제한에 따라 다양한 양자화 방식이 사용될 수 있다.Thereafter, the frequency magnitude is normalized by dividing the frequency magnitude into quantized frequency power values corresponding to respective bands. The normalized frequency-specific frequency magnitude vector is quantized as follows. The quantization scheme for one of these bands is as follows. The frequency magnitude vector of one band is first quantized by the even position quantizer 915. Various quantization methods may be used for quantization of even-numbered positions according to limitations of data rate, memory, and complexity such as vector quantization (VQ), scale quantization (SQ), split VQ (SVQ), and multi-stage split VQ (MSVQ). .

이 후 홀수 번째 주파수 크기를 보상해 주기 위하여 양자화된 짝수 번째 주파수 크기를 이용하여 큐빅(cubic) 보간부(920)에 의해 보간이 수행된다. 보간식은 수학식 2와 같다.Thereafter, interpolation is performed by the cubic interpolator 920 using the quantized even frequency size to compensate for the odd frequency size. The interpolation formula is the same as Equation 2.

여기서 m는 양자화된 홀수 번째 주파수의 2차 차분값을 나타내며 수학식 3과 같다.Where m represents the second-order difference value of the quantized odd-numbered frequency.

양자화된 홀수 번째 주파수정보는 보간된 정보이므로 에러가 높다고 할 수 있다. 그러므로 이 부분의 정확도를 높여주기 위해 보간에러 양자화부(925)에서 홀수 위치의 보간 에러신호를 다시 한번 양자화한다. 여기서도 짝수 위치의 양자화와 동일하게 VQ(vector quantization), SQ(Scalar quantization), SVQ (Split VQ), MSVQ(Multi-stage Split VQ)등 전송률, 메모리, 복잡도 등의 제한에 따라 다양한 양자화 방식이 사용될 수 있다. 나머지 대역은 도 9에서 처럼 동일한 방식으로 양자화 된다.Since the quantized odd-numbered frequency information is interpolated information, the error may be high. Therefore, in order to increase the accuracy of this part, the interpolation error quantizer 925 quantizes the interpolation error signal at odd positions once again. Here, as in even-numbered quantization, various quantization schemes are used depending on the limitations of data rate, memory, and complexity such as vector quantization (VQ), scale quantization (SQ), split VQ (SVQ), and multi-stage split VQ (MSVQ). Can be. The remaining bands are quantized in the same manner as in FIG.

이와 같이 짝수 번째 하위 프레임에 대해서 양자화가 완료되면 이 정보를 이용하여 홀수 번째 하위 프레임의 값을 프레임간의 보간을 통하여 구한다.As described above, when quantization is completed for the even subframe, the value of the odd subframe is obtained by interpolation between frames using this information.

홀수번째 하위 프레임에 대해서는 상기의 양자화 과정으로 처리하지 않고 수학식 4와 같은 방법으로 보간된다.The odd-numbered lower frames are interpolated in the same manner as in Equation 4 without being processed by the above quantization process.

여기서 m_n,1 는 n 번째 프레임에서 홀수번째 하위 프레임, m_n-1,2 는 n-1 번째 프레임에서 짝수번째 하위 프레임, m_n,2 는 n 번째 프레임에서 짝수번째 하위 프레임을 나타낸다.Where m _{n, 1} represents an odd subframe in the nth frame, m _n-1,2 represents an even subframe in the n-1th frame, and m _{n, 2} represents an even subframe in the nth frame.

상기 과정을 통하여 양자화된 짝수번째 하위 프레임 또는 보간된 홀수 번째 하위 프레임의 주파수 크기는 상기 양자화된 주파수 파워를 곱하여 스케일된다.The frequency magnitude of the quantized even subframe or the interpolated odd subframe is scaled by multiplying the quantized frequency power.

도 10은 본 발명에 의한 주파수 크기 양자화 방법 및 장치에 의해 제공되는 비트스트림의 구성을 나타낸다. 비트스트림은 RMS indices, Even amplitude indices, Odd amplitude indices의 순으로 정렬된다. 도 10에서 1000은 디코더로 RMS indices만 전송된 것을 의미한다. 이 경우는 양자화 에러가 가장 많지만 기본적인 정보가 있으므로 일단 디코딩은 가능하다. 그 다음 1010는 RMS indices와 Even amplitude indices가 전송된 경우이다. 그리고 마지막 1020은 RMS indices, Even amplitude indices, Odd amplitude indices가 모두 전송된 경우이며 양자화 에러가 가장 적은 경우이다. 이러한 조합으로 음질에 대한 scalability를 지원할 수 있게 된다.10 shows a configuration of a bitstream provided by a frequency magnitude quantization method and apparatus according to the present invention. The bitstream is arranged in order of RMS indices, Even amplitude indices, and Odd amplitude indices. In FIG. 10, 1000 means that only RMS indices are transmitted to the decoder. In this case, the quantization error is the most but there is basic information, so decoding is possible once. Next, 1010 is the case where RMS indices and Even amplitude indices are transmitted. The last 1020 is the case where the RMS indices, Even amplitude indices, and Odd amplitude indices are all transmitted, with the least quantization error. This combination can support scalability for sound quality.

도 11은 본 발명에 의한 주파수 크기데이터 역양자화부에 대한 일 실시예를 도시한 것으로서, 주파수파워 복원부(1100), 임펄스열생성부(1120) 및 제1주파수크기 복원부(1140)을 포함하여 이루어진다. FIG. 11 illustrates an embodiment of a frequency quantization dequantizer according to the present invention, and includes a frequency power restorer 1100, an impulse heat generator 1120, and a first frequency magnitude restorer 1140. It is done by

상기 주파수파워 복원부(1100)는 비트스트림에 포함된 주파수크기의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 주파수 파워를 복원한다. 상기 임펄스열생성부(1120)는 복원하고자 하는 주파수 크기의 개수에 해당하는 임펄스들을 생성한다. 상기 제1주파수크기 복원부(1140)는 상기 임펄스열에 상기 복원된 주파수 크기 파워를 곱하여 주파수 크기들을 복원한다.The frequency power recovery unit 1100 restores frequency power by inverse quantizing a value (RMS index) of quantizing the power of the frequency size included in the bitstream. The impulse heat generation unit 1120 generates impulses corresponding to the number of frequency magnitudes to be restored. The first frequency magnitude recovery unit 1140 restores frequency magnitudes by multiplying the impulse string by the restored frequency magnitude power.

도 12는 본 발명에 의한 주파수 크기데이터 역양자화부에 대한 다른 실시예를 도시한 것으로서, 주파수파워 복원부(1200), 정규화 데이터 복원부(1220), 정규화데이터 보간부(1240) 및 제2주파수 크기 복원부(1260)를 포함하여 이루어진다. 12 illustrates another embodiment of the frequency magnitude data dequantization unit according to the present invention, and includes a frequency power recovery unit 1200, a normalized data recovery unit 1220, a normalized data interpolation unit 1240, and a second frequency. A size restoration unit 1260 is included.

상기 주파수파워 복원부(1200)는 비트스트림에 포함된 주파수 크기들의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 상기 주파수 크기들의 파워를 복원한다. 상기 정규화 데이터 복원부(1220)는 비트스트림에 포함된 양자화된 짝수번째 또는 홀수번째의 정규화된 주파수 크기데이터들을 역양자화하여 상기 짝수번째 또는 홀수번째의 정규화된 주파수 크기 데이터들을 복원한다. 상기 정규화데이터 보 간부(1240)는 상기 복원된 정규화된 주파수 크기 데이터들을 보간하여, 정규화된 주파수 크기 데이터들 중 복원되지 않은 주파수 크기 데이터들을 생성한다. 상기 제2주파수 크기 복원부(1260)는 상기 정규화된 주파수 크기데이터들과 상기 보간에 의해 생성된 주파수 크기 데이터들을 상기 복원된 주파수 크기들의 파워를 이용하여 역정규화하여 주파수 크기 데이터들을 복원한다.The frequency power recovery unit 1200 inversely quantizes a value (RMS index) of quantizing power of frequency magnitudes included in the bitstream to restore power of the frequency magnitudes. The normalized data recovery unit 1220 dequantizes the quantized even or odd normalized frequency magnitude data included in the bitstream to restore the even or odd normalized frequency magnitude data. The normalized data interpolator 1240 interpolates the reconstructed normalized frequency magnitude data to generate unreconstructed frequency magnitude data among the normalized frequency magnitude data. The second frequency magnitude recovery unit 1260 restores frequency magnitude data by denormalizing the normalized frequency magnitude data and the frequency magnitude data generated by the interpolation using the power of the restored frequency magnitudes.

도 13은 본 발명에 의한 주파수 크기데이터 역양자화부에 대한 또 다른 실시예를 도시한 것으로서, 주파수파워 복원부(1300), 정규화 데이터 복원부(1310), 정규화데이터 보간부(1320), 보간에러 복원부(1330) 및 제3주파수 크기 복원부(1340)를 포함하여 이루어진다. FIG. 13 illustrates another embodiment of the frequency quantization data dequantization unit according to the present invention, and includes a frequency power recovery unit 1300, a normalized data recovery unit 1310, a normalized data interpolation unit 1320, and an interpolation error. The restoration unit 1330 and the third frequency magnitude restoration unit 1340 are included.

상기 주파수파워 복원부(1300)는 비트스트림에 포함된 주파수 크기들의 파워를 양자화한 값에 해당하는 RMS 인덱스를 역양자화하여 상기 주파수 크기들의 파워를 복원한다. 상기 정규화 데이터 복원부(1310)는 비트스트림에 포함된 양자화된 짝수번째 또는 홀수번째의 정규화된 주파수 크기데이터들을 역양자화하여 상기 짝수번째 또는 홀수번째의 정규화된 주파수 크기 데이터들을 복원한다.The frequency power recovery unit 1300 restores power of the frequency magnitudes by inversely quantizing an RMS index corresponding to a quantized value of powers of frequency magnitudes included in the bitstream. The normalized data recovery unit 1310 dequantizes the quantized even-numbered or odd-numbered normalized frequency magnitude data included in the bitstream to restore the even-numbered or odd-numbered normalized frequency magnitude data.

상기 정규화데이터 보간부(1320)는 상기 복원된 정규화된 주파수 크기 데이터들을 보간하여, 정규화된 주파수 크기 데이터들 중 복원되지 않은 주파수 크기 데이터들을 생성한다. 상기 보간에러 복원부(1330)는 비트스트림에 포함된 양자화된 보간에러 데이터들을 역양자화하여, 보간에러 데이터들을 복원한다.The normalized data interpolator 1320 interpolates the restored normalized frequency magnitude data to generate unreconstructed frequency magnitude data among the normalized frequency magnitude data. The interpolation error recovery unit 1330 inverse quantizes the quantized interpolation error data included in the bitstream and restores the interpolation error data.

상기 제3주파수 크기 복원부(1340)는 상기 정규화데이터 복원부에서 복원된 주파수 크기데이터들과, 상기 정규화데이터 복원부에서 복원된 보간에 의해 생성된 주파수 크기데이터들과, 상기 보간에러 복원부에서 복원된 보간에러데이터들을 상기 주파수파워 복원부에서 복원된 주파수 크기들의 파워를 이용하여 역정규화하여, 주파수 크기 데이터들을 복원한다.The third frequency magnitude recovery unit 1340 may include frequency magnitude data restored by the normalized data recovery unit, frequency magnitude data generated by interpolation restored by the normalized data recovery unit, and the interpolation error recovery unit. The reconstructed interpolation error data are denormalized using the powers of the frequency magnitudes restored by the frequency power recovery unit to recover frequency magnitude data.

도 14는 본 발명에 의한 주파수 크기 데이터 역양자화장치를 이용한 오디오 신호 복호화 장치의 구성을 블록도로 도시한 것으로서, 주파수 크기 복호화부(1400), 주파수 위상 복원부(1420) 및 주파수 포락선 복원부(1440)를 포함하여 이루어진다. FIG. 14 is a block diagram showing the configuration of an audio signal decoding apparatus using the frequency quantization dequantizer according to the present invention. The frequency magnitude decoding unit 1400, the frequency phase recovery unit 1420, and the frequency envelope recovery unit 1440 )

상기 주파수 크기 복호화부(1400)는 주파수 크기를 복호화한다. 상기 주파수 위상 복원부(1420)는 주파수 위상을 복호화한다. 상기 주파수 포락선 복원부(1440)는 상기 복호화된 주파수 크기 및 위상 데이터를 이용하여 광대역 에러 오디오 신호의 주파수 포락선을 복원한다. 상기 주파수 크기 복호화부(1400)의 보다 구체적인 구성은 도 12 내지 도 14에 도시된 본 발명에 의한 주파수 크기데이터 역양자화 장치가 될 수 있다. 그 상세한 내용은 상술한 바와 동일 하므로 생략하기로 한다. The frequency magnitude decoder 1400 decodes the frequency magnitude. The frequency phase recovery unit 1420 decodes the frequency phase. The frequency envelope recovery unit 1440 restores the frequency envelope of the wideband error audio signal using the decoded frequency magnitude and phase data. A more detailed configuration of the frequency magnitude decoder 1400 may be a frequency magnitude data dequantizer according to the present invention shown in FIGS. 12 to 14. Details thereof are the same as described above, and thus will be omitted.

도 15는 본 발명에 의한 주파수 크기 데이터 역양자화 방법의 일 실시예를 흐름도로 도시한 것이다. 먼저, 비트스트림에 포함된 주파수크기의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 주파수 크기들의 파워를 복원한다.(1500단계) 그리고 나서 복원하고자 하는 주파수 크기의 개수에 해당하는 임펄스들과 상기 복원된 주파수 크기 파워를 곱하여 주파수 크기들을 복원한다.(1550단계)15 is a flowchart illustrating an embodiment of a method of dequantizing frequency magnitude data according to the present invention. First, the quantized value (RMS index) of the power of the frequency size included in the bitstream is inversely quantized to restore power of frequency magnitudes (step 1500). Then, the impulses corresponding to the number of frequency magnitudes to be restored and The frequency magnitudes are restored by multiplying the restored frequency magnitude power (step 1550).

도 16은 본 발명에 의한 주파수 크기 데이터 역양자화 방법의 다른 실시예를 흐름도로 도시한 것이다. 먼저, 비트스트림에 포함된 주파수 크기들의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 상기 주파수 크기들의 파워를 복원한다.(1600단계) 그리고 나서 비트스트림에 포함된 양자화된 짝수번째 또는 홀수번째의 정규화된 주파수 크기데이터들을 분리하여 역양자화함으로써 상기 짝수번째 또는 홀수번째의 정규화된 주파수 크기 데이터들을 복원한다.(1620단계) 상기 복원된 정규화된 주파수 크기 데이터들을 보간하여, 정규화된 주파수 크기 데이터들 중 복원되지 않은 주파수 크기 데이터들을 생성한다.(1640단계) 상기 정규화된 주파수 크기데이터들과 상기 보간에 의해 생성된 주파수 크기 데이터들을 상기 복원된 주파수 크기들의 파워를 이용하여 역정규화하여 주파수 크기 데이터들을 복원한다.(1660단계)16 is a flowchart illustrating another embodiment of the method of frequency quantizing data inverse quantization according to the present invention. First, the quantized value (RMS index) of the power of frequency magnitudes included in the bitstream is inversely quantized to restore power of the frequency magnitudes (step 1600). Then, the quantized even or odd number included in the bitstream is recovered. The even-numbered or odd-numbered normalized frequency magnitude data is reconstructed by separating and inverse quantizing the normalized frequency magnitude data of (step 1620). The normalized frequency magnitude data is interpolated by interpolating the restored normalized frequency magnitude data. In operation 1640, the normalized frequency magnitude data and the frequency magnitude data generated by the interpolation are denormalized by using the power of the restored frequency magnitudes. Restore (step 1660).

도 17은 본 발명에 의한 주파수 크기 데이터 역양자화 방법의 다른 실시예를 흐름도로 도시한 것이다. 먼저, 비트스트림에 포함된 주파수 크기들의 파워를 양자화한 값에 해당하는 RMS 인덱스를 역양자화하여 상기 주파수 크기들의 파워를 복원한다.(1700단계) 비트스트림에 포함된 양자화된 짝수번째 또는 홀수번째의 정규화된 주파수 크기데이터들을 역양자화하여 상기 짝수번째 또는 홀수번째의 정규화된 주파수 크기 데이터들을 복원한다.(1720단계) 상기 복원된 정규화된 주파수 크기 데이터들을 보간하여, 정규화된 주파수 크기 데이터들 중 복원되지 않은 주파수 크기 데이터들을 생성한다.(1740단계) 비트스트림에 포함된 양자화된 보간에러 데이터들을 역양자화하여, 보간에러 데이터들을 복원한다.(1760단계) 상기 복원된 주파수 크기데이터들과 상기 복원된 보간에 의해 생성된 주파수 크기데이터들과 상 기 복원된 보간에러데이터들을 상기 복원된 주파수 크기들의 파워를 이용하여 역정규화하여 주파수 크기 데이터들을 복원한다.(1780단계)17 is a flowchart illustrating another embodiment of a method of dequantizing frequency magnitude data according to the present invention. First, the inverse quantization of the RMS index corresponding to the quantized value of the power of the frequency magnitudes included in the bitstream restores the power of the frequency magnitudes (step 1700). Inverse quantization of the normalized frequency magnitude data restores the even or odd numbered normalized frequency magnitude data (step 1720). The interpolated normalized frequency magnitude data is interpolated to restore one of the normalized frequency magnitude data. And generate quantized interpolation error data (step 1740). The quantized interpolation error data included in the bitstream is inversely quantized to restore interpolation error data (step 1760). Reconstructing the frequency magnitude data generated by the interpolation error data Normalizing station using the power of the frequency content and recover the frequency content of the data (step 1780)

도 18은 본 발명에 의한 오디오 주파수 크기 데이터 역양자화 방법을 이용한 오디오 복호화 방법을 흐름도로 도시한 것이다. 주파수 크기를 복호화한다.(1800단계) 또한 주파수 위상을 복호화한다.(1820단계) 상기 복호화된 주파수 크기 및 위상 데이터를 이용하여 광대역 에러 오디오 신호의 주파수 포락선을 복원한다.(1840단계) 상기 1800단계는 도 15 내지 도 17에 보다 상세한 흐름을 도시하였다. 따라서 주파수 크게 데이터 역양자화 방법에서 상술하였으므로 생략하기로 한다. 18 is a flowchart illustrating an audio decoding method using an audio frequency magnitude data dequantization method according to the present invention. Decode the frequency magnitude (step 1800). Also, decode the frequency phase (step 1820). The frequency envelope of the wideband error audio signal is restored using the decoded frequency magnitude and phase data (step 1840). Shows a more detailed flow in FIGS. 15-17. Therefore, since the frequency is largely described in the data dequantization method, it will be omitted.

도 19는 하나의 대역에 대한 주파수 크기의 역양자화 장치에 대한 일 실시예의 구성을 블록도로 도시한 것이다. 각 대역은 scalability를 제공하기 위해서 도 15와 동일한 방식으로 역양자화가 가능하다.FIG. 19 is a block diagram illustrating a configuration of an embodiment of an inverse quantization apparatus having a frequency magnitude for one band. Each band can be dequantized in the same manner as in FIG. 15 to provide scalability.

먼저 첫번째 계층의 역양자화는 다음과 같이 수행된다. 먼저 전송받은 비트스트림에서 RMS indices가 파워 역양자화부(1900)에서 해당하는 대역의 주파수 파워를 역양자화 한다. 다음으로 임펄스 열 발생부(1920)에서 대역의 크기 개수에 해당하는 임펄스를 발생시킨다. 각각의 출력을 서로 곱해서 첫번째 계층에 대한 짝수 번째 하위 프레임의 크기 벡터를 재생한다. 이 벡터를 이용하여 세 번째 계층에 대한 홀수 번째 하위 프레임에 대해서는 프레임간 보간부(1940)에 의해 이전 프레임의 마지막 짝수 번째 하위 프레임과 현재 프레임의 짝수 번째 하위 프레임간의 프레임간 보간을 하여 그 사이에 있는 프레임의 크기정보를 구한다. 프레임간의 보간은 위에서 설명한 방식과 동일하다. First, dequantization of the first layer is performed as follows. First, the RMS indices dequantize the frequency power of the corresponding band in the power dequantizer 1900 in the received bitstream. Next, the impulse heat generator 1920 generates an impulse corresponding to the number of band sizes. Each output is multiplied with each other to reproduce the magnitude vector of the even-numbered lower frame for the first layer. The odd-numbered subframes for the third layer are interpolated between the last even-numbered subframe of the previous frame and the even-numbered subframe of the current frame by using the vector. Get size information of a frame. Interpolation between frames is the same as described above.

도 20은 하나의 대역에 대한 주파수 크기의 역양자화 장치에 대한 다른 실시예의 구성을 블록도로 도시한 것이다. 두 번째 계층의 역양자화는 다음과 같이 수행된다. 전송받은 비트스트림에서 짝수크기 인덱스(Even amplitude indices)가 짝수 위치 역양자화부(2000)에서 짝수 위치의 크기벡터를 역양자화 한다. 다음으로 홀수 위치 보간부(2010)에서 홀수 위치의 크기 벡터를 구한다. 그리하여 곱셈기(2020)를 통해 전체 크기 벡터에 파워를 곱하면 두 번째 계층에 대한 짝수 번째 하위 프레임의 크기벡터를 재생한다. 상기 벡터를 이용하여 세 번째 계층에 대한 홀수 번째 하위 프레임에 대해서는 프레임간 보간부(2030)에 의해 이전 프레임의 마지막 짝수 번째 하위 프레임과 현재 프레임의 짝수 번째 하위 프레임간의 프레임간 보간을 하여 그 사이에 있는 프레임의 크기정보를 구한다. 프레임간의 보간은 위에서 설명한 방식과 동일하다.20 is a block diagram showing the configuration of another embodiment of an inverse quantization apparatus of frequency magnitude for one band. Inverse quantization of the second layer is performed as follows. Even amplitude indices in the received bitstream dequantize the magnitude vectors of even positions in the even position inverse quantizer 2000. Next, the odd position interpolation unit 2010 obtains the magnitude vector of the odd position. Thus, multiplying the full size vector by the power through the multiplier 2020 reproduces the size vector of the even-numbered lower frame for the second layer. The odd-numbered subframes for the third layer are interpolated between the last even-numbered subframe of the previous frame and the even-numbered subframe of the current frame by using the vector. Get size information of a frame. Interpolation between frames is the same as described above.

다음으로 세 번째 계층의 역양자화는 다음과 같이 수행된다. 먼저 전송받은 비트스트림에서 홀수 크기 인덱스(Odd amplitude indices)가 보간에러 역양자화부(2040)에서 홀수 위치 보간에러 역양자화를 수행한다. 이 결과와 두번째 계층에서 구한 전체 크기 벡터를 덧셈기(2050)를 통해 더한 후, 곱셈기(2060)를 통해 파워를 곱하면 세 번째 계층에 대한 짝수 번째 하위 프레임의 크기벡터를 재생한다. 이 벡터를 이용하여 세 번째 계층에 대한 홀수 번째 하위 프레임에 대해서는 프레임간 보간부(2070)에 의해 이전 프레임의 마지막 짝수 번째 하위 프레임과 현재 프레임의 짝수 번째 하위 프레임간의 프레임간 보간을 하여 그 사이에 있는 프레임의 크기정보를 구한다. 프레임간의 보간은 위에서 설명한 방식과 동일하다.Next, dequantization of the third layer is performed as follows. In the first bitstream, odd amplitude indices perform odd-position interpolation error dequantization in the interpolation error dequantization unit 2040. After adding the result and the full size vector obtained in the second layer through the adder 2050, multiplying the power through the multiplier 2060 reproduces the size vector of the even-numbered lower frame for the third layer. Using this vector, interframe interpolation between the last even subframe of the previous frame and the even subframe of the current frame is performed by the interframe interpolator 2070 for the odd subframe of the third layer. Get size information of a frame. Interpolation between frames is the same as described above.

도 21은 본 발명에 의한 오디오 복호화장치의 또 다른 실시예를 나타내고 있다. 부호화된 광대역 에러 오디오 신호에 상응하는 비트스트림이 수신되면 수신된 비트스트림을 디패킹(depacking)하여 계층별로 복원한다. 즉, 비트스트림의 일부인 협대역 코어 코덱의 정보만 수신될 경우 협대역 코어 코덱(2100)의 복호화기에 의하여 협대역 신호만 복원된 최소의 오디오 신호만이 복원 된다. 이 때 복원된 협대역 신호는 후처리 과정을 통하여 최종의 협대역 신호를 생성하게 된다. 만약 협대역 코어 코덱과 함께 광대역 신호에 대한 정보가 수신되면 수신된 광대역 정보만큼의 신호를 복원하게 된다.21 shows another embodiment of an audio decoding apparatus according to the present invention. When a bitstream corresponding to the encoded wideband error audio signal is received, the received bitstream is depacked and restored for each layer. That is, when only the information of the narrowband core codec that is part of the bitstream is received, only the minimum audio signal in which only the narrowband signal is recovered by the decoder of the narrowband core codec 2100 is recovered. In this case, the recovered narrowband signal generates a final narrowband signal through a post-processing process. If information on the wideband signal is received together with the narrowband core codec, the signal is restored as much as the received wideband information.

광대역 신호는 먼저 전송된 크기 인덱스와 위상 인덱스를 변환계수 복호화 부(2120)에 입력하여 실제 계수로 바꾼 후 이를 복소수 형태로 변환한다. 상기 복원된 복소수형태의 주파수 정보는 주파수-시간 매핑부(2130)에서 시간 도메인 변환 과정이 수행된다. 본 발명에서 시간 도메인 변환에 사용한 방법은 IFFT 이며, 다른 시간 도메인 변환 방법이 부호화기의 주파수 변환 방법과 쌍으로 사용될 수 있다. 상기 시간 도메인 변환기에 의하여 복원된 선형 예측 잔차 신호를 얻을 수 있으며, 상기 복원된 선형 예측 잔차 신호는 LPC 계수 인덱스에 의하여 복원된 LPC 계수를 사용하여 선형 예측 합성(linear prediction synthesis)부(2140)에서 오디오 신호로 합성하게 된다. 상기 과정을 거치면 12.8kHz의 광대역 에러 신호가 복원되며, 상기 복원된 12.8kHz 광대역 에러 신호는 오버 샘플러2(2150)에 의하여 16kHz의 광대역 에러 신호로 변환된다. 이 때 주파수 도메인에서 6.4kHz 이상의 주파수를 생성해 주기 위하여 고주파 발생기(2160)에서 고주파수에 해당하는 신호를 발생시킨 다. 이 고주파 발생기(2160)는 먼저 난수발생기에 의해 발생된 난수를 선형 예측 합성을 통하여 가상의 16kHz의 신호를 만들고, 상기 생성된 가상의 16kHz 신호를 고대역 통과 필터를 사용하여 고주파 성분만을 추출한 후, 수신된 고주파 이득값을 곱하여 주파수 도메인의 6.4kHz이상의 신호를 생성한다. 하지만 고주파 이득값이 비트스트림을 통하여 수신 되지 않았을 경우, 상기 복원된 선형 예측 에러 신호와 주파수 기울기를 통하여 이득값을 추정한다. 이 후, 상기 고주파 신호와 상기 복원된 16kHz의 광대역 에러 신호를 가산기(2170)를 통해 가산하여 광대역 에러 신호를 생성한다. 협대역 코어 코덱의 복호화기는 위에서 언급한 협대역 복호화와 동일하게 협대역 오디오 신호를 합성한다. 상기 합성된 협대역 오디오 신호를 오버샘플러3(2110)를 통하여 16kHz 광대역 신호로 변환한다. 상기 변환된 16kHz 협대역 코어 오디오 신호는 가산기(2180)를 통해 상기 합성된 광대역 에러 신호와 가산되어 최종적으로 합성된 광대역 오디오 신호를 생성한다. 상기 최종적으로 합성된 오디오 신호는 후처리기(2190)에서 후처리 과정을 수행하게 되는데 이는 더욱 명료한 오디오 신호를 제공하기 위함이다. 후처리 과정은 이미 음성 코덱에서 잘 알려진 포만트(formant) 후처리 필터와 이득값 보상 과정이 수행된다. 포만트 후처리 필터는 오디오 신호의 포만트 성분을 강조하여 오디오 신호를 더욱 명료하게 해주며, 이득값 보상 과정은 포만트 후처리 필터에 의하여 손실된 에너지 값을 보상해주는 역할을 한다.The wideband signal first inputs the transmitted magnitude index and phase index to the transform coefficient decoding unit 2120, converts them into actual coefficients, and converts them into a complex form. The frequency-time mapping unit 2130 performs time domain conversion on the restored complex frequency information. In the present invention, the method used for the time domain transform is IFFT, and another time domain transform method may be used in pairs with the frequency transform method of the encoder. The reconstructed linear prediction residual signal may be obtained by the time domain transformer, and the reconstructed linear prediction residual signal may be obtained by a linear prediction synthesis unit 2140 using the LPC coefficient reconstructed by the LPC coefficient index. It is synthesized into an audio signal. Through the above process, a wideband error signal of 12.8 kHz is restored, and the restored 12.8 kHz wideband error signal is converted into a 16 kHz wideband error signal by the oversampler 2150. At this time, in order to generate a frequency of 6.4kHz or more in the frequency domain, the high frequency generator 2160 generates a signal corresponding to a high frequency. The high frequency generator 2160 first generates a virtual 16 kHz signal through linear predictive synthesis using the random number generated by the random number generator, and extracts only the high frequency component from the generated virtual 16 kHz signal using a high pass filter. Multiply the received high frequency gain to produce a signal above 6.4kHz in the frequency domain. However, when the high frequency gain value is not received through the bitstream, the gain value is estimated through the restored linear prediction error signal and the frequency slope. Thereafter, the high frequency signal and the restored 16 kHz wide band error signal are added through an adder 2170 to generate a wide band error signal. The decoder of the narrowband core codec synthesizes the narrowband audio signal in the same manner as the narrowband decoding mentioned above. The synthesized narrowband audio signal is converted into a 16kHz wideband signal through the oversampler 3110. The converted 16 kHz narrowband core audio signal is added to the synthesized wideband error signal through an adder 2180 to produce a finally synthesized wideband audio signal. The finally synthesized audio signal is post-processed by the post processor 2190 in order to provide a clearer audio signal. The post-processing is performed with a formant post-processing filter and a gain compensation process, which are well known in the speech codec. The formant post-processing filter emphasizes the formant component of the audio signal to make the audio signal more clear, and the gain compensation process compensates for the energy value lost by the formant post-processing filter.

본 발명에 따른 광대역 에러 오디오 부호화 및 복호화 방법을 수행하기 위한 프로그램은 컴퓨터로 읽을 수 있는 기록 매체에 컴퓨터가 읽을 수 있는 코드로서 구현하는 것이 가능하다. 컴퓨터가 읽을 수 있는 기록 매체는 컴퓨터 시스템에 의하여 읽혀질 수 있는 데이터가 저장되는 모든 종류의 저장 장치를 포함한다. 컴퓨터가 읽을 수 있는 기록 매체의 예로는 ROM, RAM, CD-ROM, 자기 테이프, 플로피디스크, 광 데이터 저장장치 등이 있으며, 또한 캐리어 웨이브(예를 들어 인터넷을 통한 전송)의 형태로 구현되는 것도 포함한다. The program for performing the wideband error audio encoding and decoding method according to the present invention can be embodied as computer readable codes on a computer readable recording medium. Computer-readable recording media include all kinds of storage devices that store data that can be read by a computer system. Examples of computer-readable recording media include ROM, RAM, CD-ROM, magnetic tape, floppy disk, optical data storage, and the like, and may also be implemented in the form of a carrier wave (for example, transmission over the Internet). Include.

또한 컴퓨터가 읽을 수 있는 기록 매체는 네트워크로 연결된 컴퓨터 시스템에 분산되어, 분산방식으로 컴퓨터가 읽을 수 있는 코드가 저장되고 실행될 수 있다. 그리고, 상기 사용자 추적 방법을 구현하기 위한 기능적인(function) 프로그램, 코드 및 코드 세그먼트들은 본 발명이 속하는 기술분야의 프로그래머들에 의해 용이하게 추론될 수 있다. The computer readable recording medium can also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion. In addition, functional programs, codes, and code segments for implementing the user tracking method can be easily inferred by programmers in the art to which the present invention belongs.

이제까지 본 발명에 대하여 그 바람직한 실시 예들을 중심으로 살펴보았다. 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시 예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허청구범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.So far I looked at the center of the preferred embodiment for the present invention. Those skilled in the art will appreciate that the present invention can be implemented in a modified form without departing from the essential features of the present invention. Therefore, the disclosed embodiments should be considered in descriptive sense only and not for purposes of limitation. The scope of the present invention is shown in the claims rather than the foregoing description, and all differences within the scope will be construed as being included in the present invention.

상술한 본 발명에 따르면, 광대역 에러 오디오 신호의 주파수 크기 정보와 위상 정보를 사용하여 다수의 계층을 두고 확장성을 지원할 수 있다.According to the present invention described above, scalability can be supported in multiple layers by using frequency magnitude information and phase information of a wideband error audio signal.

또한, 저대역 오디오 신호는 유지하면서 광대역 에러 오디오 신호의 주파수 정보를 이용함으로써, 기본 오디오의 질을 유지할 수 있다. 상술한 주파수 크기정보를 이용하면 넓은 주파수 대역을 적은 비트로 양자화할 수 있으며 전체적으로 음질에 대한 scalability를 줄 수 있다.In addition, by using the frequency information of the wideband error audio signal while maintaining the lowband audio signal, the quality of the basic audio can be maintained. By using the above-described frequency size information, a wide frequency band can be quantized with fewer bits, and overall scalability of sound quality can be given.

Claims

(a) 오디오신호의 주파수 크기들의 파워를 구하여 양자화하는 단계;(a) obtaining and quantizing the power of the frequency magnitudes of the audio signal;

(b) 상기 주파수 크기데이터를 이용하여 상기 양자화된 값을 정규화하는 단계; 및(b) normalizing the quantized value using the frequency magnitude data; And

(c) 상기 정규화된 주파수 크기데이터들 중 짝수 또는 홀수번째 데이터들을 양자화하는 단계를 포함함을 특징으로 하는 주파수 크기데이터 양자화 방법.and (c) quantizing the even or odd-numbered data among the normalized frequency magnitude data.

제1항에 있어서,The method of claim 1,

상기 (b)단계의 정규화된 주파수 크기 데이터들 중 상기 (c)단계에서 양자화되지 않은 데이터를, 상기 (c)단계에서 양자화된 데이터를 이용한 보간에 의해 생성하는 단계; 및 Generating data not quantized in step (c) of the normalized frequency magnitude data of step (b) by interpolation using the quantized data in step (c); And

상기 (b)단계의 정규화된 주파수 크기 데이터들 중 상기 (c)단계에서 양자화되지 않은 데이터와 상기 보간에 의해 생성된 데이터의 차에 얻어진 보간에러를 양자화하는 단계를 더 구비함을 특징으로 하는 주파수 크기데이터 양자화 방법.Quantizing the interpolation error obtained from the difference between the data that is not quantized in step (c) and the data generated by the interpolation among the normalized frequency magnitude data of step (b). Magnitude Data Quantization Method.

주파수 크기 데이터를 양자화하는 방법에 있어서,In the method of quantizing frequency magnitude data,

(a) 임의의 오디오 프레임을 구성하는 복수의 대역에 대해, 각 대역별로 주파수 크기들의 파워를 구하여 양자화하는 단계;(a) obtaining and quantizing power of frequency magnitudes for each band for a plurality of bands constituting an arbitrary audio frame;

(b) 상기 양자화된 값을 이용하여 대역별로 주파수 크기 데이터를 정규화하 는 단계; 및(b) normalizing frequency size data for each band using the quantized value; And

(c) 상기 정규화된 주파수 크기데이터의 짝수 또는 홀수번째 데이터들을 양자화하는 단계를 포함함을 특징으로 하는 주파수 크기 데이터 양자화 방법.(c) quantizing the even or odd-numbered data of the normalized frequency magnitude data.

제3항에 있어서, The method of claim 3,

(d) 상기 (a)단계의 정규화된 주파수 크기 데이터들 중 상기 (c)단계에서 양자화되지 않은 데이터를, 상기 (b)단계에서 양자화된 데이터를 이용한 보간에 의해 생성하는 단계; 및 (d) generating data not quantized in step (c) of the normalized frequency magnitude data of step (a) by interpolation using the quantized data in step (b); And

(e) 상기 (a)단계의 정규화된 주파수 크기 데이터들 중 상기 (b)단계에서 양자화되지 않은 데이터와 상기 보간에 의해 생성된 데이터의 차에 얻어진 보간에러를 양자화하는 단계를 더 포함함을 특징으로 하는 주파수 크기 데이터 양자화 방법.(e) quantizing the interpolation error obtained from the difference between the data that is not quantized in step (b) and the data generated by the interpolation among the normalized frequency magnitude data of step (a). Frequency magnitude data quantization method.

오디오 신호의 광대역 에러 오디오 신호의 주파수 포락선을 구하는 단계;Obtaining a wide frequency error of the audio signal;

상기 검출된 주파수 포락선을 상기 광대역 에러 오디호 신호에서 제거하고 주파수 크기와 위상을 구하는 단계;Removing the detected frequency envelope from the wideband error audio signal and obtaining a frequency magnitude and a phase;

상기 검출된 주파수 크기와 위상을 부호화하는 단계를 포함함을 특징으로 하고,And encoding the detected frequency magnitude and phase,

상기 주파수 크기 부호화는The frequency magnitude coding is

(a) 임의의 오디오 프레임을 구성하는 복수의 대역에 대해, 각 대역별로 주 파수 크기들의 파워를 구하여 양자화하는 단계;(a) obtaining and quantizing power of frequency sizes for each band for a plurality of bands constituting an arbitrary audio frame;

(b) 상기 양자화된 값을 이용하여 대역별로 주파수 크기 데이터를 정규화하는 단계; 및(b) normalizing frequency magnitude data for each band using the quantized value; And

(c) 상기 정규화된 주파수 크기데이터의 짝수 또는 홀수번째 데이터들을 양자화하는 단계를 포함하는, 오디오 부호화 방법.(c) quantizing the even or odd-numbered data of the normalized frequency magnitude data.

제5항에 있어서, The method of claim 5,

상기 정규화된 주파수 크기 데이터들 중 상기 양자화되지 않은 데이터를, 상기 양자화된 데이터를 이용한 보간에 의해 생성하는 단계; 및 Generating the unquantized data of the normalized frequency magnitude data by interpolation using the quantized data; And

상기 정규화된 주파수 크기 데이터들 중 상기 양자화되지 않은 데이터와 상기 보간에 의해 생성된 데이터의 차에 얻어진 보간에러를 양자화하는 단계를 더 포함함을 특징으로 하는 오디오 부호화 방법.And quantizing the interpolation error obtained in the difference between the unquantized data and the data generated by the interpolation among the normalized frequency magnitude data.

주파수 크기 데이터를 양자화하는 장치에 있어서,An apparatus for quantizing frequency magnitude data,

임의의 오디오 프레임을 구성하는 대역별로 주파수 크기들의 파워를 계산하는 파워계산부;A power calculator configured to calculate power of frequency magnitudes for each band constituting an arbitrary audio frame;

상기 계산된 파워를 양자화하는 파워양자화부;A power quantizer for quantizing the calculated power;

상기 주파수 크기데이터를 이용하여 상기 양자화된 값을 정규화하는 크기 정규화부; 및A magnitude normalizer for normalizing the quantized value using the frequency magnitude data; And

상기 정규화된 주파수 크기데이터의 짝수 또는 홀수번째 데이터들을 양자화 하는 정규화데이터 양자화부를 포함함을 특징으로 하는 주파수 크기 데이터 양자화 장치.And a normalized data quantizer for quantizing even or odd-numbered data of the normalized frequency magnitude data.

제7항에 있어서, The method of claim 7, wherein

상기 크기 정규화부에서 정규화된 주파수 크기 데이터들 중 상기 정규화데이터 양자화부에서 양자화되지 않은 데이터를 상기 정규화데이터 중 양자화된 데이터를 이용한 보간에 의해 생성하는 보간부; 및An interpolation unit generating data not quantized by the normalization data quantization unit among frequency size data normalized by the size normalization unit by interpolation using quantized data among the normalization data; And

상기 정규화된 주파수 크기 데이터들 중 상기 양자화되지 않은 데이터와 상기 보간에 의해 생성된 데이터의 차에 얻어진 보간에러를 양자화하는 보간에러양자화부를 더 포함함을 특징으로 하는 주파수 크기 데이터 양자화 장치.And an interpolation error quantization unit for quantizing an interpolation error obtained by a difference between the unquantized data and the data generated by the interpolation among the normalized frequency magnitude data.

오디오 신호의 광대역 에러 오디오 신호의 주파수 포락선을 구하는 포락선검출부;An envelope detection unit for obtaining a frequency envelope of an audio signal;

상기 검출된 주파수 포락선을 상기 광대역 에러 오디호 신호에서 제거하고 주파수 크기와 위상을 구하는 주파수크기/위상 획득부;A frequency magnitude / phase acquisition unit which removes the detected frequency envelope from the wideband error audio signal and obtains a frequency magnitude and a phase;

상기 검출된 주파수 크기를 부호화 하는 주파수크기 부호화부; 및 A frequency size encoder which encodes the detected frequency magnitude; And

상기 검출된 주파수 위상을 부호화하는 주파수 위상 부호화부를 포함함을 특징으로 하고,And a frequency phase encoder which encodes the detected frequency phase.

상기 주파수 크기 부호화부는The frequency magnitude coding unit

임의의 오디오 프레임을 구성하는 대역들에 대해 상기 대역별로 주파수 크기 들의 파워를 계산하는 파워계산부;A power calculator for calculating power of frequency magnitudes for each band for bands constituting an arbitrary audio frame;

상기 정규화된 주파수 크기데이터의 짝수 또는 홀수번째 데이터들을 양자화하는 정규화데이터 양자화부를 포함하는 오디오 부호화 장치.And a normalized data quantizer for quantizing even or odd-numbered data of the normalized frequency magnitude data.

제9항에 있어서, The method of claim 9,

상기 정규화된 주파수 크기 데이터들 중 상기 양자화되지 않은 데이터와 상기 보간에 의해 생성된 데이터의 차에 얻어진 보간에러를 양자화하는 보간에러양자화부를 더 포함함을 특징으로 하는 오디오 부호화 장치.And an interpolation error quantizer that quantizes an interpolation error obtained by a difference between the unquantized data and the data generated by the interpolation among the normalized frequency magnitude data.

비트스트림에 포함된 주파수크기의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 주파수 크기들의 파워를 복원하는 단계; 및Restoring power of frequency magnitudes by inverse quantization of a quantized value (RMS index) of power of a frequency size included in the bitstream; And

복원하고자 하는 주파수 크기의 개수에 해당하는 임펄스들과 상기 복원된 주파수 크기 파워를 곱하여 주파수 크기들을 복원하는 단계를 포함함을 특징으로 하는 주파수 크기데이터 역양자화 방법.And restoring the frequency magnitudes by multiplying the impulses corresponding to the number of frequency magnitudes to be restored by the restored frequency magnitude power.

비트스트림에 포함된 주파수 크기들의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 상기 주파수 크기들의 파워를 복원하는 단계;Restoring power of the frequency magnitudes by inverse quantization of a quantized value (RMS index) of powers of frequency magnitudes included in a bitstream;

비트스트림에 포함된 양자화된 짝수번째 또는 홀수번째의 정규화된 주파수 크기데이터들을 분리하여 역양자화함으로써 상기 짝수번째 또는 홀수번째의 정규화된 주파수 크기 데이터들을 복원하는 단계;Restoring the even or odd normalized frequency magnitude data by separating and inverse quantizing quantized even or odd normalized frequency magnitude data included in a bitstream;

상기 복원된 정규화된 주파수 크기 데이터들을 보간하여, 정규화된 주파수 크기 데이터들 중 복원되지 않은 주파수 크기 데이터들을 생성하는 단계; 및Interpolating the reconstructed normalized frequency magnitude data to generate unreconstructed frequency magnitude data among the normalized frequency magnitude data; And

상기 정규화된 주파수 크기데이터들과 상기 보간에 의해 생성된 주파수 크기 데이터들을 상기 복원된 주파수 크기들의 파워를 이용하여 역정규화하여 주파수 크기 데이터들을 복원하는 단계를 포함함을 특징으로 하는 주파수 크기데이터 역양자화 방법.Denormalizing the frequency magnitude data by denormalizing the normalized frequency magnitude data and the frequency magnitude data generated by the interpolation using the power of the restored frequency magnitudes. Way.

(a) 비트스트림에 포함된 주파수 크기들의 파워를 양자화한 값에 해당하는 RMS 인덱스를 역양자화하여 상기 주파수 크기들의 파워를 복원하는 단계;(a) restoring power of the frequency magnitudes by inverse quantizing an RMS index corresponding to a quantized value of powers of frequency magnitudes included in a bitstream;

(b) 비트스트림에 포함된 양자화된 짝수번째 또는 홀수번째의 정규화된 주파수 크기데이터들을 역양자화하여 상기 짝수번째 또는 홀수번째의 정규화된 주파수 크기 데이터들을 복원하는 단계;(b) dequantizing the quantized even or odd normalized frequency magnitude data included in the bitstream to restore the even or odd normalized frequency magnitude data;

(c) 상기 복원된 정규화된 주파수 크기 데이터들을 보간하여, 정규화된 주파수 크기 데이터들 중 복원되지 않은 주파수 크기 데이터들을 생성하는 단계;(c) interpolating the reconstructed normalized frequency magnitude data to generate unreconstructed frequency magnitude data among the normalized frequency magnitude data;

(d) 비트스트림에 포함된 양자화된 보간에러 데이터들을 역양자화하여, 보간에러 데이터들을 복원하는 단계; 및(d) inverse quantization of quantized interpolation error data included in the bitstream to recover interpolation error data; And

(e) 상기 (b)단계에서 복원된 주파수 크기데이터들과 상기 (c)단계에서 복원된 보간에 의해 생성된 주파수 크기데이터들과 상기 (d)단계에서 복원된 보간에러데이터들을 상기 (a)단계에서 복원된 주파수 크기들의 파워를 이용하여 역정규화하여 주파수 크기 데이터들을 복원하는 단계를 포함함을 특징으로 하는 주파수 크기데이터 역양자화 방법.(e) the frequency magnitude data restored in step (b), the frequency magnitude data generated by interpolation restored in step (c) and the interpolation error data recovered in step (d); And restoring the frequency magnitude data by denormalizing the power of the frequency magnitudes restored in the step.

제11항 내지 제13항 중 어느 한 항에 있어서, 상기 주파수 크기 데이터 역양자화방법을 구성하는 각 단계는14. The method according to any one of claims 11 to 13, wherein each step of constructing the frequency magnitude data dequantization method

주파수 영역으로 변환된 오디오 신호의 프레임을 구성하는 대역별로 수행됨을 특징으로 하는 주파수 크기데이터 역양자화 방법.A frequency magnitude data dequantization method, characterized in that performed in each band constituting a frame of an audio signal converted into a frequency domain.

주파수 크기를 복호화하는 단계;Decoding the frequency magnitude;

주파수 위상을 복호화하는 단계; 및Decoding the frequency phase; And

상기 복호화된 주파수 크기 및 위상 데이터를 이용하여 광대역 에러 오디오 신호의 주파수 포락선을 복원하는 단계를 포함함을 특징으로 하고, Restoring a frequency envelope of a wideband error audio signal using the decoded frequency magnitude and phase data,

상기 주파수 크기 복호화 단계는The frequency magnitude decoding step

비트스트림에 포함된 주파수크기의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 주파수 파워를 복원하는 단계;Restoring frequency power by inverse quantization of a quantized value (RMS index) of power of a frequency size included in the bitstream;

복원하고자 하는 주파수 크기의 개수에 해당하는 임펄스들을 생성하는 단계; 및Generating impulses corresponding to the number of frequency magnitudes to be restored; And

상기 복원된 주파수 크기 파워를 곱하여 주파수 크기들을 복원하는 단계를 포함하는 오디오 복호화 방법.Restoring frequency magnitudes by multiplying the restored frequency magnitude power.

제15항에 있어서, 상기 주파수 크기 복호화 단계는16. The method of claim 15, wherein the frequency magnitude decoding step is

비트스트림에 포함된 양자화된 짝수번째 또는 홀수번째의 정규화된 주파수 크기데이터들을 역양자화하여 상기 짝수번째 또는 홀수번째의 정규화된 주파수 크기 데이터들을 복원하는 단계;Dequantizing the quantized even-numbered or odd-numbered normalized frequency magnitude data included in the bitstream to restore the even-numbered or odd-numbered normalized frequency magnitude data;

상기 정규화된 주파수 크기데이터들과 상기 보간에 의해 생성된 주파수 크기 데이터들을 상기 복원된 주파수 크기들의 파워를 이용하여 역정규화하여 주파수 크기 데이터들을 복원하는 단계를 포함하는 오디오 복호화 방법.And denormalizing the normalized frequency magnitude data and the frequency magnitude data generated by the interpolation by using the restored power of the frequency magnitudes to recover frequency magnitude data.

비트스트림에 포함된 주파수 크기들의 파워를 양자화한 값에 해당하는 RMS 인덱스를 역양자화하여 상기 주파수 크기들의 파워를 복원하는 단계;Restoring power of the frequency magnitudes by inversely quantizing an RMS index corresponding to a quantized value of powers of frequency magnitudes included in a bitstream;

상기 복원된 정규화된 주파수 크기 데이터들을 보간하여, 정규화된 주파수 크기 데이터들 중 복원되지 않은 주파수 크기 데이터들을 생성하는 단계;Interpolating the reconstructed normalized frequency magnitude data to generate unreconstructed frequency magnitude data among the normalized frequency magnitude data;

비트스트림에 포함된 양자화된 보간에러 데이터들을 역양자화하여, 보간에러 데이터들을 복원하는 단계; 및Dequantizing the quantized interpolation error data included in the bitstream to recover interpolation error data; And

상기 정규화데이터 복원부에서 복원된 주파수 크기데이터들과 상기 정규화데이터 복원부에서 복원된 보간에 의해 생성된 주파수 크기데이터들과 상기 보간에러 복원부에서 복원된 보간에러데이터들을 상기 주파수파워 복원부에서 복원된 주파수 크기들의 파워를 이용하여 역정규화하여 주파수 크기 데이터들을 복원하는 단계를 포함하는 오디오 복호화 방법.The frequency power recovery unit restores the frequency magnitude data restored by the normalized data recovery unit, the frequency magnitude data generated by interpolation restored by the normalized data recovery unit, and the interpolation error data restored by the interpolation error recovery unit. Restoring the frequency magnitude data by denormalizing using the power of the received frequency magnitudes.

비트스트림에 포함된 주파수크기의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 주파수 파워를 복원하는 주파수파워 복원부;A frequency power recovery unit for restoring frequency power by inversely quantizing a value (RMS index) of quantized power of a frequency size included in the bitstream;

복원하고자 하는 주파수 크기의 개수에 해당하는 임펄스들을 생성하는 임펄스열생성부; 및An impulse heat generator for generating impulses corresponding to the number of frequency magnitudes to be restored; And

상기 임펄스열에 상기 복원된 주파수 크기 파워를 곱하여 주파수 크기들을 복원하는 제1주파수크기 복원부를 포함함을 특징으로 하는 주파수 크기데이터 역양자화 장치.And a first frequency magnitude restoring unit for restoring frequency magnitudes by multiplying the impulse sequence by the restored frequency magnitude power.

비트스트림에 포함된 주파수 크기들의 파워를 양자화한 값(RMS 인덱스)을 역양자화하여 상기 주파수 크기들의 파워를 복원하는 주파수파워 복원부;A frequency power recovery unit for inversely quantizing a value (RMS index) of quantized powers of frequency magnitudes included in a bitstream to restore power of the frequency magnitudes;

비트스트림에 포함된 양자화된 짝수번째 또는 홀수번째의 정규화된 주파수 크기데이터들을 역양자화하여 상기 짝수번째 또는 홀수번째의 정규화된 주파수 크기 데이터들을 복원하는 정규화 데이터 복원부;A normalized data reconstruction unit for dequantizing the quantized even-numbered or odd-numbered normalized frequency magnitude data included in the bitstream to restore the even-numbered or odd-numbered normalized frequency magnitude data;

상기 복원된 정규화된 주파수 크기 데이터들을 보간하여, 정규화된 주파수 크기 데이터들 중 복원되지 않은 주파수 크기 데이터들을 생성하는 정규화데이터 보간부; 및A normalized data interpolation unit interpolating the restored normalized frequency magnitude data to generate unreconstructed frequency magnitude data among the normalized frequency magnitude data; And

상기 정규화된 주파수 크기데이터들과 상기 보간에 의해 생성된 주파수 크기 데이터들을 상기 복원된 주파수 크기들의 파워를 이용하여 역정규화하여 주파수 크기 데이터들을 복원하는 제2주파수 크기 복원부를 포함함을 특징으로 하는 주파수 크기데이터 역양자화 장치.And a second frequency magnitude restoring unit configured to denormalize the normalized frequency magnitude data and the frequency magnitude data generated by the interpolation using power of the restored frequency magnitudes to restore frequency magnitude data. Size Data Dequantizer.

비트스트림에 포함된 주파수 크기들의 파워를 양자화한 값에 해당하는 RMS 인덱스를 역양자화하여 상기 주파수 크기들의 파워를 복원하는 주파수파워 복원부;A frequency power recovery unit for restoring power of the frequency magnitudes by inverse quantizing an RMS index corresponding to a quantized value of powers of frequency magnitudes included in a bitstream;

비트스트림에 포함된 양자화된 짝수번째 또는 홀수번째의 정규화된 주파수 크기데이터들을 역양자화하여 상기 짝수번째 또는 홀수번째의 정규화된 주파수 크기 데이터들을 복원하는 정규화데이터 복원부;A normalized data recovery unit for inversely quantizing quantized even-numbered or odd-numbered normalized frequency magnitude data included in a bitstream to restore the even-numbered or odd-numbered normalized frequency magnitude data;

상기 복원된 정규화된 주파수 크기 데이터들을 보간하여, 정규화된 주파수 크기 데이터들 중 복원되지 않은 주파수 크기 데이터들을 생성하는 정규화데이터 보간부;A normalized data interpolation unit interpolating the restored normalized frequency magnitude data to generate unreconstructed frequency magnitude data among the normalized frequency magnitude data;

비트스트림에 포함된 양자화된 보간에러 데이터들을 역양자화하여, 보간에러 데이터들을 복원하는 보간에러 복원부; 및An interpolation error recovery unit that inversely quantizes quantized interpolation error data included in the bitstream and restores interpolation error data; And

상기 정규화데이터 복원부에서 복원된 주파수 크기데이터들과, 상기 정규화데이터 복원부에서 복원된 보간에 의해 생성된 주파수 크기데이터들과, 상기 보간에러 복원부에서 복원된 보간에러데이터들을 상기 주파수파워 복원부에서 복원된 주파수 크기들의 파워를 이용하여 역정규화하여, 주파수 크기 데이터들을 복원하는 제3주파수 크기 복원부를 포함함을 특징으로 하는 주파수 크기데이터 역양자화 장치.The frequency power recovery unit recovers the frequency magnitude data restored by the normalized data recovery unit, the frequency magnitude data generated by interpolation restored by the normalized data recovery unit, and the interpolation error data restored by the interpolation error recovery unit. And a third frequency magnitude reconstruction unit for denormalizing the power of the frequency magnitudes reconstructed by using the reconstructed signal to restore the frequency magnitude data.

제18항 내지 제20항 중 어느 한 항에 있어서, 상기 주파수 크기 데이터 역양자화 장치를 구성하는 각 구성요소는21. The apparatus according to any one of claims 18 to 20, wherein each component constituting the frequency magnitude data dequantization device is

주파수 영역으로 변환된 오디오 신호의 프레임을 구성하는 대역별로 수행됨을 특징으로 하는 주파수 크기데이터 역양자화 장치.The apparatus of claim 1, wherein the frequency magnitude data dequantization apparatus is performed for each band constituting a frame of the audio signal converted into the frequency domain.

주파수 크기를 복호화하는 주파수크기 복호화부;A frequency size decoder which decodes the frequency magnitude;

주파수 위상을 복호화하는 주파수위상 복호화부; 및A frequency phase decoder for decoding the frequency phase; And

상기 복호화된 주파수 크기 및 위상 데이터를 이용하여 광대역 에러 오디오 신호의 주파수 포락선을 복원하는 주파수 포락선 복원부를 포함함을 특징으로 하 고, And a frequency envelope recovery unit for recovering a frequency envelope of a wideband error audio signal using the decoded frequency magnitude and phase data.

상기 주파수 크기 복호화부는The frequency magnitude decoding unit

복원하고자 하는 주파수 크기의 개수에 해당하는 임펄스들을 생성하는 임펄스생성부; 및An impulse generator for generating impulses corresponding to the number of frequency magnitudes to be restored; And

상기 복원된 주파수 크기 파워를 곱하여 주파수 크기들을 복원하는 주파수 크기 복원부를 포함하는 오디오 복호화 장치.And a frequency magnitude recovery unit to restore frequency magnitudes by multiplying the restored frequency magnitude power.

제22항에 있어서, 상기 주파수 크기 복호화부는23. The apparatus of claim 22, wherein the frequency magnitude decoding unit

상기 정규화된 주파수 크기데이터들과 상기 보간에 의해 생성된 주파수 크기 데이터들을 상기 복원된 주파수 크기들의 파워를 이용하여 역정규화하여 주파수 크 기 데이터들을 복원하는 주파수 크기데이터 역정규화부를 포함하는 오디오 복호화장치.And a frequency magnitude data denormalization unit for denormalizing the normalized frequency magnitude data and the frequency magnitude data generated by the interpolation using power of the restored frequency magnitudes to restore frequency magnitude data.

상기 정규화데이터 복원부에서 복원된 주파수 크기데이터들과 상기 정규화데이터 복원부에서 복원된 보간에 의해 생성된 주파수 크기데이터들과 상기 보간에러 복원부에서 복원된 보간에러데이터들을 상기 주파수파워 복원부에서 복원된 주파수 크기들의 파워를 이용하여 역정규화하여 주파수 크기 데이터들을 복원하는 주파수 크기데이터 복원부를 포함하는 오디오 보호화 장치.The frequency power recovery unit restores the frequency magnitude data restored by the normalized data recovery unit, the frequency magnitude data generated by interpolation restored by the normalized data recovery unit, and the interpolation error data restored by the interpolation error recovery unit. And a frequency magnitude data recovery unit for denormalizing and restoring the frequency magnitude data by using the power of the frequency magnitudes.

제1항 내지 제6항, 제11항 내지 제17항 중 어느 한 항에 기재된 발명을 컴퓨터에서 실행시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체.A computer-readable recording medium having recorded thereon a program for executing the invention according to any one of claims 1 to 6 and 11 to 17 on a computer.