KR101238239B1

KR101238239B1 - An encoder

Info

Publication number: KR101238239B1
Application number: KR1020107012418A
Authority: KR
Inventors: 라쎄 라아크소넨; 미코 탐미; 안드리아나 바실라체; 안씨 라모
Original assignee: 노키아 코포레이션
Priority date: 2007-11-06
Filing date: 2007-11-06
Publication date: 2013-03-04
Also published as: RU2483368C2; US20100250261A1; CA2704812C; BRPI0722269A2; TW200931397A; US9082397B2; CN101896967A; RU2010123728A; WO2009059633A1; TWI492224B; EP2212884A1; CA2704812A1; KR20100086033A; EP2212884B1

Abstract

오디오 신호를 부호화하기 위한 인코더가 제안되고, 인코더는 단일 주파수 성분들의 집합을 정의하고; 단일 주파수 성분들의 집합 중 제1부분 집합으로부터 적어도 한 개의 단일 주파수 성분을 선택하도록 구성된다.An encoder is proposed for encoding an audio signal, the encoder defining a set of single frequency components; And select at least one single frequency component from the first subset of the set of single frequency components.

Description

인코더{An encoder}An encoder

본 발명은 코딩에 관한 것으로, 특히 배타적이지 않은 것으로서 스피치 또는 오디오 코딩에 관한 것이다. The present invention relates to coding, and in particular to speech or audio coding as not exclusive.

스피치나 음악 같은 오디오 신호들은 이를테면 그러한 오디오 신호들의 효율적 전송 또는 저장을 수행하기 위해 부호화된다.Audio signals such as speech or music are encoded, for example, to effect efficient transmission or storage of such audio signals.

오디오 인코더들과 디코더들이 음악 및 배경 잡음 같은 오디오 기반 신호들을 표현하는데 사용된다. 이러한 종류의 코더들은 보통 코딩 프로세스에 대해 스피치 모델을 이용하지 않고, 그보다 스피치를 포함한 모든 종류의 오디오 신호들을 표현하기 위한 프로세스들을 이용한다. Audio encoders and decoders are used to represent audio based signals such as music and background noise. Coders of this kind usually do not use a speech model for the coding process, but rather use processes to represent all kinds of audio signals, including speech.

스피치 인코더들 및 디코더들 (코덱들)은 보통 스피치 신호들을 위해 최적화되고, 고정 또는 가변 비트 레이트에서 동작할 수 있다.Speech encoders and decoders (codecs) are usually optimized for speech signals and can operate at a fixed or variable bit rate.

오디오 코덱이 가변 비트 레이트들에서 동작하도록 구성될 수도 있다. 낮은 비트 레이트에서, 그러한 오디오 코덱은 순수한 스피치 코덱에 해당하는 코딩 레이트로 스피치 신호들에 대해 작용할 것이다. 높은 비트 레이트에서, 오디오 코덱은 음악, 배경 잡음 및 스피치를 포함하는 임의의 신호를 보다 높은 품질과 성능으로 부호화할 수 있다. The audio codec may be configured to operate at variable bit rates. At low bit rates, such an audio codec will operate on speech signals at a coding rate corresponding to a pure speech codec. At high bit rates, the audio codec can encode any signal, including music, background noise and speech, with higher quality and performance.

어떤 오디오 코덱들에서는 입력 신호가 제한된 수의 대역들로 나눠진다. 대역 신호들 각각은 양자화될 수 있다. 음향심리학 (psychoacoustics)의 이론으로부터, 스펙트럼의 최상위 주파수들은 낮은 주파수들보다 인지적으로 덜 중요하다는 것이 알려져 있다. 일부 오디오 코덱들에서 이것은 고주파 신호들에 대해 저주파 신호들보다 적은 비트가 할당되는 비트 할당에 의해 반영되고 있다. In some audio codecs, the input signal is divided into a limited number of bands. Each of the band signals may be quantized. From the theory of psychoacoustics, it is known that the highest frequencies in the spectrum are cognitively less important than the lower frequencies. In some audio codecs this is reflected by the bit allocation in which less bits are assigned to the high frequency signals than the low frequency signals.

더구나 일부 코덱들은 저주파 및 고주파 대역들 간이나 오디오 신호 영역들 간 상관을 이용해 코덱들을 통한 코딩 효율을 향상시키도록 한다. Furthermore, some codecs use correlation between low and high frequency bands or between audio signal regions to improve coding efficiency through codecs.

통상적으로, 스펙트럼의 상위 주파수 대역들은 보통 하위 주파수 대역들과 매우 유사하므로, 일부 코덱들은 보다 낮은 주파수 대역들만을 부호화한 후, 상위의 주파수 대역들을 스케일링된 (scaled) 하위 주파수 대역 사본 (copy)으로서 복제할 수 있다. 따라서 소량의 추가 제어 정보만을 사용함으로써 코덱의 총 비트 레이트의 상당한 절감이 이뤄질 수 있다. Typically, the higher frequency bands of the spectrum are usually very similar to the lower frequency bands, so some codecs encode only the lower frequency bands, and then the upper frequency bands as scaled lower frequency band copies. You can duplicate it. Thus, by using only a small amount of additional control information, significant savings in the total bit rate of the codec can be achieved.

고주파 영역을 부호화하기 위한 그러한 한 코덱은 고주파 영역 (HFR, high frequency region) 코딩이라고 알려져 있다. 고주파 영역 코딩의 한 형태가 스펙트럼 대역 복제 (SBR, spectral-band-relication)인데, 그것은 Coding Technologies에 의해 개발되었다. SBR에서, MPEG-4 (Moving Pictures Expert Group) AAC (Advanced Audio Coding) 같은 기존의 오디오 코더나 MPEG-1 레이어 III (MP3) 코더가 저주파 영역을 부호화한다. 고주파 영역은 그렇게 부호화된 저주파 영역을 이용해 개별적으로 만들어진다. One such codec for encoding a high frequency region is known as high frequency region (HFR) coding. One form of high frequency domain coding is spectral-band-relication (SBR), which was developed by Coding Technologies. In SBR, existing audio coders such as Moving Pictures Expert Group (MPEG-4) Advanced Audio Coding (AAC) or MPEG-1 Layer III (MP3) coders encode low frequency regions. The high frequency region is made separately using the low frequency region so encoded.

SBR 코딩에서, 고주파 영역은 저주파 영역을 상위 주파수대로 전치 (transposing)함으로써 얻어진다. 전치 (transposition)는 32 개의 대역을 가진 QMF (Quadrature Mirror Filters) 필터 뱅크에 기반하는 것으로, 어느 대역 샘플들로부터 각각의 고주파 대역 샘플이 구성되는지가 미리 정해지도록 수행된다. 이것은 입력 신호의 특성과는 무관하게 이뤄진다.In SBR coding, the high frequency region is obtained by transposing the low frequency region to the higher frequency. Transposition is based on a 32-band Quadrature Mirror Filters (QMF) filter bank and is performed to predetermine from which band samples each high frequency band sample is constructed. This is done regardless of the characteristics of the input signal.

상위 주파수 대역들은 추가 정보에 기초해 필터링 된다. 필터링은 합성된 고주파 영역의 특정한 특성을 오리지널 고주파 영역과 비슷하게 만들도록 행해진다. 사인파 (sinusoids)나 잡음 같은 추가 성분들이 고주파 영역에 더해져서 원래의 고주파 영역과의 유사도를 높인다. 최종적으로, 오리지널 고주파 스펙트럼의 포락선 (envelope)을 따르도록 포락선이 조정된다. Upper frequency bands are filtered based on the additional information. Filtering is done to make certain characteristics of the synthesized high frequency region similar to the original high frequency region. Additional components, such as sinusoids and noise, are added to the high frequency range to increase the similarity to the original high frequency range. Finally, the envelope is adjusted to follow the envelope of the original high frequency spectrum.

그러나 상위 주파수 영역 코딩은 오리지널 고주파 영역과 동일한 복제본을 생성하지는 못한다. 구체적으로 말해, 기존의 상위 주파수 영역 코딩 메커니즘들은 입력 신호가 음질인 (tonal) 경우, 즉 잡음의 스펙트럼과 유사한 스펙트럼을 가지지 않은 경우 상대적으로 불완전하게 수행된다.However, higher frequency domain coding does not produce the same copy as the original high frequency region. Specifically, existing higher frequency domain coding mechanisms are performed relatively incomplete when the input signal is tonal, i.e., does not have a spectrum similar to that of noise.

본 발명은 현재 제안된 코덱들이, 효율적이고도 정밀한 신호 근사치들을 코딩할 수 있는 기능과 관련해 융통성이 부족하다는 생각에서 비롯된다.The present invention stems from the idea that currently proposed codecs lack flexibility in terms of the ability to code efficient and precise signal approximations.

본 발명의 실시예들은 상술한 문제점 극복을 목적으로 한다.Embodiments of the present invention aim to overcome the above problems.

본 발명의 제1양태에 따라 오디오 신호를 부호화하기 위한 인코더가 제공되고, 상기 인코더는 단일 주파수 성분들의 집합을 정의하고; 단일 주파수 성분들의 집합 중 제1부분 집합으로부터 적어도 한 개의 단일 주파수 성분을 선택하도록 구성된다. According to a first aspect of the invention there is provided an encoder for encoding an audio signal, the encoder defining a set of single frequency components; And select at least one single frequency component from the first subset of the set of single frequency components.

상기 인코더는, 상기 선택된 적어도 한 개의 단일 주파수 성분을 나타낼 적어도 한 제1지시자를 생성하도록 추가 구성될 수 있다. The encoder may be further configured to generate at least one first indicator to represent the selected at least one single frequency component.

상기 인코더는, 단일 주파수 성분들의 집합의 적어도 제2부분 집합으로부터 적어도 한 개의 추가적 단일 주파수 성분을 선택하도록 추가 구성될 수 있다.The encoder may be further configured to select at least one additional single frequency component from at least a second subset of the set of single frequency components.

상기 인코더는 상기 선택된 적어도 한 개의 추가적 단일 주파수 성분을 나타낼 적어도 한 제2지시자를 생성하도록 추가 구성될 수 있다. The encoder may be further configured to generate at least one second indicator to represent the selected at least one additional single frequency component.

상기 인코더는 단일 주파수 성분들의 집합을 적어도 단일 주파수 성분들의 제1 및 제2부분 집합들로 분할하도록 추가 구성될 수 있다. The encoder may be further configured to divide the set of single frequency components into at least first and second subsets of single frequency components.

상기 인코더는 단일 주파수 성분들의 집합을, 그 집합 내 단일 주파수 성분의 주파수에 따라 적어도 단일 주파수 성분들의 제1 및 제2부분 집합들로 분할하도록 추가 구성될 수 있다. The encoder may be further configured to divide the set of single frequency components into at least first and second subsets of single frequency components according to the frequency of the single frequency component in the set.

상기 인코더는 단일 주파수 성분들의 집합을 그 집합 내 단일 주파수 성분의 인지적 중요도에 따라 적어도 단일 주파수 성분의 제1 및 제2부분 집합들로 분할하도록 추가 구성될 수 있다. The encoder may be further configured to divide the set of single frequency components into at least first and second subsets of the single frequency component according to the perceived importance of the single frequency component in the set.

단일 주파수 성분들은 사인파들 (sinusoids)임이 바람직하다. Preferably, the single frequency components are sinusoids.

본 발명의 제2양태에 따라 오디오 신호를 부호화하기 방법이 제공되고, 상기 방법은 단일 주파수 성분들의 집합을 정의하는 단계; 단일 주파수 성분들의 집합 중 제1부분 집합으로부터 적어도 한 개의 단일 주파수 성분을 선택하는 단계를 포함한다. According to a second aspect of the present invention there is provided a method for encoding an audio signal, the method comprising: defining a set of single frequency components; Selecting at least one single frequency component from a first subset of the set of single frequency components.

상기 방법은, 상기 선택된 적어도 한 개의 단일 주파수 성분을 나타낼 적어도 한 제1지시자를 생성하는 단계를 더 포함할 수 있다. The method may further comprise generating at least one first indicator to represent the selected at least one single frequency component.

상기 방법은, 단일 주파수 성분들의 적어도 제2부분 집합으로부터 적어도 한 개의 추가적 단일 주파수 성분을 선택하는 단계를 더 포함할 수 있다. The method may further comprise selecting at least one additional single frequency component from at least a second subset of single frequency components.

상기 방법은, 상기 선택된 적어도 한 개의 추가적 단일 주파수 성분을 나타낼 적어도 한 제2지시자를 생성하는 단계를 더 포함할 수 있다. The method may further comprise generating at least one second indicator to represent the selected at least one additional single frequency component.

상기 방법은, 단일 주파수 성분들의 집합을 적어도 단일 주파수 성분들의 제1 및 제2부분 집합들로 분할하는 단계를 더 포함할 수 있다. The method may further comprise dividing the set of single frequency components into at least first and second subsets of single frequency components.

단일 주파수 성분들의 집합을 적어도 단일 주파수 성분들의 제1 및 제2부분 집합들로 분할하는 단계는 그 집합 내 단일 주파수 성분의 주파수에 종속될 수 있다. Dividing a set of single frequency components into at least first and second subsets of single frequency components may be dependent on the frequency of the single frequency component in the set.

단일 주파수 성분들의 집합을 적어도 단일 주파수 성분들의 제1 및 제2부분 집합들로 분할하는 단계는, 그 집합 내 단일 주파수 성분의 인지적 중요도에 종속될 수 있다. Dividing a set of single frequency components into at least first and second subsets of single frequency components may depend on the cognitive importance of a single frequency component in the set.

단일 주파수 성분들은 사인파들 (sinusoids)임이 바람직하다.Preferably, the single frequency components are sinusoids.

본 발명의 제3양태에 따라 오디오 신호를 복호화하는 디코더가 제공되고, 상기 디코더는 단일 주파수 성분들의 집합의 제1부분 집합으로부터 적어도 한 단일 주파수 성분을 나타내는 적어도 한 지시자를 수신하도록 하고; 수신된 상기 지시자에 따라 상기 단일 주파수 성분을 삽입하도록 구성된다. According to a third aspect of the invention there is provided a decoder for decoding an audio signal, the decoder adapted to receive at least one indicator representing at least one single frequency component from a first subset of a set of single frequency components; And to insert the single frequency component in accordance with the indicator received.

상기 디코더는, 단일 주파수 성분들의 집합의 적어도 한 추가 부분 집합으로부터 적어도 한 추가 단일 주파수 성분을 나타내는 적어도 한 추가 지시자를 수신하고; 수신된 추가 지시자에 따라 상기 추가 단일 주파수 성분을 삽입하도록 추가 구성될 수 있다.The decoder receives at least one additional indicator indicating at least one additional single frequency component from at least one additional subset of the set of single frequency components; It may be further configured to insert the additional single frequency component according to the received additional indicator.

상기 디코더는 단일 주파수 성분들의 집합의 제1부분 집합으로부터 상기 적어도 한 단일 주파수 성분의 부호 (sign)를 나타내는 부호 지시자를 수신하도록 추가 구성될 수 있다.The decoder may be further configured to receive a sign indicator indicating a sign of the at least one single frequency component from the first subset of the set of single frequency components.

본 발명의 제4양태에 따라 오디오 신호를 복호화하기 위한 방법이 제공되고, 상기 방법은, 단일 주파수 성분들의 집합의 제1부분 집합으로부터 적어도 한 단일 주파수 성분을 나타내는 적어도 한 지시자를 수신하는 단계; 및 수신된 상기 지시자에 따라 상기 단일 주파수 성분을 삽입하는 단계를 포함한다. According to a fourth aspect of the present invention there is provided a method for decoding an audio signal, the method comprising: receiving at least one indicator representing at least one single frequency component from a first subset of a set of single frequency components; And inserting the single frequency component in accordance with the indicator received.

상기 방법은, 단일 주파수 성분들의 집합의 적어도 한 추가 부분 집합으로부터 적어도 한 추가 단일 주파수 성분을 나타내는 적어도 한 추가 지시자를 수신하는 단계; 및 수신된 추가 지시자에 따라 상기 추가 단일 주파수 성분을 삽입하는 단계를 더 포함할 수 있다. The method includes receiving at least one additional indicator representing at least one additional single frequency component from at least one additional subset of the set of single frequency components; And inserting the additional single frequency component in accordance with the received additional indicator.

상기 방법은, 단일 주파수 성분들의 집합의 제1부분 집합으로부터 상기 적어도 한 단일 주파수 성분의 부호 (sign)를 나타내는 부호 지시자를 수신하는 단계를 더 포함할 수 있다.The method may further comprise receiving a sign indicator indicative of a sign of the at least one single frequency component from the first subset of the set of single frequency components.

본 발명의 제5양태에 따라 상술한 인코더를 구비한 장치가 제안된다.According to a fifth aspect of the invention there is proposed an apparatus with the above-described encoder.

본 발명의 제6양태에 따라 상술한 디코더를 구비한 장치가 제안된다.According to a sixth aspect of the invention, an apparatus with the above-described decoder is proposed.

본 발명의 제7양태에 따라 상술한 인코더를 구비한 전자 기기가 제안된다.According to a seventh aspect of the present invention, an electronic device having the above-described encoder is proposed.

본 발명의 제8양태에 따라 상술한 디코더를 구비한 전자 기기가 제안된다.According to an eighth aspect of the present invention, an electronic device having the above-described decoder is proposed.

본 발명의 제9양태에 따라 오디오 신호를 부호화하는 방법을 수행하도록 구성된 컴퓨터 프로그램 제품이 제안되고, 그 방법은 단일 주파수 성분들의 집합을 정의하는 단계; 단일 주파수 성분들의 집합 중 제1부분 집합으로부터 적어도 한 개의 단일 주파수 성분을 선택하는 단계를 포함한다. According to a ninth aspect of the present invention there is proposed a computer program product configured to perform a method of encoding an audio signal, the method comprising: defining a set of single frequency components; Selecting at least one single frequency component from a first subset of the set of single frequency components.

본 발명의 제10양태에 따라 오디오 신호를 복호화하는 방법을 수행하도록 구성된 컴퓨터 프로그램 제품이 제안되고, 그 방법은 단일 주파수 성분들의 집합의 제1부분 집합으로부터 적어도 한 단일 주파수 성분을 나타내는 적어도 한 지시자를 수신하는 단계; 및 수신된 상기 지시자에 따라 상기 단일 주파수 성분을 삽입하는 단계를 포함한다.A computer program product configured to perform a method of decoding an audio signal according to a tenth aspect of the present invention is proposed, the method comprising at least one indicator indicating at least one single frequency component from a first subset of a set of single frequency components Receiving; And inserting the single frequency component in accordance with the indicator received.

본 발명의 제11양태에 따라 오디오 신호를 부호화하는 인코더가 제안되고, 상기 인코더는 단일 주파수 성분들의 집합을 정의하는 수단; 및 단일 주파수 성분들의 집합 중 제1부분 집합으로부터 적어도 한 개의 단일 주파수 성분을 선택하는 선택 수단을 포함한다.According to an eleventh aspect of the present invention, an encoder for encoding an audio signal is proposed, the encoder comprising: means for defining a set of single frequency components; And selection means for selecting at least one single frequency component from the first subset of the set of single frequency components.

본 발명의 제12양태에 따라 오디오 신호를 복호화하는 디코더가 제공되고, 상기 디코더는 단일 주파수 성분들의 집합의 제1부분 집합으로부터 적어도 한 단일 주파수 성분을 나타내는 적어도 한 지시자를 수신하는 수신 수단; 및 수신된 상기 지시자에 따라 상기 단일 주파수 성분을 삽입하는 삽입 수단을 포함한다. According to a twelfth aspect of the present invention there is provided a decoder for decoding an audio signal, the decoder comprising: receiving means for receiving at least one indicator representing at least one single frequency component from a first subset of a set of single frequency components; And insertion means for inserting the single frequency component in accordance with the indicator received.

본 발명의 제13양태에 따라 오디오 신호를 부호화하는 인코더가 제안되고, 상기 인코더는 적어도 두 개의 단일 주파수 성분들을 선택하고; 상기 적어도 두 개의 단일 주파수 성분들을 나타내도록 구성되고, 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격 (frequency separation)에 종속되도록 구성되는 지시자를 생성하도록 구성된다.According to a thirteenth aspect of the present invention, an encoder for encoding an audio signal is proposed, wherein the encoder selects at least two single frequency components; And generate an indicator configured to indicate the at least two single frequency components and configured to be dependent on a frequency separation between the two single frequency components.

상기 인코더는 적어도 한 추가 단일 주파수 성분을 선택하도록 추가 구성될 수 있고, 상기 지시자는 상기 적어도 한 추가 단일 주파수 성분을 나타내도록 추가 구성됨이 바람직하고, 상기 지시자는 상기 적어도 한 추가 단일 주파수 성분, 및 상기 적어도 두 개의 단일 주파수 성분들 중 하나와의 사이의 주파수 간격에 종속되도록 추가 구성됨이 바람직하다. The encoder may be further configured to select at least one additional single frequency component, and wherein the indicator is further configured to indicate the at least one additional single frequency component, wherein the indicator is the at least one additional single frequency component, and the It is preferably further configured to be dependent on the frequency spacing between one of the at least two single frequency components.

상기 지시자는 상기 적어도 두 개의 단일 주파수 성분들 중 하나의 주파수에 종속되도록 추가 구성됨이 바람직하다.The indicator is preferably further configured to be dependent on the frequency of one of the at least two single frequency components.

상기 인코더는 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격을 결정하도록 추가 구성될 수 있다. The encoder may be further configured to determine a frequency interval between the two single frequency components.

상기 인코더는, 상기 결정된 두 개의 단일 주파수 성분들 사이의 주파수 간격에 대한 주파수 간격 값들의 리스트를 서치하고; 상기 결정된 두 개의 단일 주파수 성분들 사이의 주파수 간격에 보다 가깝게 매치하는 리스트 중 하나를 선택하도록 추가 구성될 수 있고, 상기 지시자는 주파수 간격 값들의 리스트 중 선택된 하나에 종속됨이 바람직하다. The encoder searches a list of frequency interval values for a frequency interval between the determined two single frequency components; It may be further configured to select one of the lists that more closely matches the frequency interval between the two determined single frequency components, wherein the indicator is preferably dependent on the selected one of the list of frequency interval values.

상기 인코더는 주파수 간격 값들의 리스트 중 선택된 하나 및 결정된 주파수 간격 값 사이의 차를 결정하도록 추가 구성될 수 있고, 상기 지시자는 상기 차에 추가 종속됨이 바람직하다.The encoder may be further configured to determine a difference between the selected one of the list of frequency interval values and the determined frequency interval value, wherein the indicator is further dependent on the difference.

상기 인코더는 주파수 간격 값들의 리스트 중 선택된 하나 및 결정된 주파수 간격 값 사이의 상기 결정된 차에 대한 차이 값들의 추가 리스트를 검색하고; 상기 결정된 차이 값에 보다 가깝게 매치하는 차이 값들의 추가 리스트 중 하나를 선택하도록 추가 구성될 수 있고, 상기 지시자는 상기 차이 값들의 추가 리스트 중 선택된 하나에 종속됨이 바람직하다.The encoder retrieves a further list of difference values for the determined difference between the selected one of the list of frequency interval values and the determined frequency interval value; It may be further configured to select one of an additional list of difference values that more closely matches the determined difference value, wherein the indicator is preferably dependent on the selected one of the additional list of difference values.

본 발명의 제14양태에 따라 오디오 신호를 부호화하기 위한 방법이 제안되고, 상기 방법은 적어도 두 개의 단일 주파수 성분들을 선택하는 단계; 및 상기 적어도 두 개의 단일 주파수 성분들을 나타내도록 구성되고, 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격 (frequency separation)에 종속되도록 구성되는 지시자를 생성하는 단계를 포함한다. According to a fourteenth aspect of the present invention there is proposed a method for encoding an audio signal, the method comprising: selecting at least two single frequency components; And generating an indicator configured to represent the at least two single frequency components, the indicator configured to be dependent on a frequency separation between the two single frequency components.

상기 방법은, 적어도 한 추가 단일 주파수 성분을 선택하는 단계를 더 포함할 수 있고, 상기 지시자는 상기 적어도 한 추가 단일 주파수 성분을 나타내도록 추가 구성됨이 바람직하고, 상기 지시자는 상기 적어도 한 추가 단일 주파수 성분, 및 상기 적어도 두 개의 단일 주파수 성분들 중 하나와의 사이의 주파수 간격에 종속되도록 추가 구성됨이 바람직하다. The method may further comprise selecting at least one additional single frequency component, wherein the indicator is further configured to indicate the at least one additional single frequency component, the indicator further comprising the at least one additional single frequency component. , And is further configured to depend on the frequency spacing between one of the at least two single frequency components.

상기 지시자는 상기 적어도 두 개의 단일 주파수 성분들 중 하나의 주파수에 추가 종속될 수 있다. The indicator may further depend on the frequency of one of the at least two single frequency components.

상기 방법은, 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격을 결정하는 단계를 더 포함할 수 있다. The method may further comprise determining a frequency interval between the two single frequency components.

상기 방법은, 상기 결정된 두 개의 단일 주파수 성분들 사이의 주파수 간격에 대한 주파수 간격 값들의 리스트를 서치하는 단계; 및 상기 결정된 두 개의 단일 주파수 성분들 사이의 주파수 간격에 보다 가깝게 매치하는 리스트 중 하나를 선택하는 단계를 더 포함할 수 있고, 상기 지시자는 주파수 간격 값들의 리스트 중 선택된 하나에 종속됨이 바람직하다. The method includes: searching a list of frequency interval values for a frequency interval between the determined two single frequency components; And selecting one of a list that more closely matches the frequency interval between the two determined single frequency components, wherein the indicator is preferably dependent on the selected one of the list of frequency interval values.

상기 방법은, 주파수 간격 값들의 리스트 중 선택된 하나 및 결정된 주파수 간격 값 사이의 차를 결정하는 단계를 더 포함할 수 있고, 상기 지시자는 상기 차에 추가 종속됨이 바람직하다.The method may further comprise determining a difference between the selected one of the list of frequency interval values and the determined frequency interval value, wherein the indicator is further dependent on the difference.

상기 방법은, 주파수 간격 값들의 리스트 중 선택된 하나 및 결정된 주파수 간격 값 사이의 상기 결정된 차에 대한 차이 값들의 추가 리스트를 검색하는 단계; 및 상기 결정된 차이 값에 보다 가깝게 매치하는 차이 값들의 추가 리스트 중 하나를 선택하는 단계를 더 포함할 수 있고, 상기 지시자는 상기 차이 값들의 추가 리스트 중 선택된 하나에 종속됨이 바람직하다.The method includes retrieving an additional list of difference values for the determined difference between the selected one of the list of frequency interval values and the determined frequency interval value; And selecting one of an additional list of difference values that more closely matches the determined difference value, wherein the indicator is preferably dependent on the selected one of the additional list of difference values.

본 발명의 제15양태에 따라 오디오 신호를 복호화하기 위한 디코더가 제안되고, 상기 디코더는, 적어도 두 개의 단일 주파수 성분들을 나타내고 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격을 나타내는 적어도 한 지시자를 수신하고; 상기 수신된 지시자에 종속된 상기 적어도 두 개의 단일 주파수 성분들을 삽입하도록 구성된다. According to a fifteenth aspect of the invention there is proposed a decoder for decoding an audio signal, the decoder receiving at least one indicator indicating at least two single frequency components and indicating a frequency interval between the two single frequency components and ; And insert the at least two single frequency components dependent on the received indicator.

상기 적어도 한 지시자는 적어도 한 추가 단일 주파수 성분을 나타내도록 추가 구성됨이 바람직하고, 상기 지시자는 상기 적어도 한 추가 단일 주파수 성분 및 상기 적어도 두 개의 단일 주파수 성분들 중 하나와의 사이의 주파수 간격에 종속되도록 추가 구성됨이 바람직하고, 상기 디코더는 상기 지시자에 종속되는 상기 적어도 한 추가 단일 주파수 성분을 삽입하도록 추가 구성됨이 바람직하다.Preferably, the at least one indicator is further configured to indicate at least one additional single frequency component, wherein the indicator is dependent on a frequency interval between the at least one additional single frequency component and one of the at least two single frequency components. It is preferably further configured and the decoder is further configured to insert the at least one additional single frequency component dependent on the indicator.

본 발명의 제16양태에 따라 오디오 신호를 복호화하기 위한 방법이 제안되고, 상기 방법은, 적어도 두 개의 단일 주파수 성분들을 나타내고 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격을 나타내는 적어도 한 지시자를 수신하는 단계; 및 상기 수신된 지시자에 의존해 상기 적어도 두 개의 단일 주파수 성분들을 삽입하는 단계를 포함한다.According to a sixteenth aspect of the present invention there is proposed a method for decoding an audio signal, the method comprising receiving at least one indicator representing at least two single frequency components and indicating a frequency interval between the two single frequency components step; And inserting the at least two single frequency components in dependence on the received indicator.

상기 적어도 한 지시자는 적어도 한 추가 단일 주파수 성분을 나타내도록 추가 구성됨이 바람직하고, 상기 지시자는 상기 적어도 한 추가 단일 주파수 성분 및 상기 적어도 두 개의 단일 주파수 성분들 중 하나와의 사이의 주파수 간격에 종속되도록 추가 구성됨이 바람직하고, 상기 방법은 상기 지시자에 의존해 상기 적어도 한 추가 단일 주파수 성분을 삽입하는 단계를 더 포함할 수 있다. Preferably, the at least one indicator is further configured to indicate at least one additional single frequency component, wherein the indicator is dependent on a frequency interval between the at least one additional single frequency component and one of the at least two single frequency components. Further configuration is preferred, and the method may further comprise inserting the at least one additional single frequency component depending on the indicator.

본 발명의 제17양태에 따라 상술한 것과 같은 인코더를 포함하는 장치가 제안된다.According to a seventeenth aspect of the present invention there is proposed an apparatus comprising an encoder as described above.

본 발명의 제18양태에 따라 상술한 것과 같은 디코더를 포함하는 장치가 제안된다.According to an eighteenth aspect of the present invention there is proposed an apparatus comprising a decoder as described above.

본 발명의 제19양태에 따라 상술한 것과 같은 인코더를 포함하는 전자 기기가 제안된다.According to a nineteenth aspect of the present invention there is proposed an electronic device comprising an encoder as described above.

본 발명의 제20양태에 따라 상술한 것과 같은 디코더를 포함하는 전자 기기가 제안된다.According to a twentieth aspect of the invention there is proposed an electronic device comprising a decoder as described above.

본 발명의 제21양태에 따라, 오디오 신호를 부호화하는 방법을 수행하도록 구성된 컴퓨터 프로그램 제품이 제안되고, 그 방법은, 적어도 두 개의 단일 주파수 성분들을 선택하는 단계; 및 상기 적어도 두 개의 단일 주파수 성분들을 나타내도록 구성되고, 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격 (frequency separation)에 종속되도록 구성되는 지시자를 생성하는 단계를 포함한다.According to a twenty-first aspect of the present invention, a computer program product configured to perform a method of encoding an audio signal is proposed, the method comprising: selecting at least two single frequency components; And generating an indicator configured to represent the at least two single frequency components, the indicator configured to be dependent on a frequency separation between the two single frequency components.

본 발명의 제22양태에 따라, 오디오 신호를 복호화하는 방법을 수행하도록 구성된 컴퓨터 프로그램 제품이 제안되고, 그 방법은 적어도 두 개의 단일 주파수 성분들을 나타내고 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격을 나타내는 적어도 한 지시자를 수신하는 단계; 및 상기 수신된 지시자에 의존해 상기 적어도 두 개의 단일 주파수 성분들을 삽입하는 단계를 포함한다. According to a twenty-second aspect of the present invention, a computer program product configured to perform a method of decoding an audio signal is proposed, wherein the method represents at least two single frequency components and represents a frequency interval between the two single frequency components Receiving at least one indicator; And inserting the at least two single frequency components in dependence on the received indicator.

본 발명의 제23양태에 따라 오디오 신호를 부호화하기 위한 인코더가 제안되고, 상기 인코더는, 적어도 두 개의 단일 주파수 성분들을 선택하기 위한 선택 수단; 및 상기 적어도 두 개의 단일 주파수 성분들을 나타내도록 구성되고, 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격 (frequency separation)에 종속되도록 구성되는 지시자를 생성하도록 하는 지시자 생성 수단을 포함한다.According to a twenty third aspect of the present invention, an encoder for encoding an audio signal is proposed, the encoder comprising: selection means for selecting at least two single frequency components; And indicator generating means configured to generate the indicator, the indicator being configured to indicate the at least two single frequency components, the indicator being configured to be dependent on a frequency separation between the two single frequency components.

본 발명의 제24양태에 따라 오디오 신호를 복호화하기 위한 디코더가 제안되고, 상기 디코더는, 적어도 두 개의 단일 주파수 성분들을 나타내고 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격을 나타내는 적어도 한 지시자를 수신하기 위한 수신 수단; 및 상기 수신된 지시자에 의존해 상기 적어도 두 개의 단일 주파수 성분들을 삽입하도록 하는 삽입 수단을 포함한다.According to a twenty-fourth aspect of the present invention, a decoder for decoding an audio signal is proposed, wherein the decoder receives at least one indicator indicating at least two single frequency components and indicating a frequency interval between the two single frequency components Receiving means for; And insertion means for inserting the at least two single frequency components in dependence on the received indicator.

본 발명을 더 잘 이해하기 위해 지금부터 예로서 첨부된 도면을 참조할 것이다.
도 1은 본 발명의 실시예들을 이용하는 전자 기기를 개략적으로 도시한 것이다.
도 2는 본 발명의 실시예들을 이용하는 오디오 코덱 시스템을 개략적으로 도시한 것이다.
도 3은 도 2에 도시된 오디오 코덱 시스템의 인코더 부분을 개략적으로 도시한 것이다.
도 4는 도 3에 도시된 인코더의 고주파 영역 인코더 부분의 개략도를 도시한 것이다.
도 5는 오디오 코덱 시스템의 디코더 부분을 개략적으로 도시한 것이다.
도 6은 본 발명의 실시예들에 따라 도 3 및 4에 도시된 오디오 인코더 실시예의 동작을 예시한 흐름도를 보인다.
도 7은 본 발명에 따라 도 5에 도시된 것과 같은 오디오 디코더 실시예의 동작을 예시한 흐름도를 보인다.
도 8은 본 발명의 실시예들에 따른 오디오 신호, 삽입된 사인파 포지션들, 및 사인파 포지션들의 부호화에 대한 스펙트럼 표현의 예들을 보인다.
도 9는 본 발명의 실시예들에 따른 오디오 신호 및 삽입된 사인파 포지션들의 스펙트럼 표현들의 추가 예들을 보인다.Reference will now be made to the accompanying drawings as an example in order to better understand the present invention.
1 schematically illustrates an electronic device using embodiments of the present invention.
2 schematically illustrates an audio codec system using embodiments of the present invention.
FIG. 3 schematically shows an encoder portion of the audio codec system shown in FIG. 2.
FIG. 4 shows a schematic diagram of a high frequency region encoder portion of the encoder shown in FIG. 3.
5 schematically illustrates a decoder portion of an audio codec system.
6 shows a flowchart illustrating the operation of the audio encoder embodiment shown in FIGS. 3 and 4 in accordance with embodiments of the present invention.
7 shows a flowchart illustrating the operation of an audio decoder embodiment as shown in FIG. 5 in accordance with the present invention.
8 shows examples of spectral representations for the encoding of an audio signal, embedded sinusoidal positions, and sinusoidal positions in accordance with embodiments of the present invention.
9 shows further examples of spectral representations of an audio signal and embedded sinusoidal positions in accordance with embodiments of the present invention.

이하에서는 계층화되거나 스케일러블한 (scalable) 가변 레이트 오디오 코덱들의 제공을 위해 가능한 코덱 메커니즘들을 상세히 설명할 것이다. 이와 관련해 본 발명의 일 실시예에 따른 코덱을 포함할 수 있는 전형적 전자 기기(10)의 개략적 블록도인 도 1을 우선 참조할 것이다.The following will describe in detail the possible codec mechanisms for the provision of layered or scalable variable rate audio codecs. In this regard, reference is first made to FIG. 1, which is a schematic block diagram of an exemplary electronic device 10 that may include a codec according to one embodiment of the invention.

전자 기기(10)는 예를 들어, 무선 통신 시스템의 모바일 단말이나 사용자 기기일 수 있다. The electronic device 10 may be, for example, a mobile terminal or a user device of a wireless communication system.

전자 기기(10)는 마이크로폰(11)을 구비하는데, 이것은 아날로그-디지털 컨버터(14)를 통해 프로세서(21)와 링크된다. 프로세서(21)는 디지털-아날로그 컨버터 (DAC)(32)를 통해 확성기들(33)과도 링크된다. 프로세서(21)는 트랜시버 (TX/RX)(13)에도, 사용자 인터페이스 (UI)(15)에도, 그리고 메모리(22)에도 링크된다. The electronic device 10 has a microphone 11, which is linked with the processor 21 via an analog-to-digital converter 14. The processor 21 is also linked with the loudspeakers 33 via a digital-to-analog converter (DAC) 32. The processor 21 is linked to the transceiver (TX / RX) 13, to the user interface (UI) 15, and to the memory 22.

프로세서(21)는 다양한 프로그램 코드들을 실행하도록 구성될 수 있다. 구현되는 프로그램 코드들은 오디오 신호의 저주파 대역 및 오디오 신호의 고주파 대역을 부호화하기 위한 오디오 부호화 코드를 포함한다. 구현되는 프로그램 코드들(23)은 오디오 복화화 코드 또한 포함한다. 구현되는 프로그램 코드들(23)은 필요할 때마다 프로세서(21)에 의해 검색되도록 메모리(22) 등에 저장될 수 있다. 메모리(22)는 본 발명에 따라 부호화되었던 데이터와 같은 데이터를 저장하는 섹션(24)을 더 제공할 수 있을 것이다.The processor 21 may be configured to execute various program codes. Program codes to be implemented include audio encoding code for encoding a low frequency band of the audio signal and a high frequency band of the audio signal. Program codes 23 implemented also include audio decoding code. Program codes 23 to be implemented may be stored in the memory 22 or the like to be retrieved by the processor 21 whenever necessary. The memory 22 may further provide a section 24 for storing data, such as data that has been encoded in accordance with the present invention.

부호화 및 복호화 코드는 본 발명의 실시예들에 있어 하드웨어나 펌웨어를 통해 구현될 것이다.The encoding and decoding code may be implemented through hardware or firmware in embodiments of the present invention.

사용자 인터페이스(15)는 사용자가 가령 키패드를 통해 전자 기기(10)로 명령들을 입력할 수 있고/있거나 디스플레이 등을 통해 전자 기기(10)로부터 정보를 얻을 수 있게 한다. 트랜시버(13)는 무선 통신 네트워크 등을 통해 다른 전자 기기들과 통신할 수 있게 한다. The user interface 15 allows a user to enter commands into the electronic device 10, for example via a keypad, and / or obtain information from the electronic device 10 via a display or the like. The transceiver 13 enables communication with other electronic devices via a wireless communication network or the like.

다시 한번, 전자 기기(10)의 구조는 여러 방법으로 보완 및 변경될 수 있다는 것을 알아야 한다.Once again, it should be appreciated that the structure of the electronic device 10 can be supplemented and changed in many ways.

전자 기기(10)의 사용자는 스피치를 입력하기 위해 마이크로폰(11)을 사용할 수 있고, 스피치는 어떤 다른 전자 기기로 전송되거나 메모리(22)의 데이터 섹션(24)에 저장될 것이다. 해당 애플리케이션이 이러한 목적으로 사용자 인터페이스(15)를 통해 사용자에 의해 작동되었을 수 있다. 이 애플리케이션은 프로세서(21)에 의해 구동될 수 있는 것으로, 프로세서(21)로 하여금 메모리(22)에 저장된 부호화 코드를 실행시키게 한다.The user of the electronic device 10 can use the microphone 11 to input speech, which will be sent to any other electronic device or stored in the data section 24 of the memory 22. The application may have been operated by the user via the user interface 15 for this purpose. This application can be driven by the processor 21, causing the processor 21 to execute the encoded code stored in the memory 22.

아날로그-디지털 변환기(14)는 입력된 아날로그 오디오 신호를 디지털 오디오 신호로 변환하고, 그 디지털 오디오 신호를 프로세서(21)로 제공한다. The analog-to-digital converter 14 converts the input analog audio signal into a digital audio signal and provides the digital audio signal to the processor 21.

그러면 프로세서(21)가 도 2 및 3을 참조해 설명하는 것과 같은 방법으로 그 디지털 오디오 신호를 처리할 수 있다. Processor 21 can then process the digital audio signal in the same manner as described with reference to FIGS. 2 and 3.

그 결과에 따른 비트 스트림이 트랜시버(13)로 주어져서 다른 전자 기기로 전송되게 된다. 이와 달리, 코딩된 데이터는 예를 들어 같은 전자 기기(10)에 의해 나중에 전송되거나 나중에 제공되기 위해 이를테면 메모리(22)의 데이터 섹션(24)에 저장될 수도 있다. The resulting bit stream is given to the transceiver 13 for transmission to other electronic devices. Alternatively, the coded data may be stored, for example, in the data section 24 of the memory 22, for example for later transmission by or the same electronic device 10.

전자 기기(10)는 자신의 트랜시버(13)를 통해 다른 전자 기기로부터 상응하게 부호화된 데이터를 포함하는 비트 스트림을 또한 수신할 수도 있을 것이다. 이 경우, 프로세서(21)는 메모리(22)에 저장된 복호화 프로그램 코드를 실행할 수 있다. 프로세서(21)는 수신된 데이터를 복호화하고, 그 복호화된 데이터를 디지털-아날로그 컨버터(32)로 제공한다. 디지털-아날로그 컨버터(32)는 디지털 복호화된 데이터를 아날로그 오디오 데이터로 변환하여 그들을 확성기들(33)을 거쳐 출력시킨다. 복호화 프로그램 코드의 실행 역시, 사용자 인터페이스(15)를 통해 사용자가 호출했던 애플리케이션에 의해 시작될 수 있다. The electronic device 10 may also receive a bit stream containing correspondingly coded data from another electronic device via its transceiver 13. In this case, the processor 21 may execute the decrypted program code stored in the memory 22. The processor 21 decodes the received data and provides the decoded data to the digital-analog converter 32. The digital-analog converter 32 converts the digitally decoded data into analog audio data and outputs them through the loudspeakers 33. Execution of the decryption program code may also be initiated by an application called by the user via the user interface 15.

수신된 부호화된 데이터도 확성기들(33)을 통한 즉각적 제공 대신에, 가령 나중에 제공하거나 또 다른 전자 기기로 포워딩할 수 있도록 메모리(22)의 데이터 섹션(24)에 저장될 수 있다. Received encoded data may also be stored in the data section 24 of the memory 22 so that it can be provided later or forwarded to another electronic device instead of an immediate provision through the loudspeakers 33.

도 2 내지 4에 도시된 개략적 구조들 및 도 7과 8의 방법의 단계들은, 도 1에 도시된 전자 기기에서 구현된다고 예로써 도시된 완전한 오디오 코덱의 동작 중 일부만을 나타낸 것임을 예상할 수 있을 것이다.It can be expected that the schematic structures shown in FIGS. 2-4 and the steps of the method of FIGS. 7 and 8 represent only some of the operations of the complete audio codec shown as an example implemented in the electronic device shown in FIG. 1. .

본 발명의 실시예들에 의해 사용되는 오디오 코덱들의 일반적인 동작이 도 2에 도시된다. 일반적인 오디오 부호화/복호화 시스템들은 도 2에 개략적으로 도시된 것 같은 인코더 및 디코더로 이뤄진다. 인코더(104), 저장부 또는 미디어 채널(106) 및 디코더(108)를 갖춘 시스템(102)이 도시되어 있다.The general operation of audio codecs used by embodiments of the invention is shown in FIG. Typical audio encoding / decoding systems consist of an encoder and a decoder as shown schematically in FIG. Shown is a system 102 having an encoder 104, a storage or media channel 106, and a decoder 108.

인코더(104)는 입력 오디오 신호(110)를 압축하여 미디어 채널(106)을 통해 전송되거나 저장되는 비트 스트림(112)을 생성한다. 비트 스트림(112)은 디코더(108) 안에서 수신될 수 있다. 디코더(108)는 비트 스트림(112)을 압축해제하여 출력 오디오 신호(114)를 생성한다. 비트 스트림(112)의 비트 레이트 및 입력 신호(110) 관련 출력 오디오 신호(114)의 품질이 코딩 시스템(102)의 성능을 규정하는 주요 특징들이다. The encoder 104 compresses the input audio signal 110 to produce a bit stream 112 that is transmitted or stored over the media channel 106. Bit stream 112 may be received within decoder 108. Decoder 108 decompresses bit stream 112 to produce output audio signal 114. The bit rate of the bit stream 112 and the quality of the output audio signal 114 associated with the input signal 110 are the main features that define the performance of the coding system 102.

도 3은 본 발명의 일 실시예에 따른 인코더(104)를 개략적으로 도시한 것이다. 인코더(104)는 오디오 신호를 수신하도록 구성된 입력부(203)를 구비한다. 입력부(203)는 로 패스 필터(230), 하이 패스/밴드 패스 필터(235)에 연결된다. 로 패스 필터(230)는 신호를 저주파 영역 (LFR)코더 (코어 코덱(core codec)이라고도 알려져 있음)(231)로 출력한다. 저주파 영역 코더(231)는 고주파 영역 (HFR) 코더(232)로 신호를 출력하도록 구성된다. 하이 패스/밴드 패스 필터(235)가 HFR 코더(232)에 연결된다. LFR 코더(231), HFR 코더(232)는 비트스트림 포매터 (formatter)(234) (본 발명의 어떤 실시예들에서 비트스트림 멀티플렉서로도 알려짐)로 신호들을 출력하도록 구성된다. 비트스트림 포매터(234)는 출력(205)을 통해 출력 비트스트림(112)을 출력하도록 구성된다.3 schematically illustrates an encoder 104 according to an embodiment of the present invention. Encoder 104 has an input 203 configured to receive an audio signal. The input unit 203 is connected to the low pass filter 230 and the high pass / band pass filter 235. The low pass filter 230 outputs the signal to a low frequency region (LFR) coder (also known as a core codec) 231. The low frequency region coder 231 is configured to output a signal to a high frequency region (HFR) coder 232. High pass / band pass filter 235 is coupled to HFR coder 232. LFR coder 231 and HFR coder 232 are configured to output signals to bitstream formatter 234 (also known as a bitstream multiplexer in some embodiments of the present invention). Bitstream formatter 234 is configured to output output bitstream 112 via output 205.

본 발명의 어떤 실시예들에서, 하이 패스/밴드 패스 필터(235)는 옵션사항이어서, 오디오 신호가 HFR 코더(232)로 바로 건너갈 수도 있다. In some embodiments of the present invention, high pass / band pass filter 235 is optional such that an audio signal may be passed directly to HFR coder 232.

이러한 구성요소들의 동작은 코더(104)의 동작을 보이는 도 6의 흐름도를 참조해 더 상세히 기술될 것이다. The operation of these components will be described in more detail with reference to the flowchart of FIG. 6 showing the operation of the coder 104.

오디오 신호가 코더(104)에 의해 수신된다. 본 발명의 제1실시예에서 오디오 신호는 디지털 방식으로 샘플링된 신호이다. 본 발명의 다른 실시예들에서 오디오 입력은 마이크로폰(6) 등으로부터의 아날로그 오디오 신호일 수 있고, 이것이 아날로그-디지털 (A/D) 변환된다. 본 발명의 또 다른 실시예들에서 오디오 입력은 펄스 코드 변조 디지털 신호에서 진폭 변조 디지털 신호로 변환된다. 오디오 신호의 수신은 도 7의 601 단계에서 보이고 있다.The audio signal is received by the coder 104. In the first embodiment of the present invention, the audio signal is a digitally sampled signal. In other embodiments of the invention the audio input may be an analog audio signal from microphone 6 or the like, which is analog-to-digital (A / D) converted. In still other embodiments of the invention the audio input is converted from a pulse code modulated digital signal to an amplitude modulated digital signal. The reception of the audio signal is shown in step 601 of FIG.

로 패스 필터(230) 및 하이 패스/대역 패스 필터(235)는 오디오 신호를 수신하고 입력 신호(110)가 필터링되는 컷오프 (cut-off) 주파수를 규정한다. 컷오프 주파수 아래에 있는 수신된 오디오 신호 주파수들이 로 패스 필터(230)에 의해 저주파 영역 (LFR) 코더(231)로 보내진다. 컷오프 주파수 위에 있는 수신된 오디오 신호 주파수들은 하이 패스 필터(235)에 의해 고주파 영역(HFR) 코더(232)로 보내진다. 본 발명의 어떤 실시예들에서 신호는 저주파 코더(231)의 부호화 효율을 추가로 개선하기 위한 옵션으로서 다운 샘플링된다. The low pass filter 230 and the high pass / band pass filter 235 define a cut-off frequency at which the audio signal is received and the input signal 110 is filtered. Received audio signal frequencies below the cutoff frequency are sent by a low pass filter 230 to a low frequency region (LFR) coder 231. Received audio signal frequencies above the cutoff frequency are sent by a high pass filter 235 to a high frequency region (HFR) coder 232. In some embodiments of the present invention the signal is down sampled as an option to further improve the coding efficiency of low frequency coder 231.

LFR 코더(231)는 저주파 (그리고 옵션으로서 다운 샘플링된) 오디오 신호를 수신하고 그 신호에 대해 알맞은 저주파수 코딩을 적용한다. 본 발명의 제1실시예에서, 저주파 코더(231)는 양자화 및 32 개의 저주파 서브 대역들을 가진 허프만 (Huffman) 코딩을 적용한다. 입력 신호(110)는 분석 (analysis) 필터 뱅크 구조를 이용해 서브 대역들로 나눠진다. 각각의 서브 대역은 음향심리학적 모델에 의해 주어진 정보를 활용해 양자화 및 부호화될 수 있다. 양자화 설정 및 코딩 계획은 적용된 음향심리학적 모델에 의해 지시될 수 있다. 양자화되고 부호화된 정보는 비트 스트림(12) 생성을 위해 비트 스트림 포매터(234)로 보내진다.LFR coder 231 receives a low frequency (and optionally down sampled) audio signal and applies appropriate low frequency coding to that signal. In the first embodiment of the present invention, the low frequency coder 231 applies quantization and Huffman coding with 32 low frequency subbands. The input signal 110 is divided into subbands using an analysis filter bank structure. Each subband can be quantized and coded using the information given by the psychoacoustic model. The quantization setup and coding scheme can be dictated by the applied psychoacoustic model. The quantized and encoded information is sent to the bit stream formatter 234 for bit stream 12 generation.

또, LFR 코더(231)는 변형 이산 코사인 변환 (MDCT, modified discrete cosine transform)을 이용해 저주파 콘텐츠를 변환하여 합성 LFR 신호의 주파수 도메인 실현물들 (realizations)을 도출한다. 이러한 주파수 도메인 실현물들은 HFR 프로세서(232)로 전달된다. The LFR coder 231 also transforms low frequency content using a modified discrete cosine transform (MDCT) to derive frequency domain realizations of the synthesized LFR signal. These frequency domain implementations are passed to the HFR processor 232.

이러한 저주파수 코딩이 도 6의 606 단계에 도시되고 있다.This low frequency coding is shown in step 606 of FIG.

본 발명의 다른 실시예들에서는 비트스트림 포매터(234)로 출력되는 코어 (core) 코딩 출력을 생성하기 위해 다른 저주파 코덱들이 사용될 수 있다. 이러한 저주파 코덱들의 다른 실시예들의 예들에는 비한정적인 예들로서 AAC (advanced audio coding), MP3 (MPEG layer 3), ITU-T EV-VBR (Embedded variable rate) 스피치 코딩 베이스라인 코덱, 및 ITU-T G.729.1가 포함된다.In other embodiments of the present invention, other low frequency codecs may be used to generate a core coded output that is output to the bitstream formatter 234. Examples of other embodiments of such low frequency codecs include, but are not limited to, advanced audio coding (AAC), MPEG layer 3 (MP3), embedded variable rate (EV-VBR) speech coding baseline codec, and ITU-T. G.729.1 is included.

저주파 영역 코더(231)가 코딩 프로세스의 일부로서 한 주파수 도메인 합성 출력을 효과적으로 출력하지 못할 경우, 저주파 영역 (LFR) 코더(231)는 저주파 디코더 및 주파수 도메인 컨버터 (도 3에 도시되지 않음)를 더 포함해, 저주파 신호의 합성 (synthetic) 복제품을 생성하도록 할 수 있다. 그러면 본 발명의 실시예들이 주파수 도메인 표현으로 전환될 수 있고, 필요시 HFR 코더(232)로 전송되는 일련의 저주파 서브 대역들로 분할된다.If the low frequency region coder 231 fails to effectively output one frequency domain synthesized output as part of the coding process, the low frequency region (LFR) coder 231 may add a low frequency decoder and a frequency domain converter (not shown in FIG. 3). It can be included to create a synthetic copy of a low frequency signal. Embodiments of the present invention may then be converted to a frequency domain representation, which is divided into a series of low frequency subbands that are transmitted to the HFR coder 232 as needed.

이것은 본 발명의 실시예들에서 가능한 광범위한 코더/디코더들로부터 저주파 영역 코더(231)의 선택을 가능하게 하고, 그로써 본 발명은 출력의 일부로서 주파수 도메인 정보를 생성하는 특정 저주파 또는 코어 코더 알고리즘들에 국한되지 않게 된다. This allows the selection of low frequency region coder 231 from the widest range of coders / decoders possible in embodiments of the present invention, whereby the present invention is directed to specific low frequency or core coder algorithms that generate frequency domain information as part of the output. It will not be limited.

고주파 영역 (HFR) 코더(232)가 도 4에 보다 상세히 개략적으로 도시된다.A high frequency region (HFR) coder 232 is schematically shown in more detail in FIG. 4.

고주파 영역 코더(232)는 하이 패스/밴드 패스 필터(235)로부터 신호를 수신하며, 그 신호는 MDCT/SDFT (modified discrete cosine transfor/shifted discrete Fourier transform) 프로세서(301)로 입력된다. The high frequency domain coder 232 receives a signal from the high pass / band pass filter 235, which is input to a modified discrete cosine transfor / shifted discrete Fourier transform (MDCT / SDFT) processor 301.

MDCT/SDFT 변환기(301)로부터의 주파수 도메인 출력은 톤 선택 제어기(303), 고주파 영역 (HFR) 대역 리플리컨트 (replicant) 스케일링 프로세서(307), 및 사인파 주입 선택 (sinusoid injection selection)/인코딩 프로세서(309)로 전달된다.The frequency domain output from the MDCT / SDFT converter 301 may include a tone selection controller 303, a high frequency domain (HFR) band replicating scaling processor 307, and a sinusoid injection selection / encoding processor 309. Is delivered.

톤 선택 제어기(303)는 HFR 대역 리플리컨트 선택 프로세서(305), HFR 대역 리플리컨트 스케일링 프로세서(307), 사인파 주입 선택/인코딩 프로세서(309), 및 멀티플렉서(311)를 제어하거나 설정하도록 구성된다. HFR 대역 리플리컨트 선택 프로세서(305)는 또한 LFR 코더(231)로부터 주파수 도메인 형식의 합성 저주파 영역 신호를 수신하다. HFR 대역 리플리컨트 선택 프로세서(305)는 이제부터 설명되는 바와 같이 LFR 코더로부터, 선택된 HFR 대역들을 출력하고 그 선택사항을 HFR 대역 리플리컨트 스케일링 프로세서(307)로 전달할 것이다.The tone selection controller 303 is configured to control or set the HFR band replicant selection processor 305, the HFR band replicant scaling processor 307, the sinusoidal injection selection / encoding processor 309, and the multiplexer 311. The HFR band replicant selection processor 305 also receives a composite low frequency domain signal in frequency domain format from the LFR coder 231. The HFR Band Replicant Selection Processor 305 will output the selected HFR bands from the LFR coder and forward the selection to the HFR Band Replicant Scaling Processor 307, as will now be described.

HFR 대역 리플리컨트 스케일링 프로세서(305)는 그 선택사항 및 스케일링 요소들의 부호화된 형태를 멀티플렉서(311)로 전달해 데이터 스트림(112) 안에 삽입되게 한다. 또, HFR 대역 리플리컨트 스케일링 프로세서(307)는 선택되고 스케일링된 HFR 영역의 표현을 사인파 주입 선택/인코딩 프로세서(309)로 전달한다. 사인파 주입 선택/인코딩 프로세서(309)는 출력 데이터 스트림(112)에 산입될 신호를 멀티플렉서로 전달한다. HFR band replicant scaling processor 305 passes the selection and the encoded form of the scaling elements to multiplexer 311 for insertion into data stream 112. The HFR Band Replicant Scaling Processor 307 also delivers a representation of the selected and scaled HFR region to the sinusoidal injection selection / encoding processor 309. Sinusoidal injection selection / encoding processor 309 delivers the signal to be multiplexed into output data stream 112.

이제 도 6과 도 4를 참조해 HFR 인코더가 어떻게 작동하는지 상세히 설명할 것이다. Referring now to Figures 6 and 4, how the HFR encoder works will be described in detail.

MDCT/SDFT 프로세서(301)는 HP/BP 필터(235)로부터 수신된 고주파 영역 오디오 신호를 신호의 주파수 도메인 표현으로 전환한다. The MDCT / SDFT processor 301 converts the high frequency domain audio signal received from the HP / BP filter 235 into a frequency domain representation of the signal.

본 발명의 어떤 실시예들에서, MDCT/SDFT 프로세서는 게다가 고주파 오디오 신호를 짧은 주파수 서브 대역들로 분할한다. 그 주파수 서브 대역들은 대략 500-800 Hz 넓이에 걸칠 수 있다. 본 발명의 어떤 실시예들에서 주파수 서브 대역들은 서로 다른 대역폭들을 가진다. 한 실시예에서, 주파수 서브 대역들은 750 Hz의 대역폭을 가진다. 본 발명의 다른 실시예들에서 주파수 서브 대역들의 대역폭은 상이하든 동일하든, 고주파 영역에 대한 대역 할당에 좌우될 수 있다. In some embodiments of the invention, the MDCT / SDFT processor further divides the high frequency audio signal into short frequency subbands. The frequency subbands can span approximately 500-800 Hz. In some embodiments of the invention the frequency subbands have different bandwidths. In one embodiment, the frequency subbands have a bandwidth of 750 Hz. In other embodiments of the present invention, the bandwidth of the frequency subbands may be different or the same, depending on the band allocation for the high frequency region.

본 발명의 제1실시예에서, 주파수 서브 대역 대역폭은 일정하다, 즉 프레임 별로 달라지지 않는다. 본 발명의 다른 실시예들에서, 주파수 서브 대역 대역폭은 일정하지 않으며 한 주파수 서브 대역은 시간에 따라 달라지는 대역폭을 가질 수 있다. In the first embodiment of the invention, the frequency subband bandwidth is constant, i.e. does not vary from frame to frame. In other embodiments of the present invention, the frequency subband bandwidth is not constant and one frequency subband may have a bandwidth that varies with time.

본 발명의 일부 실시예들에서 이러한 가변적인 서브 대역 대역폭 할당치는 오디오 신호의 음향심리학적 모델링에 기초해 결정될 수 있다. 이러한 주파수 서브 대역들은 다시 본 발명의 다양한 실시예들에서 연속적이거나 (즉, 하나 다음에 하나가 차례로 오게 되어 연속 스펙트럼 실현물을 생성함) 부분적으로 오버랩될 수 있다.In some embodiments of the present invention such variable subband bandwidth allocation may be determined based on psychoacoustic modeling of the audio signal. These frequency subbands may again be contiguous in the various embodiments of the present invention (ie, one after the other in order to create a continuous spectrum realization) and partially overlap.

시간 도메인의 주파수 도메인 변환 및 서브 대역 구성 단계가 도 6의 607 단계에 보인다. The frequency domain transform and subband configuration steps of the time domain are shown in step 607 of FIG.

톤 선택 제어기(303)는 차례로 HFR 대역 리플리컨트 선택, 스케일링, 사인파 주입 선택 및 인코딩, 그리고 멀티플렉서를 제어해 고주파 영역에 대해 보다 효율적인 인코딩이 수행될 수 있도록 구성될 수 있다.The tone selection controller 303 may in turn be configured to control HFR band replicant selection, scaling, sinusoidal injection selection and encoding, and multiplexer so that more efficient encoding may be performed for the high frequency region.

MDCT/SDFT 프로세서(301)로부터 쉬프트 된 이산 푸리에 변환 출력 (SDFT)이 톤 선택 제어기(303)에서 수신된다.Discrete Fourier Transform Output (SDFT) shifted from MDCT / SDFT processor 301 is received at tone selection controller 303.

2N 개의 샘플들 (본 발명의 바람직한 실시예들에 있어 한 프레임이라고 간주될 수 있음)에 대해 정의된 SDFT (shifted discrete Fourier transform)의 예가 식 1에 보인다:An example of a shifted discrete Fourier transform (SDFT) defined for 2N samples (which can be considered one frame in preferred embodiments of the present invention) is shown in Equation 1:

여기서 h(n)은 스케일링 윈도 (scaling window)이고, x(n)은 오리지널 입력 신호이고, u 및 v는 각각 시간 및 주파수 도메인 쉬프트들을 나타낸다.Where h (n) is the scaling window, x (n) is the original input signal, and u and v represent the time and frequency domain shifts, respectively.

본 발명의 일 실시예에서 u 및 v는 u=(N+1)/2이고 v=1/2가 되게 선택될 수 있는데 이는 선택된 SDFT 변환의 실수부도 MDCT 변환으로서 사용될 수 있기 때문이다. 따라서 이것은 MDCT 변환기 및 SDFT 변환기가 하나의 시간-주파수 (time to frequency) 도메인 연산 내에서 구현되는 것을 가능하게 하고, 그에 따라 기기의 복잡도를 줄일 수 있다. In one embodiment of the present invention u and v may be selected such that u = (N + 1) / 2 and v = 1/2 since the real part of the selected SDFT transform may also be used as an MDCT transform. This thus enables the MDCT converter and the SDFT converter to be implemented within one time-frequency domain operation, thereby reducing the complexity of the device.

톤 선택 제어기(303)는 입력된 고주파 영역 신호가 노멀 (normal)인지 톤 (tonal)인지 여부를 검출하도록 구성될 수 있다. 톤 선택 제어기(303)는 현재와 과거 프레임에 대한 SDFT 출력을 비교함으로써 신호의 특성을 판단할 수 있다.The tone selection controller 303 may be configured to detect whether the input high frequency region signal is normal or tonal. The tone selection controller 303 may determine the characteristics of the signal by comparing the SDFT outputs for the current and past frames.

현재와 과거의 SDFT 프레임들이 각각 Y_b(k) 및 Y_b1(k)라고 정의될 때, 이 프레임들 간 유사도가 인덱스 S로써 계측될 수 있다. S는 식 2에 규정된다. When current and past SDFT frames are defined as Y _b (k) and Y _b1 (k), respectively, the similarity between these frames can be measured as the index S. S is defined in equation (2).

N_L+1은 고주파수 코딩의 한계 주파수에 해당한다. 파라미터 S가 작을수록, 고주파수 스펙트럼들은 더 유사하다.N _L +1 corresponds to the limit frequency of high frequency coding. The smaller the parameter S, the more similar the high frequency spectra.

톤 선택 제어기는 S 값에 의존해 신호 특성 또는 모드를 할당하는 결정 로직을 포함할 수 있다. 더욱이 신호 특성이나 모드는 이하에서 상세히 설명되는 바와 같이 HFR 코더의 나머지를 제어하는데에도 사용된다.The tone selection controller may include decision logic to assign a signal characteristic or mode depending on the S value. Moreover, signal characteristics or modes are also used to control the rest of the HFR coder as described in detail below.

이하에서는 오디오 신호의 두 가지 특성들인 모드들이 정의된 본 발명의 일 실시예를 보인다. 이 특성들이나 모드들은 노멀 또는 톤이다. Hereinafter, an embodiment of the present invention in which two characteristics of an audio signal, modes are defined. These characteristics or modes are normal or tone.

S의 값이 소정 문턱치 S_lim 이상이면 톤 선택 제어기(303) 내 결정 로직은 노멀의 특성을 할당하도록 (이것이 HFR 코더의 나머지 부분에, 있을 수 있는 어떤 사인파 삽입과 더불어 노멀 코딩이 사용될 것임을 가리킬 수 있음) 구성될 수 있다.If the value of S is above the predetermined threshold S _lim , the decision logic in tone selection controller 303 may assign the characteristic of the normal (which may indicate that normal coding will be used with any sine wave insertion that may be in the rest of the HFR coder). Can be configured).

S의 값이 소정 문턱치 S_lim 미만이면 톤 선택 제어기(303) 내 결정 로직은 톤의 특성을 할당하도록 (이것이 HFR 코더의 나머지 부분에, 사인파 삽입만을 이용해 오디오 신호가 코딩될 수 있다는 것을 가리킬 수 있음) 구성될 수 있다. 노멀 코딩 모드의 파라미터드을 양자화하는데 아무 비트도 사용되고 있지 않으므로 더 많은 사인파들이 이 모드에서 추가될 수 있다.If the value of S is less than the predetermined threshold S _lim , the decision logic in tone selection controller 303 may indicate that the audio signal can be coded using sine wave insertion, only to the rest of the HFR coder, to assign the characteristics of the tone. Can be configured. Since no bits are used to quantize the parameters of the normal coding mode, more sine waves can be added in this mode.

두 가지 동작 모드들이 기술되었지만, 톤 선택 제어기는 둘을 초과하는 가능한 동작 모드들 (할당가능한 특성들)을 가질 수 있으며, 그 모드들 각각은 정해진 문턱 영역을 이용하고 HFR 코더의 나머지 부분에 오디오 신호를 어떻게 코딩할지에 대한 지시자를 제공한다.Although two modes of operation have been described, the tone selection controller may have more than two possible modes of operation (assignable characteristics), each of which uses a defined threshold area and provides an audio signal to the rest of the HFR coder. Provide an indicator of how to code.

톤 선택 제어기(303)는 멀티플렉서로 현 프레임에 할당된 특성이나 모드를 전달하여 어떤 동작 모드가 선택되었는지에 대한 표시를 제공하도록 하고, 그 표시 역시 디코더로 전달될 것이다.The tone selection controller 303 will pass the characteristic or mode assigned to the current frame to the multiplexer to provide an indication of which mode of operation has been selected, which will also be passed to the decoder.

모드들의 개수가 보통 적을 것이기 때문에, 그러한 동작 모드들을 부호화하는데 필요로 되는 비트 수도 마찬가지로 적다. Since the number of modes will usually be small, the number of bits required to encode such operating modes is likewise small.

톤 검출 모드 선택이 도 6의 609 단계에서 보인다. Tone detection mode selection is shown in step 609 of FIG.

이하의 예는 톤 선택 제어기(303)가, 현재의 프레임에 톤 특성이 규정되었음을 가리키고, 대역 리플리컨트 선택의 동작 (도 6의 611 단계), 대역 리플리컨트 스케일링 (도 6의 613 단계)의 동작, 및 사인파 주입 및 코딩의 동작 (도 6의 615 단계)이 수행되는 경우를 설명한다.The following example indicates that the tone selection controller 303 indicates that tone characteristics are defined in the current frame, the operation of band replicant selection (step 611 in FIG. 6), the operation of band replicant scaling (step 613 in FIG. 6), and A case where the operation of sine wave injection and coding (step 615 of FIG. 6) is performed will be described.

톤 선택 제어기(303)가 오디오 신호는 톤임을 가리키면, 아무런 대역 리플리컨트 선택이나 대역 리플리컨트 스케일링 동작도 수행되지 않고, 다만 사인파 주입 및 코딩 동작만이 수행된다. 리플리컨트 선택 및 리플리컨트 스케일링 동작들에 예비된 비트 할당치가 추가 사인파들의 선택 및 코딩에 사용될 수 있다. If the tone selection controller 303 indicates that the audio signal is a tone, no band replicant selection or band replicant scaling operation is performed, but only sinusoidal injection and coding operations are performed. Bit allocations reserved for replicant selection and replicant scaling operations may be used for the selection and coding of additional sine waves.

톤 선택 제어기(303)가 오디오 신호는 노멀임을 가리키면, 대역 리플리컨트 선택 및 대역 리플리컨트 스케일링 동작들이 수행된다. 노멀 모드의 성능은 사인파 주입에 의해 추가로 향상될 수 있다. If the tone selection controller 303 indicates that the audio signal is normal, band replicant selection and band replicant scaling operations are performed. The performance of the normal mode can be further improved by sinusoidal injection.

HFR 대역 리플리컨트 선택기(305)는 고주파 영역의 주파수 서브 대역들 각각의 스펙트럼 성분들 및 저주파 영역 부호화된 신호의 주파수 도메인 표현을 수신하고, 저주파 영역으로부터 고주파 영역 서브 대역들 각각과 매치되는 부분들을 선택한다. The HFR band replicant selector 305 receives spectral components of each of the frequency subbands in the high frequency region and a frequency domain representation of the low frequency domain coded signal, and selects portions from the low frequency region that match each of the high frequency region subbands. .

본 발명의 어떤 실시예들에서 가장 가깝게 매치하는 저주파 영역 서브 대역을 판단하기 위해 서브 대역 에너지가 사용된다.In some embodiments of the present invention, subband energy is used to determine the closest matching low frequency region subband.

본 발명의 다른 실시예들에서, 고주파 영역 서브 대역들의 상이하거나 추가적인 특성들이 정해지고, 매치되는 저주파 영역의 부분을 서치하는데 사용된다. 다른 특성들에는 비한정적 예들로서 각 서브 대역의 최고점-대-최저점 에너지 비 (peak-to-valley energy ratio) 및 신호 대역폭이 포함된다.In other embodiments of the invention, different or additional characteristics of the high frequency region subbands are defined and used to search for the portion of the low frequency region that is matched. Other features include non-limiting examples of peak-to-valley energy ratio and signal bandwidth of each subband.

본 발명의 어떤 실시예들에서, HFR 대역 리플리컨트 선택기(305) 내에서의 오디오 신호의 분석은, 부호화된 저주파 영역의 분석 및 오리지널 고주파 영역의 분석을 포함한다. 따라서 본 발명의 또 다른 실시예들에서 에너지 산정기가 부호화된 저주파 신호를 수신하고 그 신호들을 짧은 서브 대역들로 나누어 가령 '전체' 스펙트럼 서브 대역 당 에너지 및/또는 각각의 '전체' 스펙트럼 서브 대역의 최고점-대-최저점 에너지 비를 판단함으로써 유효 스펙트럼 전체의 특성들을 판단하게 된다. In some embodiments of the present invention, analysis of the audio signal in the HFR band replicant selector 305 includes analysis of the encoded low frequency region and analysis of the original high frequency region. Thus, in still other embodiments of the present invention, the energy estimator receives an encoded low frequency signal and divides the signals into short subbands, such as energy per 'full' spectral subband and / or of each 'full' spectral subband. By determining the peak-to-bottom energy ratio, the characteristics of the entire effective spectrum are determined.

본 발명의 다른 실시예들에서 에너지 산정기는 부호화된 저주파 신호를 더 수신하고 (필요한 경우) 이들을 짧은 서브 대역들로 나누어 분석되도록 한다. 인코더로부터의 저주파 도메인 신호 출력이 이제, 가령 저주파 도메인 서브 대역 당 에너지 및/또는 각각의 저주파 도메인 서브 대역의 최고점-대-최저점 에너지 비율을 판단하기 위해, 고주파 도메인 신호와 비슷한 방식으로 분석된다.In other embodiments of the invention the energy estimator further receives the encoded low frequency signals (if necessary) and divides them into short subbands for analysis. The low frequency domain signal output from the encoder is now analyzed in a manner similar to the high frequency domain signal to determine, for example, the energy per low frequency domain subband and / or the peak-to-bottom energy ratio of each low frequency domain subband.

본 발명의 일 실시예에서 HFR 대역 리플리컨트 선택기(305)는 고주파 스펙트럼 값들의 허용가능한 복제본들을 형성하기 위해 전치될 (transposed) 수 있는 저주파 스펙트럼 값들의 선택을 수행한다. WO 2007/052088에 상세히 개시된 것 같은 방법에 사용될 대역들의 수와 폭은 HFR 대역 리플리컨트 선택기(305)에서 정해지거나 고정될 수 있다.In one embodiment of the present invention, HFR band replicant selector 305 performs selection of low frequency spectral values that may be transposed to form acceptable copies of high frequency spectral values. The number and width of bands to be used in the method as described in detail in WO 2007/052088 can be fixed or fixed in the HFR band replicant selector 305.

관련된 LFR 스펙트럼 값들의 선택이 도 6의 611 단계에 보인다.The selection of relevant LFR spectral values is shown in step 611 of FIG.

HFR 대역 리플리컨트 스케일러(307)는 또한 선택된 저주파 스펙트럼 값들을 수신하고, 각각의 고주파 영역 주파수 서브 대역과 선택된 저주파 스펙트럼 값들 사이의 차들을 줄이기 위해 그 값들의 스케일링이 이뤄질 수 있는지를 판단한다. The HFR band replicant scaler 307 also receives the selected low frequency spectral values and determines whether scaling of those values can be made to reduce the differences between each high frequency region frequency subband and the selected low frequency spectral values.

HFR 대역 리플리컨트 스케일러(307)는 본 발명의 어떤 실시예들에서 디코더로 보내지기 위해 필요한 비트 수를 줄이기 위해 스케일링 팩터들의 양자화 같은 부호화를 수행할 수 있다. 선택된 LFR 스펙트럼 값들의 스케일링 값들을 얻기 위해 사용되는 스케일링 팩터들의 표시가 멀티플렉서(311)로 전달된다. 또한 선택된 LFR 스펙트럼 값들의 스케일링된 값들의 복제본이 사인파 주입 선택/인코딩 기기(309)로 보내진다.The HFR band replicant scaler 307 may perform encoding such as quantization of scaling factors to reduce the number of bits needed to be sent to the decoder in some embodiments of the present invention. An indication of the scaling factors used to obtain scaling values of the selected LFR spectral values is passed to the multiplexer 311. A copy of the scaled values of the selected LFR spectral values is also sent to the sinusoidal injection selection / encoding device 309.

리플리컨트 스케일링이 도 6의 613 단계에 보인다.Replicant scaling is shown in step 613 of FIG.

사인파 주입 및 코더에 의해 수행되는 사인파 주입 및 코딩의 개념은, 사인파들을 추가함으로써 LFR 신호 성분들을 이용해 HFR의 부호화 충실도를 개선하는 것이다. 적어도 한 사인파의 추가가 부호화의 정밀도를 높일 것이다. The concept of sine wave injection and coding performed by sine wave injection and coder is to improve the coding fidelity of HFR using LFR signal components by adding sine waves. The addition of at least one sine wave will increase the precision of the encoding.

예를 들어,

과

가 각각 현재 부호화된 고주파 영역 스펙트럼 및 오리지널 고주파 영역 스펙트럼을 나타낼 때, 사인파 주입 및 코더(309)는 식 3에서 얻어진 스펙트럼 인덱스 k₁에서 최초의 사인파를 추가할 수 있다. E.g,

and

When respectively represent the currently encoded high frequency domain spectrum and the original high frequency domain spectrum, the sinusoidal injection and coder 309 can add the first sinusoid at the spectral index k ₁ obtained in equation (3).

즉, 그 사인파는 오리지널 고주파 영역 스펙트럼 값과 부호화된 고주파 영역 스펙트럼 값 사이의 최대 차를 가진 인덱스에서 삽입될 수 있다. That is, the sine wave can be inserted at an index with the maximum difference between the original high frequency region spectral value and the encoded high frequency region spectral value.

또, 사인파 주입 및 코더(309)는 식 4에 따라 삽입된 사인파의 진폭을 판단할 수 있다. In addition, the sine wave injection and coder 309 can determine the amplitude of the inserted sine wave according to Equation 4.

사인파 주입 및 코더(309)는 이제 식 5를 이용해 부호화된 고주파 영역 스펙트럼의 업데이트 결과를 도출한다.Sine wave injection and coder 309 now derive the update result of the encoded high frequency region spectrum using Equation 5.

사인파 주입 및 코더(309)는 이제 사인파의 선택 및 스케일링 동작들과 부호화된 고주파 영역의 업데이트 동작을 반복해 원하는 수의 사인파들이 추가될 때까지 추가 사인파들을 추가할 수 있다. 본 발명의 바람직한 실시예에서 바람직한 사인파의 수는 넷이다.The sine wave injection and coder 309 can now repeat the selection and scaling operations of the sine wave and the update operation of the encoded high frequency region to add additional sine waves until the desired number of sine waves are added. In a preferred embodiment of the present invention, the preferred number of sine waves is four.

본 발명의 어떤 실시예들에서 상기 동작들은, 사인파 주입 및 코더(309)가 오리지널 고주파 영역 신호와 부호화된 고주파 영역 신호 사이의 전반적인 오차가 코딩 에러 문턱치 밑으로 감소 될 때까지 반복된다. In some embodiments of the invention, the operations are repeated until the sinusoidal injection and coder 309 reduce the overall error between the original high frequency region signal and the encoded high frequency region signal below the coding error threshold.

사인파들을 선택하고 스케일링했을 때 사인파 주입 및 코더(309)는 이제 사인파들의 표시가 비트 효율적 방식에 따라 디코더로 전달될 수 있도록, 선택된 사인파들을 코딩하는 동작을 수행한다. When selecting and scaling sine waves, sine wave injection and coder 309 now perform the operation of coding the selected sine waves so that an indication of the sine waves can be delivered to the decoder in a bit efficient manner.

그에 따라 사인파 주입 및 코더(309)는 선택된 사인파들의 진폭 Ai를 양자화하고 그 양자화된 증폭 값들

을 멀티플렉서로 제출한다. The sine wave injection and coder 309 thus quantizes the amplitudes Ai of the selected sine waves and their quantized amplification values.

Submit to the multiplexer.

사인파 주입 및 코더(309)는 또한 선택된 사인파나 사인파들의 위치 및/또는 위치들을 부호화할 수 있다.Sine wave injection and coder 309 may also encode the position and / or positions of the selected sine wave or sine waves.

본 발명의 제1실시예에서, 선택된 사인파의 위치와 부호 (sign)가 양자화된다. 그러나, 위치와 부호의 양자화가 최적이지 못했다는 것을 알게 되었다.In the first embodiment of the present invention, the position and sign of the selected sine wave are quantized. However, it was found that quantization of position and sign was not optimal.

도 8과 관련해, 정현파 주입 및 코더(309)에서 수행된 본 발명의 실시예들에 따른 위치 및 부호 코딩 동작의 결과가 보여진다.With reference to FIG. 8, the results of position and sign coding operations in accordance with embodiments of the present invention performed at sinusoidal injection and coder 309 are shown.

도 8(a)는 MDCT 계수 값들(801)에 의해 표현된 7000Hz부터 7800Hz까지의 통상적 고주파 영역 서브 대역의 스펙트럼 예를 보인 것이다. 8 (a) shows an example of the spectrum of a typical high frequency region subband from 7000 Hz to 7800 Hz represented by MDCT coefficient values 801.

도 8(b)는 선택된 사인파가 입력될 수 있는 가능한 위치들이 인덱스 값을 기준으로 보여지는 경우의 예를 보인다. 32 개의 가능한 인덱스 위치들은 거기에 자리한 0개, 1개 혹은 그 이상의 사인파들을 포함할 수 있다. 8 (b) shows an example in which possible positions where the selected sine wave can be input are shown based on an index value. 32 possible index positions may include zero, one or more sine waves therein.

도 8(c)는 32 개의 가능한 인덱스 위치들이 적어도 두 개의 트랙들로 나눠지는 본 발명의 일실시예를 보인다. 트랙들은 서로 엇갈려 놓여서 (interlaced), 도 8(c)에 도시된 것 같은 두 트랙들을 가진 경우 각 트랙의 각 인덱스는 다른 트랙의 두 인덱스들 사이에 자리하도록 된다. 둘을 넘는 트랙들을 가진 실시예들에서 각각의 인덱스는 다른 트랙들 각각과 한 인덱스 간격으로 떨어져 있다. 예를 들어, 도 8(c)에서 32 개의 가능한 인덱스 위치들은 트랙 1(803)과 트랙 2(806)로 나눠진다.8 (c) shows an embodiment of the invention in which 32 possible index positions are divided into at least two tracks. The tracks are interlaced, so that when there are two tracks as shown in Fig. 8 (c), each index of each track is placed between two indexes of the other track. In embodiments with more than two tracks, each index is spaced one index apart from each of the other tracks. For example, in FIG. 8C, thirty-two possible index positions are divided into track 1 803 and track 2 806.

다른 실시예들은 엇갈려 놓이는 둘을 초과하는 트랙들을 가질 수 있다. 예를 들어, 세 개의 트랙들이 엇가려 놓이는 경우 그 위치는, pos₁(n-1), pos₂(n-1), pos₃(n-1), pos₁(n), pos₂(n), pos₃(n), pos₁(n+1), pos₂(n+1), pos₃(n+1)이 될 수 있고, 여기서 pos_k(n)는 k 번째 트랙의 n 번째 위치를 말한다. Other embodiments may have more than two tracks staggered. For example, if three tracks are staggered, the position is pos ₁ (n-1), pos ₂ (n-1), pos ₃ (n-1), pos ₁ (n), pos ₂ (n ), pos ₃ (n), pos ₁ (n + 1), pos ₂ (n + 1), pos ₃ (n + 1), where pos _k (n) is the nth position of the kth track Say.

또 다른 실시예들은 트랙들을 영역들 안에 정렬하여, 각각 총 N개의 위치들을 가진 2 개의 트랙들에 있어서 트랙들이 pos₁(1), pos₁(2),...,pos₁(N), pos₂(1), pos₂(2),...,pos₃(N) 위치들로 정렬될 수 있다. Still other embodiments align the tracks within the regions so that in two tracks each having a total of N positions, the tracks are pos ₁ (1), pos ₁ (2), ..., pos ₁ (N), pos ₂ (1), pos ₂ (2), ..., pos ₃ (N) can be aligned.

본 발명의 또 다른 실시예들에서 트랙들은 서브 대역뿐 아니라 전체 주파수 영역을 커버하도록 구성될 수 있다. In still other embodiments of the invention the tracks can be configured to cover the entire frequency range as well as the sub band.

사인파 주입 및 코더(309)는 이하의 예 및 도 9를 참조해 설명될 수 있는 바와 같이 이러한 트랙들 내 인덱스들의 분리를 이용해 위치 부호화를 개선시킨다. Sine wave injection and coder 309 improves position coding using the separation of indices in these tracks, as can be explained below with reference to the example and FIG.

도 9(a)는 7000Hz에서 14000Hz까지의 고주파 영역 신호의 스펙트럼을 보인다. 도 9(b)는 단일 트랙 인덱스 방식에서 선택된 사인파들을 보이는데, 여기서는 비트 인코딩 한계치에 도달 전에 8 개의 사인파들이 부호화될 수 있다. 도 9(c)는 본 발명의 실시예에 따른 두 개의 트랙 인덱스 방식에서 선택된 사인파들을 보이며, 여기서는 비트 인코딩 한계치에 도달하기 전에 10 개의 사인파들이 부호화될 수 있다. 9 (a) shows the spectrum of the high frequency region signal from 7000 Hz to 14000 Hz. 9 (b) shows sine waves selected in a single track index scheme, where eight sine waves can be encoded before reaching the bit encoding limit. 9 (c) shows sine waves selected in the two track index scheme according to an embodiment of the present invention, in which ten sine waves may be encoded before reaching the bit encoding limit.

HFR 코딩 비트 할당은 보통 본 발명의 실시예들에 있어서 4 kbits/second (또는 프레임 당 80 개의 비트들)이다 (프레임당 약 20 내지 25 개의 비트들이 MDCT 값들이나 사인파 진폭치들을 양자화하는데 사용될 수 있음).HFR coding bit allocation is usually 4 kbits / second (or 80 bits per frame) in embodiments of the present invention (about 20 to 25 bits per frame can be used to quantize MDCT values or sine wave amplitude values. ).

각 서브 대역에 대한 비트 할당이 식 6과 관련해 기술된다:Bit allocation for each subband is described in relation to equation 6:

여기서 N_sin은 선택된 사인파들의 개수이고 B_ind 및 B_sign은 각각 위치 (인덱싱) 및 부호 정보를 위해 필요로 하는 비트 개수이다.Where N _sin is the number of sine waves selected and B _ind and B _sign are the number of bits needed for position (indexing) and sign information, respectively.

도 10(b)와 10(c)에 도시된 예에서, 네 개의 서브 대역 길이들은 각각 64, 64, 64 그리고 32이다. In the example shown in Figs. 10 (b) and 10 (c), the four subband lengths are 64, 64, 64 and 32, respectively.

사인파 주입 및 코더(309)는 도 9(b)에 도시된 실시예에 따라 서브 대역마다 사인파 당 각각 6, 6, 6 및 5의 비트 개수를 할당한다. 이 비트 개수가 각각의 인덱스를 고유하게 규정하며 그에 따라 서브 대역 내 각각의 사인파를 결정한다. 사인파 주입 및 코더(309)는 이제 사인파의 부호, 즉 사인파가 동상인지 180도 벗어난 상인지 여부를 규정할 추가 비트를 할당할 수 있다. 그에 따라 프레임의 비트 레이트가 식 7과 같이 주어진다:The sine wave injection and coder 309 allocates the number of bits of 6, 6, 6 and 5 per sine wave for each subband according to the embodiment shown in FIG. 9 (b). This number of bits uniquely defines each index and thus determines each sine wave in the subband. The sinusoidal injection and coder 309 can now assign additional bits to define the sign of the sinusoid, that is, whether the sinusoid is in phase or 180 degrees out of phase. Thus, the bit rate of the frame is given by:

여기서, N_sb _,I는 i 번째 서브 대역내 사인파들의 개수이다. 도 9(b)에서 알 수 있는 바와 같이, N_sb _,1=3, N_sb _,2=3, N_sb _,3=1, N_sb _,4=1이므로, 8 개의 사인파들에 대해 부호화에 필요로 되는 비트들은 55 bits/frame이다. Here, N _sb _{, I} is the number of sine waves in the i-th subband. As can be seen in Fig. 9 (b), since N _sb _{, 1} = 3, N _sb _{, 2} = 3, N _sb _{, 3} = 1, N _sb _{, 4} = 1, it is necessary for encoding for 8 sine waves Bits become 55 bits / frame.

서브 대역 당 2 개이 트랙들을 이용하는 개선된 부호화 방법에서 사인파 주입 및 코더(309)는 한 서브 대역 내 각각의 사인파에 대해 보다 적은 잠정 개별 위치들로 인해, 그리고 각 트랙 위의 개별 사인파들의 배열시 그 중복으로 인해, 서브 대역마다 사인파별로 사용되는 비트 개수를 줄이게 된다. In an improved coding method that uses two tracks per subband, sinusoidal injection and coder 309 cause fewer intermittent discrete positions for each sine wave in one subband, and at the time of arrangement of the individual sine waves on each track. Due to redundancy, the number of bits used for each sine wave in each subband is reduced.

사인파들은 각각의 서브 대역 및 트랙 안에서 선택되어, 디코더가 정확한 위치 인덱스를 식별할 수 있도록 알려진 순서에 따라 부호화된다.Sine waves are selected in each subband and track and coded in a known order so that the decoder can identify the correct position index.

비트 절약은, 트랙 상의 사인파들을 선택 및 전송하는 순서와 무관하다는 사실에 기초한다. 단일 트랙 상에서 사인파 위치가 P 및 R (그리고 본 발명의 실시예들에서 부호들은 반대가 되도록 지정될 것이다)인지, R 및 P 인지 (본 발명의 실시예들에서 그 부호들은 동일한 것으로 지정될 것이다) 여부는 문제가 되지 않는다.Bit saving is based on the fact that it is independent of the order of selecting and transmitting sine waves on the track. Whether the sine wave position on a single track is P and R (and in the embodiments of the invention the signs will be specified to be opposite) or R and P (in the embodiments of the invention the signs will be designated the same) Whether or not it matters.

서브 대역 당 2 개의 트랙들을 사용하는 개선된 부호화 방법에서 사인파 주입 및 코더(309)는, 한 서브 대역 내 각 사인파에 있어 보다 적은 가능한 개별 위치들로 인해, 그리고 각 트랙 상에서 개별 사인파들의 배치 시의 중복으로 인해, 서브 대역마다 사인파당 사용되는 비트들의 개수를 줄인다.In an improved coding method that uses two tracks per subband, sinusoidal injection and coder 309 may be useful because of fewer possible individual positions for each sine wave in one subband, and at the time of placement of the individual sine waves on each track. Due to redundancy, the number of bits used per sine wave per subband is reduced.

도 9(c)에서 알 수 있듯이, 제1 및 제2트랙 둘 모두에서 최초의 두 서브 대역들에 대해 2 개의 사인파들을 부호화하는 것이 가능하다. 서브 대역들 3 및 4는 첫 번째 방법과 동일한 수의 사인파들을 가진다. 서브 대역들 1 및 2에서의 각 트랙 (각각 2 개의 사인파들을 포함)의 비트 레이트는 (5+1)+(5+0)이다. 서브 대역 3에 대해 비트 요건은 (6+1)이고 서브 대역 4에 대한 비트 요건은 (5+1)이다. 따라서 10 개의 사인파들에 필요한 총 비트 레이트는 프레임당 57 비트이다. 따라서 사인파 주입 및 코더(309)는 개선된 방식을 통해 프레임당 단 두 개의 비트들만을 써서 두 개의 추가 사인파들을 추가할 수 있다. As can be seen in FIG. 9 (c), it is possible to code two sinusoids for the first two subbands in both the first and second tracks. Subbands 3 and 4 have the same number of sine waves as the first method. The bit rate of each track in subbands 1 and 2 (including two sine waves each) is (5 + 1) + (5 + 0). The bit requirement for subband 3 is (6 + 1) and the bit requirement for subband 4 is (5 + 1). Thus, the total bit rate required for the 10 sine waves is 57 bits per frame. Thus, the sinusoidal injection and coder 309 can add two additional sinusoids using only two bits per frame in an improved manner.

첫 번째 및 두 번째 방법들에서의 사인파당 비트 레이트는 이 예에서 각각 6.875 비트 및 5.7 비트이다.The bit rates per sine wave in the first and second methods are 6.875 bits and 5.7 bits in this example, respectively.

사인파 주입 및 코더(309)는 서브 대역 길이에 따라 서브 대역 내에서 사용될 트랙들의 수를 선택할 수 있다. 서브 대역 사이즈가 적응적인 경우 (즉, 프레임 별로 변경될 수 있는 경우), 선택된 길이는 성능 개선이 이뤄진 방법을 제공할 것이다. The sine wave injection and coder 309 may select the number of tracks to be used in the sub band according to the sub band length. If the subband size is adaptive (i.e., can be changed frame by frame), the selected length will provide a way to improve performance.

예를 들어 32라는 서브 대역 길이는 16짜리 두 트랙들로 쉽게 분할될 수 있다. 마찬가지로, 48이라는 길이도 16짜리 세 트랙들로 분할될 수 있다. 64의 길이는 32짜리 2 트랙들이나 16짜리 4 트랙들로 나눠질 수 있다. 그 선택은 가능한 비트 레이트에 따라 결정될 수 있다.For example, a subband length of 32 can be easily divided into two 16-track tracks. Similarly, the length 48 can be divided into three tracks of sixteen. The length of 64 can be divided into two 32 tracks or four 16 tracks. The selection can be determined according to the possible bit rate.

사인파 주입 및 코더(309)는 연속적인 사인파들의 주입을 허용하는 트랙의 구조를 선택할 수 있고 바람직하게는 한 개를 초과하는 사인파가 각 트랙 상에 배치될 수 있다. The sine wave injection and coder 309 can select the structure of the track that allows for the injection of successive sine waves and preferably more than one sine wave can be placed on each track.

따라서, 가령 각 트랙으로부터 두 개의 사인파들이 선택되는 본 발명의 실시예들에서, 트랙들의 배치는, 가능한 사인파 위치들 P 및 P+1 (이것은 인지적으로 중요함)이 둘 모두 선택될 수 있도록 상이한 트랙들 안에 있게 선택될 수 있다.Thus, in embodiments of the invention where, for example, two sine waves are selected from each track, the arrangement of the tracks is different so that the possible sine wave positions P and P + 1 (which are cognitively important) can be selected. It can be selected to be in tracks.

주파수 서브 대역 길이 (가변적인 경우)는 고주파 영역의 전체 에너지가 프레임별로 그다지 변동되지 않게 선택되어야 한다.The frequency subband length (if variable) should be chosen so that the total energy in the high frequency range is not very variable from frame to frame.

따라서 트랙 인덱스들로 환산하여 삽입된 사인파들의 위치 부호화는 위에서 알 수 있는 바와 같이 어떤 주입된 사인파들을 가리키는데 필요한 코딩 레이트를 개선시킨다. Thus, the position coding of the inserted sine waves in terms of track indices improves the coding rate required to indicate some injected sine waves, as can be seen above.

본 발명의 다른 실시예들에서 사인파 주입 및 코더(309)는 주입된 사인파들의 위치들에 대한 부호화를 더 개선시킬 것이다.In other embodiments of the present invention, sinusoidal injection and coder 309 will further improve the encoding for the positions of the injected sinusoids.

본 발명의 어떤 실시예들에서, 사인파 주입 및 코더(309)는 인지적으로 가장 중요한 사인파들의 위치들 및 진폭들을 결정한 후에 그 사인파들의 부분집합 간 위치의 상대적 차이를 분석한다. 이러한 상대적 위치들은 여기서, 사인파들의 배열이 소수의 비트들 만을 사용해 부호화될 수 있는지를 판단하는데 사용된다. 사인파들의 배열에서 검출된 패턴이 없으면, 앞서 기술한 사인파들의 위치 부호화 방식들 중 하나가, 선택된 사인파들의 위치 부호화에 사용될 수 있다.In some embodiments of the invention, the sinusoidal injection and coder 309 determine the positions and amplitudes of the cognitive most important sinusoids and then analyze the relative difference in the position between the subsets of those sinusoids. These relative positions are used here to determine if the arrangement of sine waves can be encoded using only a few bits. If there is no pattern detected in the arrangement of sine waves, one of the above-described position coding methods of sine waves may be used for position coding of selected sine waves.

앞서 기술되었다시피, 부호화된 고주파 영역은 일련의 주파수 서브 대역들로 나눠질 수 있다. 그런 다음 각각의 주파수 서브 대역이 서치되어, 각각의 주파수 서브 대역 안에서 선택된 사인파들이 삽입될 위치들을 결정하도록 할 수 있다. 선택된 사인파들이, 오리지널 고주파 영역 신호와 비교할 때 부호화된 고주파 영역의 정확도를 향상시킬 것이다.As described above, the encoded high frequency region may be divided into a series of frequency subbands. Each frequency subband may then be searched to determine the positions at which the selected sine waves are to be inserted within each frequency subband. The selected sinusoids will improve the accuracy of the encoded high frequency region when compared to the original high frequency region signal.

본 발명의 제1실시예에서, 주파수 서브 대역들의 개수는 6으로 나눠질 수 있다. 본 발명의 다른 실시예들에서 서브 대역들의 개수는 앞서 기술된 바와 같이 가변적일 수 있다. In the first embodiment of the present invention, the number of frequency subbands may be divided by six. In other embodiments of the present invention, the number of subbands may vary as described above.

서브 대역들 각각에 대해 사인파 주입 및 코더(309)는 각 서브 대역 안에서 선택된 사인파들과 그들의 위치를 비교하여 어느 것이 구조상 시작 포인트로 간주될 것인가를 판단한다. 예를 들어 본 발명의 일 실시예에서 사인파 주입 및 코더(309)는 시작 포인트 사인파로서 최저 주파수를 가진 선택된 사인파를 택한다. 본 발명의 다른 실시예들에서 선택된 시작 포인트 사인파는 서브 대역 안에서 정중앙의 (median) 사인파나 더 높은 주파수의 사인파가 된다. For each of the subbands, the sinusoidal injection and coder 309 compares the selected sinusoids and their position within each subband to determine which one is to be considered as a structural start point. For example, in one embodiment of the present invention, sine wave injection and coder 309 selects the selected sine wave with the lowest frequency as the starting point sine wave. In other embodiments of the invention, the selected starting point sine wave is a median sine wave or a higher frequency sine wave in the subband.

일단 시작 포인트 사인파가 선택되면, 그 서브 대역 내에서 시작 포인트 위치와 다른 선택된 사인파 위치들 사이의 차가 검증된다. 서브 대역 안에서 시작 포인트 위치 및 선택된 나머지 사인파들과의 어떤 관계가 이제 코딩될 것이다.Once the starting point sine wave is selected, the difference between the starting point position and other selected sine wave positions within that subband is verified. Any relationship with the starting point location and the remaining selected sine waves in the sub band will now be coded.

예를 들어 서브 대역 안에서 제1사인파가 인덱스 5에 위치하고, 두 개의 다른 사인파들이 인덱스 위치 12와 19에 위치하는 경우, 사인파 주입 및 코더(309)는 그 사인파들의 위치를 절대 인덱스 5로서 부호화하고 그런 다음 상대적인 인덱스 7과 거기서 다시 상대 인덱스 7로서 부호화한다. 본 발명의 다른 실시예들에서 사인파 주입 및 코더(309)는 구조(3) 내 절대 인덱스(5), 상대 인덱스(7) 그리고 사인파들의 총 수를 부호화한다. For example, if a first sine wave is located at index 5 and two different sine waves at index positions 12 and 19 in the subband, sine wave injection and coder 309 encode the positions of the sine waves as absolute index 5 and such It is then coded as relative index 7 and there again as relative index 7. In other embodiments of the present invention, the sinusoidal injection and coder 309 encodes the absolute index 5, the relative index 7 and the total number of sinusoids in structure 3.

또한 위에서 제시된 예는, 주파수 서브 대역 당 선택된 사인파들의 수가 늘어날수록 더 효율적일 것이다. 위에서 보인 절대, 상대, 상대 코딩 실시예에 대한 이러한 효율성은 사인파들 간 평균 거리가 더 많은 사인파들이 추가될 수록 감소되고 그에 따라 사인파들 사이의 상대적 거리를 부호화하는데 필요한 평균 비트 수가 감소되어 사인파당 표시 비트들의 필요 개수를 줄일 수 있기 때문이다. Also, the example presented above will be more efficient as the number of selected sine waves per frequency subband increases. This efficiency for the absolute, relative, and relative coding embodiments shown above is reduced as the average distance between sinusoids increases as more sinusoids are added, thereby reducing the average number of bits needed to encode the relative distance between sinusoids, resulting in an indication per sine wave. This is because the required number of bits can be reduced.

절대, 상대, 총 코딩 실시예에서도 마찬가지로, 선택된 사인파들의 개수가 늘어날수록 사인파당 평균 비트 수가 줄어드는데 이는 각각의 추가 (extra) 사인파는 총 카운트의 증가만을 요하기 때문이다. In the absolute, relative, and total coding embodiments as well, as the number of selected sine waves increases, the average number of bits per sine wave decreases because each extra sine wave only requires an increase in the total count.

총 사인파들의 수가 한정되어 있을 때 선택된 사인파들을 서치하여 상대적 차이를 판단하기 위해 사인파 주입 및 코더(309)가 필요로 될 것이지만, 이러한 복잡도의 증가는 그다지 부담스러운 것이 못된다. Sine wave injection and coder 309 will be required to search the selected sine waves to determine the relative difference when the total number of sine waves is limited, but this increase in complexity is not very burdensome.

본 발명의 다른 실시예들에서, 사인파 주입 및 코더(309)는 시작 포인트 사인파를 이용하고 서브 대역 안에서 그 시작 포인트와 관련된 사인파들을 서치하여, 소정 후보 구조와 매치하거나 가장 근사하게 매치하는 사인파 구조를 판단한다.In other embodiments of the invention, the sinusoidal injection and coder 309 uses a starting point sine wave and searches for sine waves associated with that starting point in the subband to match a sine wave structure that matches or most closely matches a given candidate structure. To judge.

본 발명의 실시예들에 따르면 사인파 구조를 결정하는데 사용되는 기준은 선택가능하거나 가변적일 수 있다. 예를 들어, 사인파 주입 및 코더(309)는 일 실시예에서 단순히 가장 많은 수의 매치하는 사인파들을 가지거나 후보 사인파 매칭의 중요도를 가진 구조 (가령, 한 구조가 '매치하는' N 개의 사인파들을 가지는 한편 다른 한 구조는 '매치하는' N-1개의 후보를 가지는 경우, N-1 후보가 선택될 수 있는데 이는 그 후보 구조가 인지적으로 중요한, 선택된 사인파들에 보다 정확히 매치하기 때문이다)를 선택할 것이다. According to embodiments of the present invention, the criteria used to determine the sinusoidal structure may be selectable or variable. For example, sine wave injection and coder 309 may in one embodiment simply have the largest number of matching sine waves or a structure having the importance of candidate sine wave matching (e.g., having N sine waves 'matched' by one structure). On the other hand, if the other structure has 'matching' N-1 candidates, then the N-1 candidate can be selected because the candidate structure matches the selected sine waves more cognitively important). will be.

또한, 사인파 주입 및 코더(309)는 사인파들 각각의 부호 정보를 포함하고 상술한 바와 같이 사인파 진폭들을 (가령, 진폭을 나타내기 위해 사용되는 비트 수를 줄이기 위한 벡터 양자화를 이용하여) 부호화할 수 있다.In addition, the sinusoidal injection and coder 309 may include sign information of each of the sinusoids and encode the sinusoidal amplitudes (e.g., using vector quantization to reduce the number of bits used to represent the amplitude) as described above. have.

본 발명의 일부 실시예들에서, 사인파 주입 및 코더(309)는 구조들이 같은 수의 '매치하는' 사인파들을 가지는 경우, 고주파 영역의 낮은 주파수대들에서 더 많이 '매치하는' 사인파들을 가지는 매치를 선택할 수 있다. In some embodiments of the present invention, the sinusoidal injection and coder 309 selects a match with more 'matching' sine waves in the lower frequencies of the high frequency region when the structures have the same number of 'matching' sine waves. Can be.

본 발명의 다른 실시예들에서, 사인파 주입 및 코더(309)는 시작 포인트 사인파의 후보들 및 관련 인덱스를 선택한 후, 소정 사인파 위치 템플릿을 이용하는데 그로부터 사인파 위치/인덱스들로부터의 편차가 검출된다. 검출된 편차들은 본 발명의 일 실시예에서, 작은 위치 편차 코드북이라고도 알려진 소정 편차들의 룩업 테이블을 서치하고 그런 다음 그 편차로부터 관련된 코드를 출력함으로써 부호화될 수 있다. In other embodiments of the present invention, the sinusoidal injection and coder 309 selects candidates and associated indexes of the starting point sinusoids, and then uses a predetermined sinusoidal position template from which deviations from sinusoidal positions / indexes are detected. Detected deviations can be encoded in one embodiment of the invention by searching a lookup table of certain deviations, also known as small position deviation codebooks, and then outputting the relevant code from the deviations.

이 실시예에서 사인파 주입 및 코더(309)가 잠정적 사인파들과 관련해 보다 큰 융통성을 가지지만, 편차들을 서치하는 것은 필요한 서칭 프로세싱을 증대시키게 된다. Although sinusoidal injection and coder 309 have greater flexibility in terms of potential sine waves in this embodiment, searching for deviations will increase the necessary search processing.

본 발명은 최적의 사인파들의 실제 위치들을 보다 정밀하게 가리킬 수 있는 결과를 도출하는 한편, 각각의 사인파와 관련된 비트 레이트 역시 증가시킨다. 따라서, 이러한 추가 실시예가 보다 낮은 비트 레이트들에 사용되는 데 있어 가장 효율적이라고는 할 수 없다. 또, 이 실시예는 훨씬 더 많은 프로세서 자원들을 사용할지도 모르는데 이는 구조 및 오차들이 서치되거나 부호화되어야 하기 때문이다.The present invention results in a more precise indication of the actual positions of the optimal sine waves, while also increasing the bit rate associated with each sine wave. Thus, this additional embodiment may not be the most efficient for use at lower bit rates. In addition, this embodiment may use much more processor resources because the structure and errors must be searched or coded.

앞서 기술된 실시예들과 관련된 다른 실시예들에서, 사인파 주입 및 코더(309)는 사인파 구조나 편차 및 부호화될 사인파 구조나 편차 사이에서 작은 정도의 오차를 감당할 수 있다. 즉, 구조 및 편차 위치들 모두의 서치 및 부호화의 속도를 높이기 위해, 그 구조로부터 한정된 구조들 및/또는 편차들의 부분 집합이 서치된다. 이 실시예는, 부호화의 속도와 사인파당 비트 레이트가 최적화되어야 하고 사인파의 구조 및/또는 편차의 오차가 허용가능하거나 용인될 수 있는 경우에 허용될 수 있다. In other embodiments related to the embodiments described above, the sinusoidal injection and coder 309 may tolerate a small degree of error between the sinusoidal structure or deviation and the sinusoidal structure or deviation to be encoded. That is, to speed up the search and encoding of both structure and deviation locations, a subset of structures and / or deviations defined from the structure are searched. This embodiment may be allowed if the speed of encoding and bit rate per sine wave should be optimized and if the error in the structure and / or deviation of the sine wave is acceptable or acceptable.

그러나, 그러한 실시예들은, 프레임에서 프레임으로 사인파 위치들의 연장된 쉬프트나 변동이 인지될 수 있는 정도의 오차를 만들 수 있다는 것을 고려해야 한다. However, it should be taken into account that such embodiments may produce an error such that an extended shift or variation of sinusoidal positions from frame to frame can be perceived.

상기 예들은 주파수 서브 대역 당 수행된다고 기술하고 있지만, 그 예들은 전체 고주파 영역 신호에 걸쳐 동시에 적용될 수도 있다. 따라서, 고정되거나 가변적인 구조에 대한 상관 코딩 (relational coding), 구조적 코딩 (structural coding), 및 작은 편차 코딩이 전체 고주파 영역 신호인 서브 대역에 대해 수행될 수 있다. Although the above examples are described as being performed per frequency subband, the examples may be applied simultaneously over the entire high frequency region signal. Thus, relational coding, structural coding, and small deviation coding for fixed or variable structures can be performed for subbands that are entire high frequency domain signals.

그런 다음 사인파 표시 정보는 멀티플렉서(311)로 전달되어 비트스트림 출력에 포함될 수 있다. Sine wave indication information may then be passed to the multiplexer 311 and included in the bitstream output.

사인파들의 선택 및 부호화 동작이 도 6의 615 단계에 도시된다.The operation of selecting and encoding sine waves is shown in step 615 of FIG.

비트스트림 포매터(234)가 저주파 코더(231) 출력, 고주파 영역 프로세서(232) 출력을 수신하고, 그 비트스트림을 포맷해 비트스트림 출력을 생성한다. 본 발명의 일부 실시예들에서 비트스트림 포매터(234)는 수신된 입력들을 인터리브 (interleave) 할 수 있고, 비트스트림 출력(112)에 삽입될 에러 검출 및 에러 정정 코드들을 생성할 수 있다.The bitstream formatter 234 receives the low frequency coder 231 output and the high frequency region processor 232 output, and formats the bitstream to produce a bitstream output. In some embodiments of the invention, the bitstream formatter 234 may interleave the received inputs and generate error detection and error correction codes to be inserted into the bitstream output 112.

HFR 코더(232) 및 LFR 코더(231) 정보를 출력 비트스트림에 멀티플렉싱하는 단계가 도 6의 617 단계에 보여진다.Multiplexing the HFR coder 232 and LFR coder 231 information into the output bitstream is shown in step 617 of FIG.

본 발명의 이해를 더 돕기 위해, 본 발명의 실시예들과 관련한 디코더(108)의 동작이 도 5에 개략적으로 도시된 디코더 및 도 7의 디코더 동작을 보인 흐름도와 관련해 보여진다. To further understand the present invention, the operation of the decoder 108 in connection with embodiments of the present invention is shown with respect to the flow diagram illustrating the decoder operation of FIG. 7 and the decoder shown schematically in FIG.

디코더는 부호화된 비트스트림(112)이 수신될 입력부(413)를 포함한다. 입력부(413)는 비트스트림 언팩커 (unpacker)(401)에 연결된다.The decoder includes an input 413 to receive the encoded bitstream 112. The input unit 413 is connected to a bitstream unpacker 401.

비트스트림 언패커는 부호화된 비트스트림(112)을 세 개의 독립된 비트스트림들로 다중화해제 (demultiplex)하거나, 분할하거나, 언팩 (unpack)한다. 저주파 부호화된 비트스트림은 저주파 디코더(403)로 보내지고, 스펙트럼 대역 복제 비트스트림은 고주파 재구성기(407) (고주파 영역 디코더라고도 알려짐)로 보내지며, 제어 데이터가 디코더 제어기(405)로 보내진다.The bitstream unpacker demultiplexes, splits or unpacks the coded bitstream 112 into three independent bitstreams. The low frequency encoded bitstream is sent to the low frequency decoder 403, the spectral band replica bitstream is sent to the high frequency reconstructor 407 (also known as the high frequency region decoder), and control data is sent to the decoder controller 405.

이러한 언팩 프로세스가 도 7의 701 단계에 보인다.This unpacking process is shown in step 701 of FIG.

저주파 영역 디코더(403)는 저주파 부호화된 데이터를 수신하고 저주파 영역 코더(231)에서 수행된 프로세스의 역 프로세스를 수행함으로써 합성 저주파 신호를 생성한다. 이 합성 저주파 신호는 고주파 영역 디코더(407) 및 재구성 디코더(409)로 전달된다.The low frequency region decoder 403 generates the synthesized low frequency signal by receiving the low frequency encoded data and performing an inverse process of the process performed by the low frequency region coder 231. This synthesized low frequency signal is passed to the high frequency region decoder 407 and the reconstruction decoder 409.

이러한 저주파 영역 디코딩 프로세스는 도 7의 707 단계에 보여진다.This low frequency region decoding process is shown in step 707 of FIG.

디코더 제어기(405)는 비트스트림 언패커(401)로부터 제어 정보를 수신한다. 본 발명과 관련해 디코더 제어기(405)는, HFR 대역 리플리컨트 선택 프로세서(305) 및 HFR 대역 리플리컨트 스케일링 프로세서(307)와 관련해 앞서 기술한 바와 같이 HFR 코딩 프로세스에서 스펙트럼 복제가 사용되었는지 여부와 관련한 정보를 수신한다. 이 방법을 이용해 HFR 영역을 재구성함에 있어 HFR 디코더를 구성하는데 필요로 되는 임의의 특정 정보는 이제 HFR 디코더로 보내지고, 그 방법은 이하에서 기술되는 것처럼 705 단계를 포함한다. The decoder controller 405 receives control information from the bitstream unpacker 401. In the context of the present invention, the decoder controller 405 receives information regarding whether spectral replication was used in the HFR coding process as described above with respect to the HFR band replicant selection processor 305 and the HFR band replicant scaling processor 307. do. Any specific information needed to configure the HFR decoder in reconstructing the HFR region using this method is now sent to the HFR decoder, and the method includes step 705 as described below.

또, 디코더 제어기(405)는 HFR 코더 및 HFR 사인파 주입 및 코더(309)에서 선택된 임의의 사인파 선택 및 주입 프로세스들과 관련해 비트스트림 언패커(401)로부터 제어 정보를 수신한다. The decoder controller 405 also receives control information from the bitstream unpacker 401 with respect to the HFR coder and any sine wave selection and injection processes selected at the coder 309.

HFR 디코더의 설정이 도 7의 703 단계에 보인다. The setting of the HFR decoder is shown in step 703 of FIG.

본 발명의 어떤 실시예들에서 디코더 제어기(405)는 고주파 디코더(407)의 일부일 수 있다. In some embodiments of the present invention, the decoder controller 405 may be part of the high frequency decoder 407.

HFR 디코더(407)는 가령 대역 선택 정보에 의해 표시된 대역들과 관련해 고주파 재구성 비트스트림에 의해 표시된 합성 저주파 신호로부터 저주파 성분들을 복제 및 스케일링함으로써 리플리컨트 HFR 재구성 동작을 수행할 수 있다. 이 동작은 디코더 제어기(405)에 의해 제공되는 정보에 의존해 수행된다. The HFR decoder 407 may perform a replicative HFR reconstruction operation by, for example, replicating and scaling the low frequency components from the synthesized low frequency signal indicated by the high frequency reconstruction bitstream with respect to the bands indicated by the band selection information. This operation is performed depending on the information provided by the decoder controller 405.

이 고주파 복제 구성 또는 고주파 재구성이 도 8의 705 단계에서 보인다.This high frequency replication configuration or high frequency reconstruction is seen at step 705 of FIG.

HFR 디코더(407)는 또 디코더 제어기(405)에 의해 제공된 정보에 의존해 HFR 재구성 동작의 정확도를 향상시키기 위한 사인파 선택 및 주입 동작을 수행할 수 있다. 따라서, 본 발명의 실시예에 따르면 디코더 제어기(405)는 HFR 디코더(407)를 제어해 아무 사인파들이나 추가하지 않고, 디코더 제어기(405)에 의해 표시된 비트스트림 포맷에 따른 사인파들을 추가하도록 할 수 있다. 그에 따라 비한정적 예들로서, 제공된 인덱스 및 트랙 정보, 사인파 정렬 구조, 사인파 배열의 상대적 간격, 및 고정되거나 가변되는 사인파들의 배열이나 구조에 따라 사인파들을 삽입하는 단계가 포함된다.The HFR decoder 407 may also perform sine wave selection and injection operations to improve the accuracy of the HFR reconstruction operation, depending on the information provided by the decoder controller 405. Accordingly, according to an embodiment of the present invention, the decoder controller 405 may control the HFR decoder 407 to add sine waves according to the bitstream format indicated by the decoder controller 405 without adding any sine waves. . Thus non-limiting examples include inserting sine waves in accordance with provided index and track information, sinusoidal alignment structure, relative spacing of sinusoidal arrangements, and arrangement or structure of fixed or varying sinusoids.

사인파 동작의 주입이 도 7의 709 단계에 보인다. Injection of sinusoidal motion is shown in step 709 of FIG.

재구성된 고주파 성분의 비트스트림은 재구성 디코더(409)로 전달된다.The bitstream of the reconstructed high frequency component is passed to a reconstruction decoder 409.

재구성 디코더(409)는 복호화된 저주파 비트스트림과 재구성된 고주파 비트스트림을 수신해, 오리지널 신호를 나타내는 비트스트림을 생성하도록 하고, 디코더 출력부(415)로 출력 오디오 신호(114)를 출력한다.The reconstruction decoder 409 receives the decoded low frequency bitstream and the reconstructed high frequency bitstream to generate a bitstream representing the original signal, and outputs an output audio signal 114 to the decoder output unit 415.

이러한 신호의 재구성이 도 8의 711 단계에서 보인다.This reconstruction of the signal is shown in step 711 of FIG.

상술한 본 발명의 실시예들은 수반되는 프로세스들의 이해를 돕기 위해, 독자적인 인코더(104) 및 디코더(108) 장치로 표현된 코덱을 개시하였다. 그러나, 그 장치, 구조 그리고 동작들이 하나로 된 인코더-디코더 장치/구조/동작으로서 구현될 수도 있다는 것을 예상할 수 있을 것이다. 또, 본 발명의 일부 실시예들에서 코더와 디코더는 공통되는 일부 혹은 전체 구성요소들을 공유할 수도 있다.The above-described embodiments of the present invention disclose a codec represented by a unique encoder 104 and decoder 108 device to aid in understanding the processes involved. However, it will be appreciated that the apparatus, structure and operations may be implemented as a single encoder-decoder device / structure / operation. In addition, in some embodiments of the present invention, the coder and the decoder may share some or all of the common components.

상기 예들은 전자 기기(10) 안에서의 코덱 내에서 동작하는 발명의 실시예들을 개시하고 있으나, 이하에 기술되는 바와 같이 본 발명이 임의의 가변 레이트/적응적 레이트 오디오 (또는 스피치) 코덱의 일부로서 구현될 수 있다는 것을 알 수 있을 것이다. 그에 따라, 이를테면 본 발명의 실시예들은 고정 혹은 유선 통신 경로들을 통한 오디오 코딩을 구현할 수 있는 오디오 코덱 안에서 구현될 수 있다.The above examples disclose embodiments of the invention operating within a codec within the electronic device 10, but the invention is described as part of any variable rate / adaptive rate audio (or speech) codec as described below. It will be appreciated that it may be implemented. Thus, for example, embodiments of the present invention may be implemented in an audio codec that may implement audio coding over fixed or wired communication paths.

따라서 사용자 기기는 상기 본 발명의 실시예들에 개시된 것 같은 오디오 코덱을 구비할 수 있다.Thus, the user device may have an audio codec as disclosed in the embodiments of the present invention.

사용자 기기라는 용어는 모바일 전화기들, 휴대형 데이터 프로세싱 기기들 또는 휴대형 웹 브라우저들 같은 임의의 알맞은 타입의 사용자 무선 기기를 포괄하도록 의도되었음을 알아야 한다. It is to be understood that the term user device is intended to cover any suitable type of user wireless device, such as mobile telephones, portable data processing devices, or portable web browsers.

또, 공공 육상 모바일 네트워크 (PLMN, public land mobile network)의 구성요소들 역시 상술한 것 같은 오디오 코덱들을 구비할 수 있다. In addition, components of a public land mobile network (PLMN) may also have audio codecs as described above.

일반적으로, 본 발명의 다양한 실시예들은 하드웨어나 특수 용도의 회로들, 소프트웨어, 로직, 혹은 이들의 임의의 결합형태 안에서 구현될 수 있다. 예를 들어 일부 양태들은 하드웨어 안에서 구현되고, 다른 양태들은 비한정적 예들로서 콘트롤러, 마이크로프로세서 또는 다른 컴퓨팅 기기에 의해 실행될 수 있는 펌웨어나 소프트웨어를 통해 구현될 수 있다. 본 발명의 다양한 양태들이 블록도들, 흐름도들로서, 혹은 어떤 다른 묘사적 표현을 이용해 도시되고 설명되었지만, 여기 개시된 이러한 블록들, 장치, 시스템들, 기술들 또는 방법들은 비한정적 예들로서 하드웨어, 소프트웨어, 펌웨어, 특수 용도의 회로들이나 로직, 범용 하드웨어나 콘트롤러 또는 컴퓨팅 기기들, 아니면 이들의 어떤 결합 형태들 안에서 구현될 수 있다는 것을 잘 알 수 있을 것이다. In general, the various embodiments of the invention may be implemented in hardware or special purpose circuits, software, logic, or any combination thereof. For example, some aspects may be implemented in hardware, and other aspects may be implemented through firmware or software that may be executed by a controller, microprocessor or other computing device as non-limiting examples. While various aspects of the present invention have been shown and described as block diagrams, flowcharts, or in any other descriptive representation, such blocks, apparatus, systems, techniques or methods disclosed herein are non-limiting examples of hardware, software, It will be appreciated that they may be implemented in firmware, special purpose circuits or logic, general purpose hardware or controllers or computing devices, or any combination thereof.

본 발명의 실시예들은 프로세서 엔티티 같은 모바일 기기의 데이터 프로세서에 의해서 실행될 수 있는 컴퓨터 소프트웨어나, 하드웨어, 혹은 소프트웨어 및 하드웨어가 결합 된 것에 의해 구현될 수 있다. 또, 이와 관련하여 도면들에서와 같은 로직 흐름의 어떤 블록들은 프로그램 단계들, 상호연결된 로직 회로들, 블록들 및 기능들, 혹은 프로그램 단계들과 로직 회로들 블록들 및 기능들이 결합된 것을 나타낼 수 있다.Embodiments of the invention may be implemented by computer software that may be executed by a data processor of a mobile device, such as a processor entity, or by hardware or a combination of software and hardware. Further, in this regard, certain blocks of the logic flow as in the figures may represent program steps, interconnected logic circuits, blocks and functions, or program steps and logic circuit blocks and functions combined. have.

메모리는 반도체 기반 메모리 장치들, 마그네틱 메모리 장치들 및 시스템들, 광학 메모리 장치들 및 시스템들, 고정형 메모리 및 탈부착형 메모리 같은 임의의 적절한 데이터 저장 기술을 이용해 구현될 수 있다. 데이터 프로세서들은 로컬 기술 환경에 적합한 어떠한 타입에나 해당할 수 있고, 비한정적 예들로서, 일반 범용 컴퓨터들, 특수 용도의 컴퓨터들, 마이크로프로세서들, DSP들 (digital signal processors) 및 멀티-코어 프로세서 아키텍처에 기반하는 프로세서들 가운데 하나 이상을 포함할 수 있다. The memory may be implemented using any suitable data storage technology such as semiconductor based memory devices, magnetic memory devices and systems, optical memory devices and systems, fixed memory and removable memory. Data processors may be of any type suitable for the local technology environment and include, but are not limited to, general purpose computers, special purpose computers, microprocessors, digital signal processors (DSPs), and multi-core processor architectures. It may include one or more of the based processors.

본 발명의 실시예들은 집적 회로 모듈들 같은 다양한 구성소자들 안에서 실시될 수 있다. 집적 회로 설계는 대체로 고도로 자동화된 프로세스이다. 종합적이고 강력한 소프트웨어 툴들이 사용되어 로직 레벨 디자인을 반도체 기판 위에서 에칭 및 형성될 준비가 된 반도체 회로 디자인으로 변환한다.Embodiments of the invention may be practiced in various components, such as integrated circuit modules. Integrated circuit design is usually a highly automated process. Comprehensive and powerful software tools are used to convert the logic level design into a semiconductor circuit design ready to be etched and formed on the semiconductor substrate.

캘리포니아주 마운틴 뷰의 Synopsys 사와 캘리포니아주 산호세의 Cadence Design에 의해 제공되는 것들 같은 프로그램들은 잘 설정된 디자인 규칙들 및 미리 저장된 디자인 모듈들의 라이브러리들을 사용해 반도체 칩 위에서 자동으로 도선들을 라우팅하고 소자들을 배치한다. 반도체 회로의 설계가 완료되었으면, 규격화된 전자 포맷 (가령, Opus, GDSII 등등)으로 된 그에 따른 디자인이 반도체 제조 시설이나 제조 "공장"으로 보내질 것이다. Programs such as those provided by Synopsys of Mountain View, CA, and Cadence Design, San Jose, CA, use well-established design rules and libraries of pre-stored design modules to automatically route wires and place devices on a semiconductor chip. Once the design of the semiconductor circuit has been completed, the corresponding design in a standardized electronic format (eg Opus, GDSII, etc.) will be sent to the semiconductor manufacturing facility or manufacturing “factory”.

상술한 설명은, 전형적이고 비한정적인 예들을 들어 본 발명의 전형적 실시예에 대한 충분하고도 정보가치가 있는 내용을 제공하였다. 그러나, 관련 분야의 기술자들이라면 상술한 설명이 첨부된 도면들과 부가된 청구항들과 함께 읽혀질 때 다양한 변형과 적응 버전들이 가능하다는 것을 명확히 알 수 있을 것이다. 본 발명의 가르침에 대한 그러한 모든 비슷한 변형 버전들 또한 부가된 청구항들에 정의된 본 발명의 범주 안에 포함될 것이다. The foregoing description, by way of example and by way of non-limiting example, has provided sufficient and informative content for an exemplary embodiment of the present invention. However, it will be apparent to those skilled in the relevant art that various modifications and adaptations are possible when the above description is read in conjunction with the accompanying drawings and the appended claims. All such similar variations of the teachings of the invention will also be included within the scope of the invention as defined in the appended claims.

Claims

삭제delete

오디오 신호의 저 주파수 영역을 부호화하고, 톤 선택 제어기가 오디오 신호가 톤임을 나타내면 단일 주파수 성분 주입 및 코딩 동작을 실행하고, 상기 톤 선택 제어기가 오디오 신호가 노멀임을 나타내면 대역 리플리컨트 선택, 대역 리플리컨트 스케일링 및 단일 주파수 성분 주입을 실행하여 오디오 신호의 고 주파수 영역을 부호화하는 인코더에 있어서, 상기 인코더는
적어도 두 개의 상기 단일 주파수 성분들을 선택하고;
상기 적어도 두 개의 상기 단일 주파수 성분들을 나타내도록 구성되고, 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격 (frequency separation)에 종속되도록 구성되는 지시자를 생성하도록 구성됨을 특징으로 하는 인코더.Encode a low frequency region of the audio signal, and the tone select controller performs a single frequency component injection and coding operation if the audio signal is a tone, and if the tone select controller indicates that the audio signal is normal, band replicant selection, band replicant scaling and An encoder for performing a single frequency component injection to encode a high frequency region of an audio signal, the encoder
Select at least two said single frequency components;
And generate an indicator configured to represent the at least two single frequency components, the indicator configured to be dependent on a frequency separation between the two single frequency components.

제31항에 있어서,
적어도 한 추가 단일 주파수 성분을 선택하도록 추가 구성될 수 있고, 상기 지시자는 상기 적어도 한 추가 단일 주파수 성분을 나타내도록 추가 구성되고, 상기 지시자는 상기 적어도 한 추가 단일 주파수 성분, 및 상기 적어도 두 개의 단일 주파수 성분들 중 하나와의 사이의 주파수 간격에 종속되도록 추가 구성됨을 특징으로 하는 인코더. 32. The method of claim 31,
May be further configured to select at least one additional single frequency component, wherein the indicator is further configured to indicate the at least one additional single frequency component, the indicator being at least one additional single frequency component, and the at least two single frequency And further configured to be dependent on the frequency spacing between one of the components.

제31항 또는 제32항에 있어서, 상기 지시자는 상기 적어도 두 개의 단일 주파수 성분들 중 하나의 주파수에 종속되도록 추가 구성됨을 특징으로 하는 인코더.33. The encoder according to claim 31 or 32, wherein the indicator is further configured to be dependent on the frequency of one of the at least two single frequency components.

제31항 또는 제32항에 있어서,
상기 두 개의 단일 주파수 성분들 사이의 주파수 간격을 결정하도록 추가 구성됨을 특징으로 하는 인코더. 33. The method according to claim 31 or 32,
And further determine a frequency spacing between the two single frequency components.

제34항에 있어서,
상기 결정된 두 개의 단일 주파수 성분들 사이의 주파수 간격에 대한 주파수 간격 값들의 리스트를 서치하고;
상기 결정된 두 개의 단일 주파수 성분들 사이의 주파수 간격에 보다 가깝게 매치하는 리스트 중 하나를 선택하도록 추가 구성되고,
상기 지시자는 주파수 간격 값들의 리스트 중 선택된 하나에 종속됨을 특징으로 하는 인코더. 35. The method of claim 34,
Search for a list of frequency spacing values for a frequency spacing between the determined two single frequency components;
Is further configured to select one of a list that more closely matches the frequency interval between the determined two single frequency components,
And the indicator is dependent on the selected one of the list of frequency interval values.

제35항에 있어서,
상기 주파수 간격 값들의 리스트 중 상기 선택된 하나 및 상기 결정된 주파수 간격 값 사이의 차를 판단하도록 추가 구성될 수 있고,
상기 지시자는 상기 차에 추가 종속됨을 특징으로 하는 인코더.36. The method of claim 35,
Further determine a difference between the selected one of the list of frequency interval values and the determined frequency interval value,
The indicator further dependent on the difference.

제36항에 있어서,
상기 주파수 간격 값들의 리스트 중 상기 선택된 하나 및 상기 결정된 주파수 간격 값 사이의 상기 판단된 차에 대한 차이 값들의 추가 리스트를 검색하고;
상기 차이 값들의 추가 리스트 중 상기 결정된 차이 값에 보다 가깝게 매치하는 하나를 선택하도록 추가 구성되고,
상기 지시자는 상기 차이 값들의 추가 리스트 중 선택된 하나에 종속됨을 특징으로 하는 인코더.37. The method of claim 36,
Retrieve a further list of difference values for the determined difference between the selected one of the list of frequency interval values and the determined frequency interval value;
Is further configured to select one of the additional lists of difference values that more closely matches the determined difference value,
And the indicator is dependent on the selected one of the additional list of difference values.

오디오 신호의 저 주파수 영역을 부호화하고, 톤 선택 제어기가 오디오 신호가 톤임을 나타내면 단일 주파수 성분 주입 및 코딩 동작을 실행하고, 상기 톤 선택 제어기가 오디오 신호가 노멀임을 나타내면 대역 리플리컨트 선택, 대역 리플리컨트 스케일링 및 단일 주파수 성분 주입을 실행하여 오디오 신호의 고 주파수 영역을 부호화하는 방법에 있어서,
적어도 두 개의 상기 단일 주파수 성분들을 선택하는 단계;
상기 적어도 두 개의 단일 주파수 성분들을 나타내도록 구성되고, 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격 (frequency separation)에 종속되도록 구성되는 지시자를 생성하는 단계를 포함함을 특징으로 하는 부호화 방법.Encode a low frequency region of the audio signal, and the tone select controller performs a single frequency component injection and coding operation if the audio signal is a tone, and if the tone select controller indicates that the audio signal is normal, band replicant selection, band replicant scaling and A method of encoding a high frequency region of an audio signal by performing a single frequency component injection,
Selecting at least two said single frequency components;
Generating an indicator configured to represent the at least two single frequency components, the indicator being configured to be dependent on a frequency separation between the two single frequency components.

제38항에 있어서,
적어도 한 추가 단일 주파수 성분을 선택하는 단계를 더 포함하고,
상기 지시자는 상기 적어도 한 추가 단일 주파수 성분을 나타내도록 추가 구성되고, 상기 지시자는 상기 적어도 한 추가 단일 주파수 성분, 및 상기 적어도 두 개의 단일 주파수 성분들 중 하나와의 사이의 주파수 간격에 종속되도록 추가 구성됨을 특징으로 하는 부호화 방법. The method of claim 38,
Further selecting at least one additional single frequency component,
The indicator is further configured to indicate the at least one additional single frequency component, and the indicator is further configured to be dependent on a frequency interval between the at least one additional single frequency component and one of the at least two single frequency components. Encoding method characterized in that.

제38항 또는 제39항에 있어서, 상기 지시자는 상기 적어도 두 개의 단일 주파수 성분들 중 하나의 주파수에 종속되도록 추가 구성됨을 특징으로 하는 부호화 방법.40. The method of claim 38 or 39, wherein the indicator is further configured to be dependent on one frequency of the at least two single frequency components.

제38항 또는 제39항에 있어서,
상기 두 개의 단일 주파수 성분들 사이의 주파수 간격을 결정하는 단계를 더 포함함을 특징으로 하는 부호화 방법. 40. The method of claim 38 or 39,
And determining a frequency interval between the two single frequency components.

제41항에 있어서,
상기 결정된 두 개의 단일 주파수 성분들 사이의 주파수 간격에 대한 주파수 간격 값들의 리스트를 서치하는 단계; 및
상기 결정된 두 개의 단일 주파수 성분들 사이의 주파수 간격에 보다 가깝게 매치하는 리스트 중 하나를 선택하는 단계를 더 포함하고,
상기 지시자는 주파수 간격 값들의 리스트 중 선택된 하나에 종속됨을 특징으로 하는 부호화 방법. 42. The method of claim 41,
Searching a list of frequency spacing values for a frequency spacing between the determined two single frequency components; And
Selecting one of a list that more closely matches a frequency interval between the determined two single frequency components,
And the indicator is dependent on a selected one of a list of frequency interval values.

제42항에 있어서,
상기 주파수 간격 값들의 리스트 중 상기 선택된 하나 및 상기 결정된 주파수 간격 값 사이의 차를 판단하는 단계를 더 포함하고,
상기 지시자는 상기 차에 추가 종속됨을 특징으로 하는 부호화 방법.43. The method of claim 42,
Determining a difference between the selected one of the list of frequency interval values and the determined frequency interval value,
And the indicator is further dependent on the difference.

제43항에 있어서,
상기 주파수 간격 값들의 리스트 중 상기 선택된 하나 및 상기 결정된 주파수 간격 값 사이의 상기 판단된 차에 대한 차이 값들의 추가 리스트를 검색하는 단계; 및
상기 차이 값들의 추가 리스트 중 상기 결정된 차이 값에 보다 가깝게 매치하는 하나를 선택하는 단계를 더 포함하고,
상기 지시자는 상기 차이 값들의 추가 리스트 중 선택된 하나에 종속됨을 특징으로 하는 부호화 방법. 44. The method of claim 43,
Retrieving an additional list of difference values for the determined difference between the selected one of the list of frequency interval values and the determined frequency interval value; And
Selecting one of the additional list of difference values that more closely matches the determined difference value,
And the indicator is dependent on a selected one of the additional list of difference values.

오디오 신호의 저 주파수 영역을 부호화하고, 톤 선택 제어기가 오디오 신호가 톤임을 나타내면 단일 주파수 성분 주입 및 코딩 동작을 실행하고, 상기 톤 선택 제어기가 오디오 신호가 노멀임을 나타내면 대역 리플리컨트 선택, 대역 리플리컨트 스케일링 및 단일 주파수 성분 주입을 실행하여 오디오 신호의 고 주파수 영역을 부호화한 오디오 신호를 복호화하기 위한 디코더에 있어서,
적어도 두 개의 상기 단일 주파수 성분들을 나타내고 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격을 나타내는 적어도 한 지시자를 수신하고;
상기 수신된 지시자에 종속된 상기 적어도 두 개의 단일 주파수 성분들을 삽입하도록 구성됨을 특징으로 하는 디코더. Encode a low frequency region of the audio signal, and the tone select controller performs a single frequency component injection and coding operation if the audio signal is a tone, and if the tone select controller indicates that the audio signal is normal, band replicant selection, band replicant scaling and A decoder for decoding an audio signal obtained by encoding a high frequency region of an audio signal by performing a single frequency component injection.
Receive at least one indicator indicating at least two said single frequency components and indicating a frequency spacing between said two single frequency components;
And insert the at least two single frequency components dependent on the received indicator.

제45항에 있어서, 상기 적어도 한 지시자는 적어도 한 추가 단일 주파수 성분을 나타내도록 추가 구성되고, 상기 지시자는 상기 적어도 한 추가 단일 주파수 성분 및 상기 적어도 두 개의 단일 주파수 성분들 중 하나와의 사이의 주파수 간격에 종속되도록 추가 구성되고, 상기 디코더는 상기 지시자에 종속되는 상기 적어도 한 추가 단일 주파수 성분을 삽입하도록 추가 구성됨을 특징으로 하는 디코더.46. The apparatus of claim 45, wherein the at least one indicator is further configured to indicate at least one additional single frequency component, the indicator being a frequency between the at least one additional single frequency component and one of the at least two single frequency components. Further configured to be dependent on an interval, wherein the decoder is further configured to insert the at least one additional single frequency component dependent on the indicator.

오디오 신호의 저 주파수 영역을 부호화하고, 톤 선택 제어기가 오디오 신호가 톤임을 나타내면 단일 주파수 성분 주입 및 코딩 동작을 실행하고, 상기 톤 선택 제어기가 오디오 신호가 노멀임을 나타내면 대역 리플리컨트 선택, 대역 리플리컨트 스케일링 및 단일 주파수 성분 주입을 실행하여 오디오 신호의 고 주파수 영역을 부호화한 오디오 신호를 복호화하기 위한 방법에서,
적어도 두 개의 상기 단일 주파수 성분들을 나타내고 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격을 나타내는 적어도 한 지시자를 수신하는 단계; 및
상기 수신된 지시자에 의존해 상기 적어도 두 개의 단일 주파수 성분들을 삽입하는 단계를 포함함을 특징으로 하는 복호화 방법. Encode a low frequency region of the audio signal, and the tone select controller performs a single frequency component injection and coding operation if the audio signal is a tone, and if the tone select controller indicates that the audio signal is normal, band replicant selection, band replicant scaling and In a method for decoding an audio signal in which a high frequency region of an audio signal is encoded by performing a single frequency component injection,
Receiving at least one indicator indicating at least two said single frequency components and indicating a frequency interval between said two single frequency components; And
Inserting the at least two single frequency components depending on the received indicator.

제47항에 있어서, 상기 적어도 한 지시자는 적어도 한 추가 단일 주파수 성분을 나타내도록 추가 구성되고, 상기 지시자는 상기 적어도 한 추가 단일 주파수 성분 및 상기 적어도 두 개의 단일 주파수 성분들 중 하나와의 사이의 주파수 간격에 종속되도록 추가 구성되고, 상기 방법은 상기 지시자에 의존해 상기 적어도 한 추가 단일 주파수 성분을 삽입하는 단계를 더 포함함을 특징으로 하는 복호화 방법. 48. The apparatus of claim 47, wherein the at least one indicator is further configured to indicate at least one additional single frequency component, wherein the indicator is a frequency between the at least one additional single frequency component and one of the at least two single frequency components. Further configured to be dependent on the interval, wherein the method further comprises inserting the at least one additional single frequency component in dependence on the indicator.

제31항 또는 제32항에 따른 인코더를 포함함을 특징으로 하는 이동 통신 장치.33. A mobile communication device comprising the encoder according to claim 31 or 32.

제45항 또는 제46항에 따른 디코더를 포함함을 특징으로 하는 이동 통신 장치.A mobile communication device comprising a decoder according to claim 45 or 46.

제31항 또는 제32항에 따른 인코더를 포함함을 특징으로 하는 전자 기기.An electronic device comprising the encoder according to claim 31.

제45항 또는 제46항에 따른 디코더를 포함함을 특징으로 하는 전자 기기.An electronic device comprising the decoder according to claim 45 or 46.

컴퓨터에 의해 실행되어 오디오 신호의 저 주파수 영역을 부호화하고, 톤 선택 제어기가 오디오 신호가 톤임을 나타내면 단일 주파수 성분 주입 및 코딩 동작을 실행하고, 상기 톤 선택 제어기가 오디오 신호가 노멀임을 나타내면 대역 리플리컨트 선택, 대역 리플리컨트 스케일링 및 단일 주파수 성분 주입을 실행하여 오디오 신호의 고 주파수 영역을 부호화하는 방법을 수행하도록 구성된 컴퓨터 프로그램 제품을 저장한 컴퓨터 판독가능 저장매체에 있어서, 상기 부호화 방법은,
적어도 두 개의 상기 단일 주파수 성분들을 선택하는 단계; 및
상기 적어도 두 개의 단일 주파수 성분들을 나타내도록 구성되고, 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격 (frequency separation)에 종속되도록 구성되는 지시자를 생성하는 단계를 포함함을 특징으로 하는 컴퓨터 판독가능 저장매체.Executed by a computer to encode the low frequency region of the audio signal, and the tone select controller performs a single frequency component injection and coding operation if the audio signal is a tone, and the band selector selection if the tone select controller indicates that the audio signal is normal 12. A computer readable storage medium having stored thereon a computer program product configured to perform a method for encoding a high frequency region of an audio signal by performing band replicant scaling and single frequency component injection.
Selecting at least two said single frequency components; And
And generating an indicator configured to represent the at least two single frequency components, the indicator being configured to be dependent on frequency separation between the two single frequency components. .

컴퓨터에 의해 실행되어 오디오 신호의 저 주파수 영역을 부호화하고, 톤 선택 제어기가 오디오 신호가 톤임을 나타내면 단일 주파수 성분 주입 및 코딩 동작을 실행하고, 상기 톤 선택 제어기가 오디오 신호가 노멀임을 나타내면 대역 리플리컨트 선택, 대역 리플리컨트 스케일링 및 단일 주파수 성분 주입을 실행하여 오디오 신호의 고 주파수 영역을 부호화한 오디오 신호를 복호화하는 방법을 수행하도록 구성된 컴퓨터 프로그램 제품을 저장한 컴퓨터 판독가능 저장매체에 있어서, 상기 복호화 방법은,
적어도 두 개의 상기 단일 주파수 성분들을 나타내고 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격을 나타내는 적어도 한 지시자를 수신하는 단계; 및
상기 수신된 지시자에 의존해 상기 적어도 두 개의 단일 주파수 성분들을 삽입하는 단계를 포함함을 특징으로 하는 컴퓨터 판독가능 저장매체. Executed by a computer to encode the low frequency region of the audio signal, and the tone select controller performs a single frequency component injection and coding operation if the audio signal is a tone, and the band selector selection if the tone select controller indicates that the audio signal is normal 16. A computer readable storage medium having a computer program product configured to perform a method for decoding an audio signal encoded in a high frequency region of an audio signal by performing band replicant scaling and single frequency component injection.
Receiving at least one indicator indicating at least two said single frequency components and indicating a frequency interval between said two single frequency components; And
And inserting the at least two single frequency components in dependence on the received indicator.

오디오 신호의 저 주파수 영역을 부호화하고, 톤 선택 제어기가 오디오 신호가 톤임을 나타내면 단일 주파수 성분 주입 및 코딩 동작을 실행하고, 상기 톤 선택 제어기가 오디오 신호가 노멀임을 나타내면 대역 리플리컨트 선택, 대역 리플리컨트 스케일링 및 단일 주파수 성분 주입을 실행하여 오디오 신호의 고 주파수 영역을 부호화하기 위한 인코더에 있어서,
적어도 두 개의 상기 단일 주파수 성분들을 선택하기 위한 선택 수단; 및
상기 적어도 두 개의 단일 주파수 성분들을 나타내도록 구성되고, 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격 (frequency separation)에 종속되도록 구성되는 지시자를 생성하도록 하는 지시자 생성 수단을 포함함을 특징으로 하는 인코더.Encode a low frequency region of the audio signal, and the tone select controller performs a single frequency component injection and coding operation if the audio signal is a tone, and if the tone select controller indicates that the audio signal is normal, band replicant selection, band replicant scaling and An encoder for encoding a high frequency region of an audio signal by performing a single frequency component injection,
Selecting means for selecting at least two said single frequency components; And
An indicator generating means configured to generate the at least two single frequency components, the indicator generating means for generating an indicator configured to be dependent on a frequency separation between the two single frequency components.

오디오 신호의 저 주파수 영역을 부호화하고, 톤 선택 제어기가 오디오 신호가 톤임을 나타내면 단일 주파수 성분 주입 및 코딩 동작을 실행하고, 상기 톤 선택 제어기가 오디오 신호가 노멀임을 나타내면 대역 리플리컨트 선택, 대역 리플리컨트 스케일링 및 단일 주파수 성분 주입을 실행하여 오디오 신호의 고 주파수 영역을 부호화한 오디오 신호를 복호화하기 위한 디코더에 있어서,
적어도 두 개의 상기 단일 주파수 성분들을 나타내고 상기 두 개의 단일 주파수 성분들 사이의 주파수 간격을 나타내는 적어도 한 지시자를 수신하기 위한 수신 수단; 및
상기 수신된 지시자에 의존해 상기 적어도 두 개의 단일 주파수 성분들을 삽입하도록 하는 삽입 수단을 포함함을 특징으로 하는 디코더.Encode a low frequency region of the audio signal, and the tone select controller performs a single frequency component injection and coding operation if the audio signal is a tone, and if the tone select controller indicates that the audio signal is normal, band replicant selection, band replicant scaling and A decoder for decoding an audio signal obtained by encoding a high frequency region of an audio signal by performing a single frequency component injection.
Receiving means for receiving at least one indicator indicating at least two said single frequency components and indicating a frequency interval between said two single frequency components; And
Decoder means for inserting the at least two single frequency components in dependence on the received indicator.