JP5262171B2

JP5262171B2 - Encoding apparatus, encoding method, and encoding program

Info

Publication number: JP5262171B2
Application number: JP2008037991A
Authority: JP
Inventors: 美由紀白川; 政直鈴木; 義照土永
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2008-02-19
Filing date: 2008-02-19
Publication date: 2013-08-14
Anticipated expiration: 2028-02-19
Also published as: US9076440B2; EP2093758A2; US20090210235A1; CN101515458A; JP2009198612A

Abstract

A disclosed encoding device converts an audio signal into frequency spectra, determines allowable error powers with respect to bands divided by the frequency of the audio signal by a predetermined width, detects a tonal frequency spectrum from the frequency spectra, and detects a band containing the frequency spectrum. Using the detection result and the allowable error powers, the encoding device performs correction such that allowable error powers determined by a power determining unit with respect to bands adjacent to the band detected by a detecting unit become smaller than the powers of the frequency spectra with respect to the adjacent bands, and quantizes each of frequency spectra having greater powers than the corrected allowable error powers.

Description

この発明は、符号化装置、符号化方法および符号化プログラムに関する。 The present invention relates to an encoding device, an encoding method, and an encoding program.

従来より、オーディオ信号（音声・音楽などの音源）を圧縮・伸張するオーディオ符号化技術に関して、様々な研究が行われている。例えば、オーディオ信号を周波数領域に変換して符号化する手法について、様々な研究が行われている。 Conventionally, various studies have been conducted on audio encoding technology for compressing and expanding audio signals (sound sources such as voice and music). For example, various studies have been conducted on techniques for converting audio signals into a frequency domain and encoding them.

例えば、このようなオーディオ符号化技術としては、ＡＡＣ（ＡｄｖａｎｃｅｄＡｕｄｉｏＣｏｄｉｎｇ）方式や、ＨＥ−ＡＡＣ（ＨｉｇｈＥｆｆｉｃｉｅｎｃｙ−ＡｄｖａｎｃｅｄＡｕｄｉｏＣｏｄｉｎｇ）方式などがある。ＡＡＣ方式やＨＥ−ＡＡＣ方式は、ＩＳＯ／ＩＥＣのＭＰＥＧ−２／４Ａｕｄｉｏ規格の一つであり、日本のデジタル放送（地上波、ＢＳ、ＣＳ、ワンセグ）などに広く用いられている。 For example, as such an audio encoding technique, there are an AAC (Advanced Audio Coding) system, a HE-AAC (High Efficiency-Advanced Audio Coding) system, and the like. The AAC system and the HE-AAC system are one of ISO / IEC MPEG-2 / 4 Audio standards, and are widely used in Japanese digital broadcasting (terrestrial, BS, CS, One Seg).

このようなオーディオ符号化技術では、オーディオ符号化技術を実行する従来の符号化装置が、ＭＤＣＴ（ＭｏｄｉｆｉｅｄＤｉｓｃｒｅｔｅＣｏｓｉｎｅＴｒａｎｓｆｏｒｍ）変換により周波数スペクトルにオーディオ信号を変換し、この周波数スペクトルを量子化し、その後、符号化する。 In such an audio encoding technique, a conventional encoding apparatus that performs the audio encoding technique converts an audio signal into a frequency spectrum by MDCT (Modified Discrete Cosine Transform) conversion, quantizes the frequency spectrum, and then Encode.

上記したオーディオ符号化技術では、従来の符号化装置が、聴覚のマスキング特性を利用して、周波数スペクトルを量子化する。具体的には、従来の符号化装置は、ある音にマスクされて、聴覚的に聞こえない音の成分を決定する閾値（聞こえるか、聞こえないかの閾値）であるマスキング閾値を用いて、人の聴覚に聞こえる音だけを量子化する。 In the audio encoding technique described above, the conventional encoding apparatus quantizes the frequency spectrum using auditory masking characteristics. Specifically, the conventional encoding device uses a masking threshold that is a threshold (threshold for hearing or not hearing) that determines a component of a sound that is masked by a certain sound and cannot be heard audibly. Quantize only the sound that can be heard.

例えば、従来の符号化装置は、オーディオ信号（符号化する音源）について、心理聴覚分析（聴覚によって、聞こえるか、聞こえないかを分析する手法）を行い、マスキング閾値を周波数ごとに求める。その後、従来の符号化装置は、求めたマスキング閾値から、量子化する際に許容できる誤差の限界（許容誤差電力）を、帯域（所定の周波数の幅）ごとに決定する。そして、従来の符号化装置は、許容誤差電力を用いて、聴覚的に聞こえる音源（周波数のスペクトル）のみを量子化する。 For example, a conventional encoding device performs psychoacoustic analysis (a technique for analyzing whether an audio signal can be heard or not heard) for an audio signal (a sound source to be encoded), and obtains a masking threshold for each frequency. Thereafter, the conventional encoding apparatus determines a limit of error (allowable error power) that can be allowed for quantization for each band (width of a predetermined frequency) from the obtained masking threshold. Then, the conventional coding apparatus quantizes only the sound source (frequency spectrum) that can be heard audibly, using the allowable error power.

なお、特許文献１（特開２００６−１８０２３号公報）には、マスキング閾値を調節する手法が開示されており、特許文献２（２００１−７７０４号公報）には、符号化する際の効率（符号化する際に使用するビットの使用量を少なくする）手法が開示されており、特許文献３（特開平７−２０２８２３号公報）や特許文献４（特開平７−２９５５９４号公報）には、ビットの配分量を指定する手法が開示されている。 Patent Document 1 (Japanese Patent Application Laid-Open No. 2006-18023) discloses a technique for adjusting a masking threshold, and Patent Document 2 (Japanese Patent Application Laid-Open No. 2001-7704) discloses an encoding efficiency (code). A method of reducing the amount of bits used for conversion) is disclosed, and Patent Document 3 (Japanese Patent Laid-Open No. 7-202823) and Patent Document 4 (Japanese Patent Laid-Open No. 7-295594) disclose bits. A method for specifying the amount of allocation is disclosed.

特開２００６−１８０２３号公報（第５−１１頁、第１図）JP 2006-18023 A (page 5-11, FIG. 1) 特開２００１−７７０４号公報（第５−９頁、第１図）JP 2001-7704 A (page 5-9, FIG. 1) 特開平７−２０２８２３号公報（第３−５頁、第１図）Japanese Patent Laid-Open No. 7-202823 (page 3-5, FIG. 1) 特開平７−２９５５９４号公報（第２−３頁、第１図）JP-A-7-295594 (page 2-3, FIG. 1)

ところで、上記した従来の技術は、トーン性のある（高い）オーディオ信号を符号化する際に、音質が劣化していたという課題があった。 By the way, the above-described conventional technique has a problem that the sound quality is deteriorated when an audio signal having a tone property (high) is encoded.

具体的には、トーン性のあるオーディオ信号（例えば、正弦波やスイープ波など）を符号化する場合、図２３の（１）に示すように、特定の帯域に強度（電力［ｄＢ］、パワー）が集中し、他の帯域と比較して相対的に大きなピークを示す（強度が強い周波数スペクトルとなる）。なお、ここで、図２３は、従来技術を説明するための図であり、図２３の（１）は、トーン性のあるオーディオ信号をＭＤＣＴ変換して得られる周波数スペクトルの一例を示したものである。 Specifically, when encoding an audio signal having a tone characteristic (for example, a sine wave or a sweep wave), as shown in (1) of FIG. 23, intensity (power [dB], power ) Concentrate and show a relatively large peak compared to other bands (a frequency spectrum with a strong intensity). Here, FIG. 23 is a diagram for explaining the prior art, and (1) in FIG. 23 shows an example of a frequency spectrum obtained by MDCT conversion of a tonal audio signal. is there.

また、従来の符号化装置では、図２３の（２）に示すように、ピークが含まれる帯域について算出される許容誤差電力だけでなく、ピークが含まれる帯域の近傍の帯域について算出される許容誤差電力も大きくなっていた。具体的には、ピークが含まれる帯域の周波数スペクトルは、他の周波数スペクトルと比較して電力が大きいため、従来の符号化装置では、ピークが含まれる帯域だけでなく、近傍の帯域について算出されるマスキング閾値も大きくなり、その結果、許容誤差電力も大きくなっていた。この結果、図２３の（３）に示すように、ピーク近傍の帯域に存在する周波数スペクトルが許容誤差電力以下の周波数スペクトルとなり、従来の符号化装置は、これらの周波数スペクトルを、量子化対象外として量子化しなかった。 Further, in the conventional coding apparatus, as shown in (2) of FIG. 23, not only the allowable error power calculated for the band including the peak, but also the allowable calculation calculated for the band near the band including the peak. The error power was also large. Specifically, since the frequency spectrum of the band including the peak has a higher power than the other frequency spectrum, the conventional coding apparatus calculates not only the band including the peak but also the nearby band. As a result, the permissible error power is also increased. As a result, as shown in (3) of FIG. 23, the frequency spectrum existing in the band near the peak becomes a frequency spectrum that is equal to or lower than the allowable error power. Not quantized as.

ここで、ＭＤＣＴ変換して得られる周波数スペクトルであるＭＤＣＴ係数各々は、オーディオ信号の振幅と位相の情報を含んでいる。例えば、ピーク近傍の周波数スペクトルを量子化対象外として量子化しない場合、量子化対象外とした周波数スペクトルに含まれる情報が消失し、ピークに関する音源の位相や振幅に影響し、ふるえ感などの音質劣化が生じる。特にトーン性のあるオーディオ信号では、特定の周波数（ピークのある帯域）付近の音源が主な音源であるため、ピーク近傍の周波数スペクトルに含まれる情報が消失したことによる影響が、トーン性の低いオーディオ信号と比較して、符号化した音源の音質に強く生じていた。 Here, each MDCT coefficient, which is a frequency spectrum obtained by MDCT conversion, includes information on the amplitude and phase of an audio signal. For example, if the frequency spectrum near the peak is not quantized and not quantized, the information contained in the frequency spectrum that is not quantized disappears, affecting the phase and amplitude of the sound source related to the peak, and sound quality such as trembling Deterioration occurs. In particular, in a tone signal, since a sound source near a specific frequency (a band with a peak) is a main sound source, the effect of losing information included in the frequency spectrum near the peak is low in tone characteristics. Compared to the audio signal, the sound quality of the encoded sound source was stronger.

このように、従来の符号化装置では、トーン性のあるオーディオ信号を符号化する際に、ピーク近傍の周波数スペクトルを確実に量子化することができず、十分な音質を維持したまま符号化することができなかった。 As described above, when encoding a tone-like audio signal, the conventional encoding apparatus cannot reliably quantize the frequency spectrum in the vicinity of the peak, and encodes it while maintaining sufficient sound quality. I couldn't.

なお、上記した特許文献１〜４には、ピーク近傍の周波数スペクトルを確実に量子化する手法は開示されておらず、トーン性のあるオーディオ信号を符号化する際の音質を十分に改善することができなかった。 In addition, the above-described Patent Documents 1 to 4 do not disclose a method for reliably quantizing the frequency spectrum in the vicinity of the peak, and sufficiently improve the sound quality when encoding a tonal audio signal. I could not.

そこで、この発明は、上述した従来技術の課題を解決するためになされたものであり、トーン性のあるオーディオ信号の符号化音質を改善することが可能である符号化装置、符号化方法および符号化プログラムを提供することを目的とする。 Accordingly, the present invention has been made to solve the above-described problems of the prior art, and is an encoding device, encoding method, and code capable of improving the encoding sound quality of an audio signal having tone characteristics. The purpose is to provide a computerized program.

上述した課題を解決し、目的を達成するため、この符号化装置は、オーディオ信号を周波数スペクトルに変換し、当該周波数スペクトルを量子化して符号化する符号化装置において、前記周波数スペクトルからトーン性のある周波数スペクトルを検出された場合、前記オーディオ信号に応じて算出された許容誤差電力を補正する電力補正部と、前記電力補正部によって補正された許容誤差電力より大きな電力の前記周波数スペクトル各々を量子化する量子化部と、を有することを要件とする。 In order to solve the above-described problems and achieve the object, the encoding device converts an audio signal into a frequency spectrum and quantizes the frequency spectrum to encode the frequency spectrum. When a certain frequency spectrum is detected, a power correction unit that corrects the allowable error power calculated according to the audio signal, and each of the frequency spectrums having a power larger than the allowable error power corrected by the power correction unit is quantized. It is a requirement to have a quantization unit to be converted.

また、この符号化装置は、オーディオ信号を符号化する符号化装置において、前記オーディオ信号を周波数スペクトルに変換する周波数変換部と、前記オーディオ信号に応じて、許容誤差電力を算出する電力算出部と、前記周波数変換部によって変換された前記周波数スペクトルから、トーン性のある周波数スペクトルを検出する検出部と、前記検出部での検出結果と前記電力算出部によって算出された前記許容誤差電力とを用いて、前記許容誤差電力を補正する電力補正部と、前記電力補正部によって補正された前記許容誤差電力より大きな電力の前記周波数スペクトル各々を量子化する量子化部と、を有することを要件とする。 The encoding device further includes a frequency conversion unit that converts the audio signal into a frequency spectrum in an encoding device that encodes an audio signal, and a power calculation unit that calculates an allowable error power according to the audio signal. A detection unit for detecting a frequency spectrum having tone characteristics from the frequency spectrum converted by the frequency conversion unit, a detection result of the detection unit, and the allowable error power calculated by the power calculation unit. And a power correction unit that corrects the allowable error power, and a quantization unit that quantizes each frequency spectrum having a power larger than the allowable error power corrected by the power correction unit. .

また、この符号化装置は、さらに、前記電力算出部は、前記オーディオ信号の周波数を所定の幅で分割した帯域それぞれについて、前記許容誤差電力を算出し、前記検出部は、前記トーン性のある周波数スペクトルを検出するとともに当該周波数スペクトルが含まれる前記帯域を検出し、前記電力補正部は、前記検出部によって検出された前記帯域に近傍する帯域について前記電力算出部によって算出された前記許容誤差電力が、当該近傍する帯域についての前記周波数スペクトルの電力より小さくなるように補正することを要件とする。 Further, in this encoding device, the power calculation unit calculates the allowable error power for each band obtained by dividing the frequency of the audio signal by a predetermined width, and the detection unit has the tone characteristics. The frequency error is detected and the band including the frequency spectrum is detected, and the power correction unit calculates the allowable error power calculated by the power calculation unit for a band close to the band detected by the detection unit. However, it is necessary to correct so that it becomes smaller than the power of the frequency spectrum for the neighboring band.

また、この符号化装置は、さらに、前記量子化部は、前記周波数スペクトルのダイナミックレンジを、スケール値によって一意に特定されるダイナミックレンジに縮小し、縮小したダイナミックレンジにおける当該周波数スペクトル各々を量子化するものであって、前記帯域に含まれる前記周波数スペクトルを量子化する際に発生する誤差である量子化誤差から算出される量子化誤差電力が、当該帯域について電力算出部によって算出された前記許容誤差電力未満になるようなスケール値を、前記帯域ごとに算出する第１のスケール値算出部と、前記検出部によって検出された前記トーン性のある前記周波数スペクトルが含まれる前記帯域に近傍する帯域について、当該近傍する帯域に含まれる前記周波数スペクトルを量子化する際に発生する誤差である量子化誤差から算出される量子化誤差電力が、当該近傍する帯域について前記電力補正部によって当該近傍する帯域について算出された前記許容誤差電力未満となるようなスケール値を、当該近傍する帯域についての当該スケール値として算出する第２のスケール値算出部と、をさらに備え、前記量子化部は、前記第２のスケール値算出部によって前記スケール値が算出された前記帯域に含まれる前記周波数スペクトル各々を、当該第２のスケール値算出部によって算出された前記スケール値を用いて量子化し、当該第２のスケール値算出部によって当該スケール値が算出されていない当該帯域に含まれる前記周波数スペクトル各々を、前記第１のスケール値算出部によって算出されたスケール値を用いて量子化することを要件とする。 Further, in this encoding device, the quantization unit further reduces the dynamic range of the frequency spectrum to a dynamic range uniquely specified by a scale value, and quantizes each frequency spectrum in the reduced dynamic range. The quantization error power calculated from the quantization error, which is an error that occurs when the frequency spectrum included in the band is quantized, is calculated by the power calculation unit for the band. A first scale value calculation unit that calculates a scale value that is less than the error power for each band, and a band that is close to the band that includes the frequency spectrum with the tone characteristic detected by the detection unit Occurs when the frequency spectrum included in the neighboring band is quantized. A scale value such that the quantization error power calculated from the quantization error that is an error is less than the allowable error power calculated for the neighboring band by the power correction unit for the neighboring band A second scale value calculation unit that calculates the scale value for the band, and the quantization unit is included in the band in which the scale value is calculated by the second scale value calculation unit. Each frequency spectrum is quantized using the scale value calculated by the second scale value calculation unit, and the frequency included in the band in which the scale value is not calculated by the second scale value calculation unit Requirements that each spectrum is quantized using the scale value calculated by the first scale value calculation unit To.

また、この符号化装置は、さらに、前記周波数スペクトルを前記量子化部が量子化して得られる値には、当該値として得ることが可能である最大の値が設定されているものであって、前記帯域に含まれる前記周波数スペクトルを量子化する際に発生する誤差である量子化誤差から算出される量子化誤差電力が、当該帯域について電力算出部によって算出された前記許容誤差電力未満になるようなスケール値を、前記帯域ごとに算出する第１のスケール値算出部と、前記検出部によって検出された前記トーン性のある前記周波数スペクトルが含まれる前記帯域を構成する当該周波数スペクトル各々の内、最も大きな当該周波数スペクトルから得られる値が前記最大の値となるような前記スケール値を、当該帯域についての当該スケール値として算出する第３のスケール値算出部と、をさらに備え、前記量子化部は、前記第３のスケール値算出部によって前記スケール値が算出された前記帯域に含まれる前記周波数スペクトル各々を、当該第３のスケール値算出部によって算出された前記スケール値を用いて量子化し、当該第３のスケール値算出部によって当該スケール値が算出されていない当該帯域に含まれる前記周波数スペクトル各々を、前記第１のスケール値算出部によって算出されたスケール値を用いて量子化することを要件とする。 Further, in the encoding device, a value obtained by quantizing the frequency spectrum by the quantization unit is set to a maximum value that can be obtained as the value, A quantization error power calculated from a quantization error that is an error that occurs when the frequency spectrum included in the band is quantized is less than the allowable error power calculated by the power calculation unit for the band. A first scale value calculation unit that calculates a specific scale value for each band, and each of the frequency spectra constituting the band including the frequency spectrum having the tone characteristics detected by the detection unit, The scale value such that the value obtained from the largest frequency spectrum is the maximum value is the scale value for the band. A third scale value calculation unit that outputs the frequency spectrum, and the quantization unit outputs each of the frequency spectra included in the band in which the scale value is calculated by the third scale value calculation unit. Each of the frequency spectra included in the band in which the scale value is not calculated by the third scale value calculation unit is quantized using the scale value calculated by the third scale value calculation unit. It is necessary to perform quantization using the scale value calculated by the scale value calculation unit.

また、この符号化装置は、さらに、前記帯域に含まれる前記周波数スペクトルを量子化する際に発生する誤差である量子化誤差から算出される量子化誤差電力が、当該帯域について電力算出部によって算出された前記許容誤差電力未満になるようなスケール値を、前記帯域ごとに算出する第１のスケール値算出部と、前記帯域に含まれる前記周波数スペクトルを量子化する際に発生する前記量子化誤差電力を、当該帯域について前記第１のスケール値算出部によって算出された前記スケール値を用いて算出し、かつ、当該第１のスケール値算出部によって算出された前記スケール値を所定の値に変更したスケール値である変更スケール値を用いて算出する誤差算出部と、をさらに備え、前記量子化部は、前記誤差算出部によって算出された前記量子化誤差電力の内、最も小さい当該量子化誤差電力が算出された際に用いられた前記スケール値または変更スケール値を用いて、前記帯域に含まれる当該周波数スペクトルを量子化することを要件とする。 The encoding device further calculates a quantization error power calculated from a quantization error, which is an error generated when the frequency spectrum included in the band is quantized, by the power calculation unit for the band. A first scale value calculation unit that calculates, for each band, a scale value that is smaller than the permissible error power, and the quantization error that occurs when the frequency spectrum included in the band is quantized. Electric power is calculated for the band using the scale value calculated by the first scale value calculation unit, and the scale value calculated by the first scale value calculation unit is changed to a predetermined value. An error calculation unit that calculates using a changed scale value that is a scale value that has been obtained, and the quantization unit is configured to calculate the error before calculating the error calculation unit. It is a requirement that the frequency spectrum included in the band is quantized using the scale value or the change scale value used when the smallest quantization error power is calculated among the quantization error powers. To do.

また、この符号化装置は、さらに、前記量子化部は、前記第３のスケール値算出部によって算出されたスケール値を用いて、すべての前記帯域に含まれる前記周波数スペクトル各々を量子化することを要件とする。 Further, in this encoding device, the quantization unit quantizes each of the frequency spectra included in all the bands using the scale value calculated by the third scale value calculation unit. Is a requirement.

また、この符号化装置は、さらに、所定の帯域数を記憶する帯域数記憶部をさらに備え、前記電力補正部は、前記検出部によって検出された前記トーン性のある前記周波数スペクトルが含まれる前記帯域を中心として、前記帯域数記憶部によって記憶された前記所定の帯域数の範囲に位置する帯域各々を、前記近傍の帯域として前記許容誤差電力を補正することを要件とする。 In addition, the encoding device further includes a band number storage unit that stores a predetermined number of bands, and the power correction unit includes the frequency spectrum having the tone characteristic detected by the detection unit. It is a requirement that the allowable error power is corrected with each band located in the range of the predetermined number of bands stored in the band number storage unit centered on the band as the neighboring band.

開示の符号化装置によれば、トーン性のあるオーディオ信号の符号化音質を改善することが可能である。 According to the disclosed encoding device, it is possible to improve the encoded sound quality of an audio signal having tone characteristics.

また、開示の符号化装置によれば、量子化誤差を最低限に抑えることが可能である。 Further, according to the disclosed encoding apparatus, it is possible to minimize the quantization error.

また、開示の符号化装置によれば、使用するビット数を、抑えることが可能である。 Also, according to the disclosed encoding device, the number of bits to be used can be suppressed.

以下に添付図面を参照して、この発明に係る符号化装置、符号化方法および符号化プログラムの実施例を詳細に説明する。なお、以下では、実施例１に係る符号化装置を説明する上での前提技術、実施例１に係る符号化装置の概要および特徴、実施例１に係る符号化装置の構成および処理の流れを順に説明し、その後、その他の実施例について説明する。 Exemplary embodiments of an encoding device, an encoding method, and an encoding program according to the present invention will be described below in detail with reference to the accompanying drawings. In the following, the prerequisite technology for describing the coding apparatus according to the first embodiment, the outline and features of the coding apparatus according to the first embodiment, the configuration of the coding apparatus according to the first embodiment, and the flow of processing are described. These will be described in order, and then other embodiments will be described.

［前提技術］
まず最初に、図１〜図４を用いて、実施例１に係る符号化装置を説明する上での前提技術を説明する、なお、図１〜図４は、実施例１に係る符号化装置の前提技術を説明するための図である。 [Prerequisite technology]
First, the base technology for describing the encoding apparatus according to the first embodiment will be described with reference to FIGS. 1 to 4. FIGS. 1 to 4 illustrate the encoding apparatus according to the first embodiment. It is a figure for demonstrating the base technology of.

「周波数スペクトル」とは、オーディオ信号（音源）を周波数領域に変換（例えば、ＭＤＣＴ変換）した場合に得られる周波数ごとの係数（例えば、ＭＤＣＴ係数）である。「周波数スペクトル電力」とは、周波数スペクトルを二乗した値である。また、「トーン性のある周波数スペクトル」とは、周波数スペクトルの内、ある周波数に周波数スペクトル電力のピークが集中した場合の当該周波数の係数のことである。例えば、全ての周波数スペクトル電力の平均より大きい電力を持つ周波数スペクトルが該当する。なお、「トーン性のある周波数スペクトル」の変換元のオーディオ信号を、「トーン性のある音源」という。 The “frequency spectrum” is a coefficient (for example, MDCT coefficient) for each frequency obtained when the audio signal (sound source) is converted into the frequency domain (for example, MDCT conversion). The “frequency spectrum power” is a value obtained by squaring the frequency spectrum. In addition, the “frequency spectrum having tone characteristics” is a coefficient of the frequency when the peak of the frequency spectrum power is concentrated at a certain frequency in the frequency spectrum. For example, a frequency spectrum having a power larger than the average of all frequency spectrum powers is applicable. Note that the audio signal from which the “tone frequency spectrum” is converted is referred to as a “tone sound source”.

また、「量子化」とは、有効数字のうち、小数点以下の数値を切り捨てる処理である（例えば、「１．８」や「２．１」を「１」や「２」などの整数にする）。「量子化値」とは、周波数スペクトルを量子化することによって得られる値を示す。 “Quantization” is a process of truncating numerical values after the decimal point among significant figures (for example, “1.8” or “2.1” is changed to an integer such as “1” or “2”). ). “Quantized value” refers to a value obtained by quantizing a frequency spectrum.

「量子化誤差」とは、周波数スペクトルを量子化することによって周波数スペクトル各々について発生する誤差であって、具体的には、図１に示すように、量子化前の周波数スペクトルと、逆量子化後のスペクトル（これを、「逆量子化スペクトル」とする）との差分がこれに該当する。 The “quantization error” is an error generated for each frequency spectrum by quantizing the frequency spectrum. Specifically, as shown in FIG. 1, the frequency spectrum before quantization and the inverse quantization This is the difference from the later spectrum (referred to as “inverse quantization spectrum”).

ここで、「逆量子化スペクトル」とは、量子化値から得られる周波数スペクトルである。周波数スペクトルと量子化値と逆量子化スペクトルの関係を説明する。符号化装置は、以下に示す一連の処理によって、周波数スペクトルを量子化して、量子化値を得て、そして、量子化値から逆量子化スペクトルを得る。まず、符号化装置は、図１の（１）に示すように、周波数スペクトルのダイナミックレンジが大きいため、所定の「スケール値」を用いてスケーリングし、レンジを小さくする。その後、符号化装置は、図１の（２）に示すように、量子化を行い、量子化値を得る。そして、符号化装置は、図１の（３）に示すように、得られた量子化値を所定のスケール値を用いてリスケーリング（図１の（１）で行ったスケーリングの逆の処理）し、逆量子化スペクトルを得る。 Here, the “inverse quantization spectrum” is a frequency spectrum obtained from the quantized value. The relationship between the frequency spectrum, the quantization value, and the inverse quantization spectrum will be described. The encoding apparatus quantizes the frequency spectrum by a series of processes shown below to obtain a quantized value, and obtains an inverse quantized spectrum from the quantized value. First, since the dynamic range of the frequency spectrum is large as shown in (1) of FIG. 1, the encoding device performs scaling using a predetermined “scale value” to reduce the range. Thereafter, as shown in (2) of FIG. 1, the encoding apparatus performs quantization and obtains a quantization value. Then, the encoding apparatus rescals the obtained quantized value using a predetermined scale value as shown in (3) of FIG. 1 (the inverse process of the scaling performed in (1) of FIG. 1). Then, an inverse quantization spectrum is obtained.

なお、ここで、逆量子化スペクトルは、図２の（１）に示す式によって示され、量子化値は、図２の（２）に示す式によって示される。なお、これらの式は、周波数スペクトルと量子化値とスケール値との関係を示す式である（式１）を変形したものである。なお、「２＾（スケール値）」とは、「２の（スケール値）乗を示す。」
（式１）周波数スペクトル＝量子化値×２＾（スケール値） Here, the inverse quantization spectrum is shown by the equation shown in (1) of FIG. 2, and the quantized value is shown by the equation shown in (2) of FIG. These equations are modified from (Equation 1), which is an equation showing the relationship among the frequency spectrum, quantized value, and scale value. Note that “2 ^ (scale value)” means “2 to the power of (scale value).”
(Expression 1) Frequency spectrum = Quantized value x 2 ^ (Scale value)

「スケール値」とは、帯域ごとに一つずつ与えられるものであり、例えば、図１に示す例では、図１の（４）に示す周波数スペクトルと、図１の（５）に示す周波数スペクトルとを含む帯域「ｂ」に対して、一つのスケール値が与えられる。なお、スケール値としては、符号化装置によって、量子化誤差電力が許容誤差電力未満になるような値が用いられる。なお、帯域とは、周波数を所定の幅で分割した領域それぞれを示す。 “Scale value” is given for each band. For example, in the example shown in FIG. 1, the frequency spectrum shown in (4) of FIG. 1 and the frequency spectrum shown in (5) of FIG. One scale value is given for the band “b” including As the scale value, a value is used such that the quantization error power becomes less than the allowable error power depending on the encoding device. The band indicates each region obtained by dividing the frequency by a predetermined width.

周波数スペクトルの「帯域電力」とは、帯域に含まれる周波数スペクトル電力各々の和を示す。 The “band power” of the frequency spectrum indicates the sum of each frequency spectrum power included in the band.

周波数スペクトルの「量子化誤差電力」とは、量子化誤差の二乗の値を示す。また、ある帯域についての量子化誤差電力とは、当該帯域に含まれる周波数スペクトル各々を量子化する際に発生する量子化誤差から算出される量子化誤差電力各々の和を示す。具体的には、ある帯域の量子化誤差電力と量子化誤差との関係は、（式２）によって示される。なお、「＾２」とは、「２乗」を示す。
（式２）ある帯域の量子化誤差電力＝Σ｛（当該帯域に含まれる周波数スペクトル各々の量子化誤差）＾２｝ The “quantization error power” of the frequency spectrum indicates the square value of the quantization error. Further, the quantization error power for a certain band indicates the sum of each quantization error power calculated from the quantization error generated when each frequency spectrum included in the band is quantized. Specifically, the relationship between quantization error power and quantization error in a certain band is expressed by (Equation 2). “^ 2” indicates “square”.
(Expression 2) Quantization error power of a certain band = Σ {(quantization error of each frequency spectrum included in the band) ^ 2}

また、「許容誤差電力」とは、量子化する際に許容できる最大の量子化誤差電力であり、オーディオ信号について算出される聴覚的に聞こえるか否かを示す閾値であるマスキング閾値が変換されて、オーディオ信号の周波数を所定の幅で分割した帯域それぞれについて算出される。なお、マスキング閾値から許容誤差電力を算出する手法については、例えば、ＩＳＯ／ＩＥＣ１３８１８−７に記載された手法を用いてもよく、それ以外の手法を用いてもよい。 Also, the “allowable error power” is the maximum quantization error power that can be allowed when quantizing, and a masking threshold that is a threshold indicating whether or not it is audibly heard calculated for an audio signal is converted. The frequency is calculated for each band obtained by dividing the frequency of the audio signal by a predetermined width. As a method for calculating the allowable error power from the masking threshold, for example, a method described in ISO / IEC 13818-7 may be used, or another method may be used.

具体的には、許容誤差電力は、「許容できる量子化誤差電力の限界」である。例えば、ある帯域の許容誤差電力とは、ある帯域について算出される量子化誤差電力であって、当該帯域の周波数スペクトルを量子化する際に発生する誤差として許容できる最大の値を示す。言い換えると、開示の符号化装置は、ある帯域における量子化前の周波数スペクトルの電力と、逆量子化スペクトルの電力との差が、許容誤差電力未満となるように、周波数スペクトルを量子化する。 Specifically, the allowable error power is “a limit of allowable quantization error power”. For example, the allowable error power of a certain band is the quantization error power calculated for a certain band, and indicates the maximum value that can be allowed as an error that occurs when the frequency spectrum of the band is quantized. In other words, the disclosed encoding apparatus quantizes the frequency spectrum so that the difference between the power of the frequency spectrum before quantization in a certain band and the power of the inverse quantization spectrum is less than the allowable error power.

また、許容誤差電力は、マスキング閾値から変換されて帯域ごとに算出され、帯域ごとの周波数スペクトルの電力と比較して、どの帯域の周波数スペクトルを量子化するかを決定する際にも用いられる。なお、量子化する周波数スペクトルを決定する際に、許容誤差電力と比較するのは、比較対象となる帯域電力である。 The allowable error power is converted from the masking threshold value and calculated for each band, and is used for determining which band frequency spectrum is to be quantized as compared with the frequency spectrum power for each band. When determining the frequency spectrum to be quantized, it is the band power to be compared that is compared with the allowable error power.

また、「符号化」とは、量子化値やスケール値を、例えばハフマン符号化を用いて別の値（符号）に変換する処理である。 “Encoding” is a process of converting a quantized value or a scale value into another value (code) using, for example, Huffman coding.

スケール値と、量子化誤差電力との関係について、簡単に説明する。上記したように、スケール値は、帯域に１つずつ割り当てられ、１つの帯域に含まれる周波数スペクトル各々は、割り当てられたスケール値を用いて量子化される。 The relationship between the scale value and the quantization error power will be briefly described. As described above, one scale value is assigned to each band, and each frequency spectrum included in one band is quantized using the assigned scale value.

帯域内の、ある１つの周波数スペクトルに着目すると、量子化値とスケール値との関係は図３の（１）に示すようになり、以下の（式３）と（式４）とに示す関係が成立する。
（式３）スケール値大 ⇔ 量子化値小
（式４）スケール値小 ⇔ 量子化値大 Focusing on one frequency spectrum in the band, the relationship between the quantized value and the scale value is as shown in (1) of FIG. 3, and the relationship shown in the following (Equation 3) and (Equation 4). Is established.
(Equation 3) Large scale value 小 Small quantized value (Formula 4) Small scale value 大 Large quantized value

ここで、帯域に含まれる周波数スペクトルに着目すると、図３の（２）に示すように、スケール値を大きく設定すると、帯域内の電力の小さい周波数スペクトルから、量子化値が「０」となり、量子化誤差が増えることとなる。 Here, paying attention to the frequency spectrum included in the band, as shown in (2) of FIG. 3, when the scale value is set large, the quantized value becomes “0” from the frequency spectrum with small power in the band, The quantization error will increase.

すなわち、スケール値と、量子化誤差電力とでは、図３の（３）に示すように、以下の（式５）と（式６）との関係が成立することになる。
（式５）スケール値大 ⇔ 量子化値小 → 量子化誤差増
（式６）スケール値小 ⇔ 量子化値大 → 量子化誤差減
なお、帯域に含まれるすべての周波数スペクトルを量子化値「０」で量子化する（つまり、量子化しない）場合には、量子化誤差電力が最大の値となり、以下の（式７）の関係が成立することになる。
（式７）量子化誤差電力＝帯域電力 That is, the relationship between the following (Expression 5) and (Expression 6) is established between the scale value and the quantization error power as shown in (3) of FIG.
(Expression 5) Scale value large ⇔ Quantized value small → Quantization error increased (Formula 6) Scale value small ⇔ Quantized value large → Quantized error decreased Note that all frequency spectra included in the band are quantized to “0”. ”Is quantized (that is, not quantized), the quantization error power becomes the maximum value, and the following relationship (Equation 7) is established.
(Expression 7) Quantization error power = band power

また、ここで、スケール値と、量子化誤差電力と、許容誤差電力との関係について簡単に説明する。まず、符号化装置は、帯域電力が、許容誤差電力よりも大きい場合に、量子化する対象の帯域とする。また、符号化装置は、図４に示すように、量子化誤差電力が許容誤差電力未満になるようなスケール値を用いて、周波数スペクトルを量子化する。このため、符号化装置は、図４に示すように、「許容誤差電力＞量子化誤差電力」を満たすスケール値を用いて、量子化を行う。 Here, the relationship among the scale value, the quantization error power, and the allowable error power will be briefly described. First, the encoding apparatus sets a band to be quantized when the band power is larger than the allowable error power. In addition, as illustrated in FIG. 4, the encoding apparatus quantizes the frequency spectrum using a scale value such that the quantization error power is less than the allowable error power. For this reason, as shown in FIG. 4, the encoding apparatus performs quantization using a scale value that satisfies “allowable error power> quantization error power”.

ここで、量子化電力と許容誤差電力と帯域電力との関係について、再度整理する。すなわち、
（１）量子化誤差電力の最大値は、帯域電力である。（式７）
（２）（式７）の関係は、すべての周波数スペクトルを量子化値「０」で量子化するとき（つまり、量子化しないとき）である。
（３）許容誤差電力＞量子化誤差電力（これを（式Ａ）とする）の場合を満たすスケール値を用いて量子化値を行う。
となる。いま、（式７）の関係が成り立っているとき、（量子化誤差電力＝帯域電力）（式Ａ）は、以下の（式Ｂ）によって示される。
（式Ｂ）許容誤差電力＞（量子化誤差電力＝帯域電力）
帯域電力と量子化誤差電力とが等しいときは、上記したように、周波数スペクトルの量化値が「０」（つまり、量子化しない）の場合となる。言い換えると、許容誤差電力は、帯域の周波数スペクトルすべてを量子化するかしないかを決定する閾値となる。 Here, the relationship among the quantized power, the allowable error power, and the band power is rearranged again. That is,
(1) The maximum value of quantization error power is band power. (Formula 7)
(2) The relationship of (Equation 7) is when all the frequency spectra are quantized with the quantization value “0” (that is, when not quantizing).
(3) Quantization values are performed using a scale value that satisfies the case of allowable error power> quantization error power (this is expressed as (Equation A)).
It becomes. Now, when the relationship of (Expression 7) holds, (quantization error power = band power) (Expression A) is expressed by the following (Expression B).
(Formula B) Allowable error power> (quantization error power = band power)
When the band power and the quantization error power are equal, as described above, the quantified value of the frequency spectrum is “0” (that is, no quantization). In other words, the allowable error power is a threshold value that determines whether or not to quantize the entire frequency spectrum of the band.

［符号化装置の概要および特徴］
次に、図５を用いて、実施例１に係る符号化装置の概要および特徴を説明する。図５は、実施例１に係る符号化装置の概要および特徴を説明するための図である。 [Outline and Features of Encoding Device]
Next, the outline and characteristics of the encoding apparatus according to the first embodiment will be described with reference to FIG. FIG. 5 is a diagram for explaining the outline and features of the encoding apparatus according to the first embodiment.

同図に示すように、実施例１に係る符号化装置は、オーディオ信号を符号化する符号化装置である。例えば、図５に示すように、符号化される音源（オーディオ信号）が入力されると、当該オーディオ信号を符号化するものである。この実施例１に係る符号化装置は、以下で説明するように、トーン性のあるオーディオ信号の符号化音質を改善することが可能である点に主たる特徴がある。 As shown in the figure, the encoding apparatus according to the first embodiment is an encoding apparatus that encodes an audio signal. For example, as shown in FIG. 5, when a sound source (audio signal) to be encoded is input, the audio signal is encoded. The encoding apparatus according to the first embodiment is mainly characterized in that it is possible to improve the encoded sound quality of an audio signal having tone characteristics, as will be described below.

すなわち、符号化される音源が入力されると、実施例１に係る符号化装置は、周波数変換部が、オーディオ信号を、図５の（１）に示すような周波数スペクトルに変換する。また、周波数変換部は、図５の（２）に示すように、周波数を所定の幅で分割した領域それぞれを示す帯域ごとに、周波数スペクトルの電力を算出する。例えば、周波数変換部は、帯域に含まれる周波数スペクトル各々の電力の合計（帯域電力）を算出する。なお、図５の（２）に示す例では、塗りつぶしされていない棒それぞれが、帯域各々における周波数スペクトルを示している。 That is, when a sound source to be encoded is input, in the encoding apparatus according to the first embodiment, the frequency conversion unit converts the audio signal into a frequency spectrum as illustrated in (1) of FIG. Further, as shown in (2) of FIG. 5, the frequency conversion unit calculates the power of the frequency spectrum for each band indicating each region obtained by dividing the frequency by a predetermined width. For example, the frequency conversion unit calculates the total power (band power) of each frequency spectrum included in the band. In the example shown in (2) of FIG. 5, each unfilled bar represents a frequency spectrum in each band.

また、実施例１に係る符号化装置は、図５の（３）に示すように、電力算出部が、オーディオ信号に応じて、許容誤差電力を帯域ごとに算出する（上記した［前提技術］参照）。なお、図５の（３）に示す例では、灰色で塗りつぶした棒それぞれが、帯域各々（帯域単位での）における許容誤差電力を示している。 In the encoding apparatus according to the first embodiment, as illustrated in (3) of FIG. 5, the power calculation unit calculates the allowable error power for each band in accordance with the audio signal (the above-mentioned [Premise Technology] reference). In the example shown in (3) of FIG. 5, each bar painted in gray indicates the allowable error power in each band (in band units).

そして、実施例１に係る符号化装置は、図５の（４）に示すように、検出部が、周波数変換部によって変換された周波数スペクトルから、トーン性のある周波数スペクトルを検出するとともに、当該周波数スペクトルが含まれる帯域を検出する。例えば、検出部は、図５の（４）に示す例では、図５の（４）における帯域「５」を、トーン性のある周波数スペクトルが含まれる帯域として検出する。 In the encoding apparatus according to the first embodiment, as illustrated in (4) of FIG. 5, the detection unit detects a frequency spectrum having tone characteristics from the frequency spectrum converted by the frequency conversion unit, and A band including a frequency spectrum is detected. For example, in the example shown in (4) of FIG. 5, the detection unit detects the band “5” in (4) of FIG. 5 as a band including a frequency spectrum with tone characteristics.

そして、実施例１に係る符号化装置は、図５の（５）と（６）に示すように、電力補正部が、検出部での検出結果と電力算出部によって算出された許容誤差電力とを用いて、許容誤差電力を補正する。具体的には、電力補正部は、検出部によって検出された帯域に近傍する帯域について電力算出部によって算出された許容誤差電力が、当該近傍する帯域についての周波数スペクトルの電力より小さくなるように補正する。 In the encoding apparatus according to the first embodiment, as illustrated in (5) and (6) of FIG. 5, the power correction unit includes the detection result in the detection unit and the allowable error power calculated by the power calculation unit. Is used to correct the allowable error power. Specifically, the power correction unit corrects the allowable error power calculated by the power calculation unit for a band near the band detected by the detection unit to be smaller than the power of the frequency spectrum for the adjacent band. To do.

例えば、実施例１に係る符号化装置は、図５の（５）に示す例では、検出部によって検出された帯域「５」に隣接する二つの帯域（近傍する帯域）「４」と「６」とについて、図５の（６）に示すように、当該隣接する二つの帯域「４」と「６」とにおける周波数スペクトルの電力（図５に示す例では、塗りつぶしされていない棒）よりも、当該隣接する二つの帯域「４」と「６」とにおける許容誤差電力（図５に示す例では、灰色で塗りつぶした棒）が小さくなるように、許容誤差電力を補正する。 For example, in the example illustrated in (5) of FIG. 5, the encoding apparatus according to the first embodiment includes two bands (adjacent bands) “4” and “6” adjacent to the band “5” detected by the detection unit. As shown in FIG. 5 (6), the frequency spectrum power in the two adjacent bands “4” and “6” (in the example shown in FIG. 5, the unfilled bar) The permissible error power is corrected so that the permissible error power in the two adjacent bands “4” and “6” (in the example shown in FIG. 5, gray bars) is reduced.

そして、実施例１に係る符号化装置は、図５の（７）に示すように、量子化部が、電力補正部によって補正された許容誤差電力より大きな電力の周波数スペクトル各々を量子化する。例えば、量子化部は、図５の（７）に示す例では、トーン性のある周波数スペクトルが含まれる帯域「５」に含まれる周波数スペクトル各々と、電力補正部によって許容誤差電力が補正された帯域「４」と「６」とに含まれる周波数スペクトル各々とを、量子化する。 In the encoding apparatus according to the first embodiment, as illustrated in (7) of FIG. 5, the quantization unit quantizes each frequency spectrum of power larger than the allowable error power corrected by the power correction unit. For example, in the example shown in (7) of FIG. 5, in the quantization unit, each frequency spectrum included in the band “5” including the frequency spectrum with tone characteristics and the allowable error power are corrected by the power correction unit. Each frequency spectrum included in the bands “4” and “6” is quantized.

このようなことから、実施例１に係る符号化装置は、上記した主たる特徴の如く、トーン性のあるオーディオ信号の符号化音質を改善することが可能である。 For this reason, the encoding apparatus according to the first embodiment can improve the encoding sound quality of an audio signal having a tone characteristic as described above.

具体的には、ピーク電力近傍に存在する周波数スペクトル各々が量子化されるように、許容誤差電力を補正するので、ピーク電力近傍に存在する周波数スペクトル各々を確実に量子化することができ、トーン性のあるオーディオ信号の符号化音質を改善することが可能である。 Specifically, since the allowable error power is corrected so that each frequency spectrum existing near the peak power is quantized, each frequency spectrum existing near the peak power can be reliably quantized. It is possible to improve the quality of encoded audio signals.

［符号化装置の構成］
次に、図６〜図１１を用いて、図５に示した符号化装置の構成を説明する。ここで、図６は、実施例１に係る符号化装置の構成を示すためのブロック図である。図７は、実施例１におけるトーン検出部を説明するための図である。図８は、実施例１における心理聴覚分析部を説明するための図である。図９は、実施例１における許容誤差電力補正部を説明するための図である。図１０は、実施例１における許容誤差電力補正部を説明するための図である。図１１は、実施例１におけるスケール値補正部を説明するための図である。 [Configuration of Encoding Device]
Next, the configuration of the encoding device shown in FIG. 5 will be described with reference to FIGS. Here, FIG. 6 is a block diagram for illustrating a configuration of the encoding apparatus according to the first embodiment. FIG. 7 is a diagram for explaining the tone detection unit according to the first embodiment. FIG. 8 is a diagram for explaining the psychoacoustic analysis unit according to the first embodiment. FIG. 9 is a diagram for explaining an allowable error power correction unit according to the first embodiment. FIG. 10 is a diagram for explaining an allowable error power correction unit according to the first embodiment. FIG. 11 is a diagram for explaining the scale value correction unit according to the first embodiment.

図６に示すように、開示の符号化装置は、特に本発明に密接に関連するものとして、入力部１０１と、ＭＤＣＴ部１０２と、トーン検出部１０３と、心理聴覚分析部１０４と、許容誤差電力補正部１０５と、量子化帯域検出部１０６と、スケール値算出部１０７と、スケール値補正部１０８と、量子化部１０９と、符号化部１１０と、出力部１１１とを備える。 As shown in FIG. 6, the disclosed encoding device is particularly closely related to the present invention, and includes an input unit 101, an MDCT unit 102, a tone detection unit 103, a psychoacoustic analysis unit 104, and an allowable error. The power correction unit 105, the quantization band detection unit 106, the scale value calculation unit 107, the scale value correction unit 108, the quantization unit 109, the encoding unit 110, and the output unit 111 are provided.

なお、ＭＤＣＴ部１０２は、特許請求の範囲に記載の「周波数変換部」に対応する。心理聴覚分析部１０４は、特許請求の範囲に記載の「電力算出部」に対応する。トーン検出部１０３は、特許請求の範囲に記載の「検出部」に対応する。許容誤差電力補正部１０５は、特許請求の範囲に記載の「電力補正部」に対応する。量子化部１０９は、特許請求の範囲に記載の「量子化部」に対応する。スケール値算出部１０７は、特許請求の範囲に記載の「第１のスケール値算出部」と「第２のスケール値算出部」とに対応する。スケール値補正部１０８は、特許請求の範囲に記載の「第３のスケール値算出部」に対応する。 The MDCT unit 102 corresponds to a “frequency conversion unit” recited in the claims. The psychoacoustic analysis unit 104 corresponds to a “power calculation unit” recited in the claims. The tone detection unit 103 corresponds to the “detection unit” recited in the claims. The allowable error power correcting unit 105 corresponds to a “power correcting unit” recited in the claims. The quantization unit 109 corresponds to a “quantization unit” recited in the claims. The scale value calculation unit 107 corresponds to a “first scale value calculation unit” and a “second scale value calculation unit” described in the claims. The scale value correction unit 108 corresponds to a “third scale value calculation unit” recited in the claims.

入力部１０１は、オーディオ信号（符号化される音源）の入力を受付ける。なお、入力部１０１は、受付けたオーディオ信号を、後述するＭＤＣＴ部１０２と後述する心理聴覚分析部１０４とに伝達する。 The input unit 101 receives an input of an audio signal (a sound source to be encoded). The input unit 101 transmits the received audio signal to the MDCT unit 102 described later and the psychoacoustic analysis unit 104 described later.

ＭＤＣＴ部１０２は、入力部１０１から伝達されたオーディオ信号を周波数スペクトルに変換する。具体的には、ＭＤＣＴ部１０２は、入力部１０１から伝達されたオーディオ信号に、ＭＤＣＴ変換によって時間−周波数変換を行い、周波数スペクトルに変換する。ここで、時間−周波数変換とは、例えば、時間をパラメータとして表されている（例えば、横軸が「時間」で表される）オーディオ信号を、周波数をパラメータとして表される情報である（例えば、横軸が「周波数」で表される）周波数スペクトルに変換することを示す。 The MDCT unit 102 converts the audio signal transmitted from the input unit 101 into a frequency spectrum. Specifically, the MDCT unit 102 performs time-frequency conversion on the audio signal transmitted from the input unit 101 by MDCT conversion and converts the audio signal into a frequency spectrum. Here, the time-frequency conversion is, for example, an audio signal expressed using time as a parameter (for example, the horizontal axis is expressed as “time”) and information expressed using a frequency as a parameter (for example, , The horizontal axis represents the frequency spectrum).

また、ＭＤＣＴ部１０２は、変換した周波数スペクトルを、所定の周波数幅で分割した帯域ごとに、周波数スペクトルの電力を算出する。例えば、図７の（１）に示す例では、７つの帯域に分割し（帯域「０」から「６」）、帯域電力として、当該帯域に含まれる周波数スペクトル各々の電力の和を算出する。 In addition, the MDCT unit 102 calculates the power of the frequency spectrum for each band obtained by dividing the converted frequency spectrum by a predetermined frequency width. For example, in the example shown in (1) of FIG. 7, it is divided into seven bands (bands “0” to “6”), and the sum of the power of each frequency spectrum included in the band is calculated as band power.

また、ＭＤＣＴ部１０２は、変換した周波数スペクトルと帯域電力とを、後述するトーン検出部１０３と、後述する量子化帯域検出部１０６に伝達する。 Further, the MDCT unit 102 transmits the converted frequency spectrum and band power to a tone detection unit 103 described later and a quantization band detection unit 106 described later.

トーン検出部１０３は、ＭＤＣＴ部１０２によって変換された周波数スペクトルをＭＤＣＴ部１０２から受付けると、当該周波数スペクトルについてトーン性を分析し、トーン性のある周波数スペクトルを検出し、当該周波数スペクトルが含まれる帯域を検出する。 When the tone detector 103 receives the frequency spectrum converted by the MDCT unit 102 from the MDCT unit 102, the tone detector 103 analyzes the tone characteristic of the frequency spectrum, detects a frequency spectrum having the tone characteristic, and includes a band including the frequency spectrum. Is detected.

また、例えば、トーン検出部１０３は、図７の（２）に示すように、算出した帯域それぞれの電力から、全帯域における電力の平均値（言い換えると、周波数スペクトル全体の電力の平均値）を算出する。具体的に例をあげて説明すると、帯域の数（帯域分割数）がｂａｎｄ個（例えば、図７の（２）に示す図では、帯域の数は７個である）であり、各帯域電力が「Ｅｂａｎｄ」である場合に、トーン検出部１０３は、全帯域の周波数スペクトルの平均電力「Ｅａｖｅ」を、図７の（３）に示す式によって算出する。 Further, for example, as shown in (2) of FIG. 7, the tone detection unit 103 calculates an average value of power in all bands (in other words, average value of power of the entire frequency spectrum) from the calculated power of each band. calculate. Specifically, the number of bands (number of band divisions) is band (for example, the number of bands is 7 in the diagram shown in (2) of FIG. 7), and each band power Is “Eband”, the tone detector 103 calculates the average power “Eave” of the frequency spectrum of the entire band by the equation shown in (3) of FIG.

また、例えば、トーン検出部１０３は、図７の（４）に示すように、算出した全帯域における周波数スペクトルの平均電力を閾値として、閾値より大きい周波数スペクトルの平均電力を持つ帯域がある場合に、当該帯域をトーン性のある帯域であるとして検出する。具体的に例をあげて説明すると、トーン検出部１０３は、図７の（４）に示す例では、算出した周波数スペクトルの平均電力より大きな周波数スペクトルの平均電力を有する帯域である帯域「３」を、トーン性のある周波数スペクトルが含まれる帯域として検出する。 Further, for example, as shown in (4) of FIG. 7, the tone detection unit 103 uses a frequency spectrum average power in all the calculated bands as a threshold, and there is a band having a frequency spectrum average power larger than the threshold. The band is detected as a band having tone characteristics. Specifically, in the example shown in (4) of FIG. 7, the tone detection unit 103 uses a band “3” that is a band having a higher average power of the frequency spectrum than the calculated average power of the frequency spectrum. Is detected as a band including a frequency spectrum with tone characteristics.

また、トーン検出部１０３は、検出したトーン性のある周波数スペクトルが含まれる帯域を、後述する許容誤差電力補正部１０５と、後述するスケール値補正部１０８とに伝達する。例えば、トーン検出部１０３は、トーン性を検出したことを示すフラグ（ｔｏｎｅ＿ｆｌａｇ）と、検出した帯域を識別する情報（ｔｏｎｅ＿ｂａｎｄ、例えば、図７の（２）に示す例では、帯域「３」）を、後述する許容誤差電力補正部１０５と、後述するスケール値補正部１０８とに伝達する。なお、トーン検出部１０３は、トーン性のある周波数スペクトルが含まれる帯域を検出しなかった場合には、トーン性を検出したことを示すフラグ（ｔｏｎｅ＿ｆｌａｇ）と、検出した帯域を識別する情報とを伝達しない。 The tone detection unit 103 transmits a band including the detected frequency spectrum with tone characteristics to an allowable error power correction unit 105 described later and a scale value correction unit 108 described later. For example, the tone detection unit 103 includes a flag (tone_flag) indicating that tone characteristics have been detected, and information for identifying the detected band (tone_band, for example, the band “3” in the example illustrated in (2) of FIG. 7). Is transmitted to an allowable error power correcting unit 105 described later and a scale value correcting unit 108 described later. When the tone detection unit 103 does not detect a band including a frequency spectrum with tone characteristics, the tone detection unit 103 includes a flag (tone_flag) indicating that the tone characteristics are detected, and information for identifying the detected band. Do not communicate.

また、トーン検出部１０３は、ＭＤＣＴ部１０２から受付けた周波数スペクトルと帯域電力とを、後述する許容誤差電力補正部１０５に伝達する。 In addition, tone detecting section 103 transmits the frequency spectrum and band power received from MDCT section 102 to allowable error power correcting section 105 described later.

心理聴覚分析部１０４は、入力部１０１からオーディオ信号が伝達されると、当該オーディオ信号に応じて、許容誤差電力を算出する（前提技術参照）。具体的には、入力部１０１からオーディオ信号が伝達されると、心理聴覚分析部１０４は、オーディオ信号に含まれる周波数を所定の幅の帯域に分割し、分割した帯域それぞれについて、許容誤差電力を算出する。 When the audio signal is transmitted from the input unit 101, the psychoacoustic analysis unit 104 calculates an allowable error power according to the audio signal (see the prerequisite technology). Specifically, when the audio signal is transmitted from the input unit 101, the psychoacoustic analysis unit 104 divides the frequency included in the audio signal into a band of a predetermined width, and sets an allowable error power for each of the divided bands. calculate.

心理聴覚分析部１０４は、図８の（１）に示すように、入力部１０１から伝達されたオーディオ信号について、マスキング閾値を算出する。また、心理聴覚分析部１０４は、図８の（２）に示すように、算出したマスキング閾値を変換して、許容誤差電力を算出する。 The psychoacoustic analysis unit 104 calculates a masking threshold for the audio signal transmitted from the input unit 101 as shown in (1) of FIG. In addition, the psychoacoustic analysis unit 104 converts the calculated masking threshold and calculates the allowable error power, as shown in (2) of FIG.

なお、ここで、心理聴覚分析部１０４は、許容誤差電力を、帯域ごとに算出する。ここでいう「帯域」とは、ＭＤＣＴ部１０２によって用いられた帯域に対応するものである。言い換えると、心理聴覚分析部１０４は、ＭＤＣＴ部１０２によって帯域電力算出された帯域各々について、許容誤差電力を算出する。また、図８の（１）と（２）では、説明の便宜上、マスキング閾値と許容誤差電力とを、ＭＤＣＴ部１０２によって変換される周波数スペクトルに併せて（横軸が「周波数」である図を用いて）示している。 Here, the psychoacoustic analysis unit 104 calculates the allowable error power for each band. Here, the “band” corresponds to the band used by the MDCT unit 102. In other words, the psychoacoustic analysis unit 104 calculates an allowable error power for each band for which the band power is calculated by the MDCT unit 102. 8 (1) and (2), for convenience of explanation, the masking threshold and the allowable error power are combined with the frequency spectrum converted by the MDCT unit 102 (the horizontal axis is “frequency”). Show)

また、心理聴覚分析部１０４は、算出した許容誤差電力を、後述する許容誤差電力補正部１０５に伝達する。 The psychoacoustic analysis unit 104 transmits the calculated allowable error power to the allowable error power correction unit 105 described later.

許容誤差電力補正部１０５は、所定の帯域数を記憶する帯域数記憶部（図６には図示していない）を備え、図９に示すように、トーン検出部１０３から検出結果（「ｔｏｎｅ＿ｂａｎｄ」と「ｔｏｎｅ＿ｆｌａｇ」と）が伝達され（図９に示す例では、「検出結果」が該当する）、かつ、心理聴覚分析部１０４から許容誤差電力が伝達され、トーン検出部１０３から周波数スペクトル電力が伝達されると（図９に示す例では、「許容誤差電力」と「帯域電力」とが該当する）、当該検出結果と当該許容誤差電力とを用いて、許容誤差電力を補正する。なお、帯域数記憶部は、特許請求の範囲に記載の「帯域数記憶部」に対応する。 The allowable error power correction unit 105 includes a band number storage unit (not shown in FIG. 6) that stores a predetermined number of bands. As illustrated in FIG. 9, the detection result (“tone_band”) from the tone detection unit 103 is provided. And “tone_flag”) (in the example shown in FIG. 9, “detection result” corresponds), allowable error power is transmitted from the psychoacoustic analysis unit 104, and frequency spectrum power is transmitted from the tone detection unit 103. When transmitted (in the example shown in FIG. 9, “allowable error power” and “band power” correspond), the allowable error power is corrected using the detection result and the allowable error power. The band number storage unit corresponds to the “band number storage unit” recited in the claims.

具体的には、トーン検出部１０３によって検出された帯域に近傍する帯域について心理聴覚分析部１０４によって算出された許容誤差電力が、当該近傍する帯域についての帯域電力より小さくなるように補正する。 Specifically, the permissible error power calculated by the psychoacoustic analysis unit 104 for a band close to the band detected by the tone detection unit 103 is corrected so as to be smaller than the band power for the adjacent band.

例えば、許容誤差電力補正部１０５は、トーン検出部１０３によって検出されたトーン性のある周波数スペクトルが含まれる帯域を中心として、帯域数記憶部によって記憶された所定の帯域数の範囲に位置する帯域各々を、近傍の帯域として検出する。 For example, the allowable error power correcting unit 105 is a band located in a range of a predetermined number of bands stored in the number-of-bands storage unit with a band including a frequency spectrum having a tone characteristic detected by the tone detection unit 103 as a center. Each is detected as a nearby band.

例として、トーン検出部１０３によって「ｂ」番目の帯域が検出され、帯域数記憶部に記憶する所定の帯域幅が補正帯域幅「Ｂ」である場合を例に、具体的に説明する。例えば、許容誤差電力補正部１０５は、図１０の（１）に示すように、「ｂ」番目の帯域を中心として、「ｂ」帯域に隣接する「Ｂ」個の帯域を補正対象とする近傍の帯域として検出する。言い換えると、許容誤差電力補正部１０５は、「ｂ−Ｂ」番目から「ｂ＋Ｂ」番目までの帯域を、補正対象とする近傍の帯域として検出する。例えば、図１０の（１）に示す例では、「ｂ＝１６」「Ｂ＝４」であるので、許容誤差電力補正部１０５は、帯域「１２」から「２０」までの帯域を、補正対象とする近傍の帯域として検出する。 As an example, the case where the tone detection unit 103 detects the “b” -th band and the predetermined bandwidth stored in the number-of-bands storage unit is the correction bandwidth “B” will be specifically described as an example. For example, as shown in (1) of FIG. 10, the allowable error power correction unit 105 has a “b” band as a center and a “B” band adjacent to the “b” band as a correction target. It is detected as a band. In other words, the allowable error power correcting unit 105 detects the bands from the “b−B” th to the “b + B” th as the nearby bands to be corrected. For example, in the example shown in (1) of FIG. 10, since “b = 16” and “B = 4”, the allowable error power correction unit 105 selects the bands from “12” to “20” as the correction target. It is detected as a nearby band.

また、例えば、許容誤差電力補正部１０５は、図１０の（２）に示すように、検出した近傍の帯域について、許容誤差電力を補正する。具体的に例をあげて説明すると、図１０の（１）に示す例では、検出した近傍する帯域である帯域「１２」から「２０」までの（帯域「１６」を除く）補正前の許容誤差電力が、検出した近傍する帯域の帯域電力よりも大きい。このため、許容誤差電力補正部１０５は、図１０の（２）に示すように、検出した近傍する帯域である帯域「１２」から「２０」までの（帯域「１６」を除く）許容誤差電力が、周波数スペクトルの電力未満となるように一律に減衰させて補正する。その方法は、例えば、図１０の（３）に示すように、「ｂ−１」番目の帯域における補正後の許容誤差電力を「Ｍ´ｂ−１」、「ｂ−１」番目の帯域における補正前の許容誤差電力を「Ｍｂ−１」とした場合に、「Ｍ´ｂ−１＝ｇ×Ｍｂ−１」（減衰量「ｇ」＜１．０）を計算して求める。 Further, for example, the allowable error power correction unit 105 corrects the allowable error power for the detected nearby band as shown in (2) of FIG. Specifically, in the example shown in (1) of FIG. 10, the permissible before correction from the bands “12” to “20” (excluding the band “16”) that are detected neighboring bands. The error power is larger than the band power of the detected nearby band. Therefore, the allowable error power correction unit 105, as shown in (2) of FIG. 10, allows the allowable error power from the band “12” to “20” (excluding the band “16”) that is the detected nearby band. However, it correct | amends by attenuating uniformly so that it may become less than the electric power of a frequency spectrum. For example, as shown in FIG. 10 (3), the corrected error power in the “b−1” th band is set to “M′b−1” and “b−1” th band. When the allowable error power before correction is “Mb−1”, “M′b−1 = g × Mb−1” (attenuation amount “g” <1.0) is calculated.

また、許容誤差電力補正部１０５は、心理聴覚分析部１０４によって算出された許容誤差電力と、補正した許容誤差電力とを、後述する量子化帯域検出部１０６に伝達する。なお、許容誤差電力補正部１０５は、トーン検出部１０３から、トーン性を検出したことを示すフラグ（ｔｏｎｅ＿ｆｌａｇ）と、検出した帯域を識別する情報とが伝達されなかった場合には、許容誤差電力を補正する処理を行わず、心理聴覚分析部１０４によって算出された許容誤差電力を、後述する量子化帯域検出部１０６に伝達する。 In addition, the allowable error power correction unit 105 transmits the allowable error power calculated by the psychoacoustic analysis unit 104 and the corrected allowable error power to the quantization band detection unit 106 described later. The allowable error power correcting unit 105 receives the allowable error power when the tone detection unit 103 does not transmit the flag (tone_flag) indicating that the tone property is detected and the information for identifying the detected band. The permissible error power calculated by the psychoacoustic analysis unit 104 is transmitted to the quantization band detection unit 106, which will be described later.

量子化帯域検出部１０６は、ＭＤＣＴ部１０２から周波数スペクトルが伝達され、許容誤差電力補正部１０５から許容誤差電力（許容誤差電力補正部１０５が補正した許容誤差電力を含む）が伝達されると、周波数スペクトルの帯域の内、量子化する帯域を検出する。 When the frequency band is transmitted from the MDCT unit 102 and the allowable error power (including the allowable error power corrected by the allowable error power correction unit 105) is transmitted from the allowable error power correction unit 105 to the quantization band detection unit 106, Among the frequency spectrum bands, a band to be quantized is detected.

具体的には、量子化帯域検出部１０６は、ＭＤＣＴ部１０２から伝達された帯域電力と、許容誤差電力補正部１０５から伝達された許容誤差電力とを帯域ごとに比較し、量子化対象とする帯域を決定する。さらに詳細には、量子化帯域検出部１０６は、許容誤差電力補正部１０５によって補正された許容誤差電力がある帯域については、当該補正された許容誤差電力と当該帯域の帯域電力とを比較する。また、量子化帯域検出部１０６は、許容誤差電力補正部１０５によって補正された許容誤差電力がない帯域については、心理聴覚分析部１０４によって算出された許容誤差電力と当該帯域の帯域電力とを比較する。また、量子化帯域検出部１０６は、許容誤差電力より大きな帯域電力を示す帯域各々を、量子化する対象の帯域として検出する。また、量子化帯域検出部１０６は、検出した帯域を識別する情報を検出する。 Specifically, the quantization band detection unit 106 compares the band power transmitted from the MDCT unit 102 with the allowable error power transmitted from the allowable error power correction unit 105 for each band, and sets the quantization target. Determine the bandwidth. More specifically, for the band having the allowable error power corrected by the allowable error power correction unit 105, the quantization band detection unit 106 compares the corrected allowable error power with the band power of the band. Also, the quantization band detection unit 106 compares the allowable error power calculated by the psychoacoustic analysis unit 104 and the band power of the band for the band without the allowable error power corrected by the allowable error power correction unit 105. To do. Further, the quantization band detecting unit 106 detects each band indicating band power larger than the allowable error power as a band to be quantized. Further, the quantization band detection unit 106 detects information for identifying the detected band.

また、量子化帯域検出部１０６は、検出した量子化する帯域を識別する情報と、許容誤差電力補正部１０５から伝達された許容誤差電力と、ＭＤＣＴ部１０２から伝達された周波数スペクトルとを、後述するスケール値算出部１０７に伝達する。 The quantization band detection unit 106 also includes information for identifying the detected band to be quantized, the allowable error power transmitted from the allowable error power correction unit 105, and the frequency spectrum transmitted from the MDCT unit 102, which will be described later. To the scale value calculation unit 107.

スケール値算出部１０７は、量子化する帯域を識別する情報と許容誤差電力と周波数スペクトルとが量子化帯域検出部１０６から伝達されると、量子化誤差電力が許容誤差電力未満になるようなスケール値を、帯域ごとに算出する。 The scale value calculation unit 107 is a scale that allows the quantization error power to be less than the allowable error power when the information for identifying the band to be quantized, the allowable error power, and the frequency spectrum are transmitted from the quantization band detection unit 106. A value is calculated for each band.

なお、スケール値算出部１０７は、トーン検出部１０３によって検出されたトーン性のある周波数スペクトルが含まれる帯域に近傍する帯域について、許容誤差電力補正部１０５によって許容誤差電力が補正された場合には、当該近傍する帯域について、当該補正された許容誤差電力未満となるようなスケール値を算出する。 Note that the scale value calculation unit 107, when the permissible error power is corrected by the permissible error power correction unit 105 for the band near the band including the frequency spectrum with tone characteristics detected by the tone detection unit 103, Then, a scale value is calculated for the adjacent bands so as to be less than the corrected allowable error power.

また、スケール値算出部１０７は、量子化する帯域を識別する情報と許容誤差電力と周波数スペクトルと帯域ごとに算出したスケール値とを、後述するスケール値補正部１０８に伝達する。 The scale value calculation unit 107 transmits information for identifying the band to be quantized, allowable error power, frequency spectrum, and scale value calculated for each band to the scale value correction unit 108 described later.

スケール値補正部１０８は、図１１の（１）に示すように、トーン性のある周波数スペクトルが含まれる帯域がトーン検出部１０３から伝達され（図１１に示す例では、「トーン性検出結果」が該当する）、かつ、量子化する帯域を識別する情報と、許容誤差電力と、周波数スペクトルと、帯域ごとのスケール値とがスケール値算出部１０７から伝達されると（図１１に示す例では、「周波数スペクトル」が該当する）、トーン性のある周波数スペクトルが含まれる帯域のスケール値を補正する。例えば、スケール値補正部１０８は、トーン検出部１０３によって検出されたトーン性のある周波数スペクトルが含まれる帯域を構成する周波数スペクトル各々の内、最も大きな当該周波数スペクトルから得られる量子化値が量子化値の最大の値となるようなスケール値に、当該帯域についての当該スケール値を補正する。 As shown in (1) of FIG. 11, the scale value correction unit 108 receives a band including a frequency spectrum having tone characteristics from the tone detection unit 103 (in the example illustrated in FIG. 11, “tone detection result”). 11), and information for identifying the band to be quantized, allowable error power, frequency spectrum, and scale value for each band are transmitted from the scale value calculation unit 107 (in the example shown in FIG. 11) , “Frequency spectrum”), the scale value of the band including the frequency spectrum with tone characteristics is corrected. For example, the scale value correcting unit 108 quantizes the quantized value obtained from the largest frequency spectrum in each frequency spectrum constituting the band including the tone-specific frequency spectrum detected by the tone detecting unit 103. The scale value for the band is corrected to the scale value that is the maximum value.

ここで、トーン性のある周波数スペクトルが含まれる帯域が帯域「ｂ」であり、当該帯域「ｂ」についてスケール値算出部１０７によって算出されたスケール値を「Ｓｂ」である場合を例に、具体的に説明する。スケール値補正部１０８は、帯域「ｂ」に含まれる最大周波数スペクトルである「ｍａｘ＿ｐｏｗ＿ｓｐｅｃ」を探索して検出する（図９に示す例では、「最大周波数スペクトル探索」が該当する）。なお、ここでいう「最大周波数スペクトル」とは、トーン性のある周波数スペクトルが含まれる帯域を構成する周波数スペクトル各々の内、電力が最も大きい周波数スペクトルを示す。 Here, a case where the band including the frequency spectrum with tone characteristics is the band “b” and the scale value calculated by the scale value calculation unit 107 for the band “b” is “Sb” as an example. I will explain it. The scale value correction unit 108 searches for and detects “max_pow_spec” that is the maximum frequency spectrum included in the band “b” (in the example illustrated in FIG. 9, “maximum frequency spectrum search” corresponds). Here, the “maximum frequency spectrum” refers to a frequency spectrum having the largest power among the frequency spectra constituting the band including the frequency spectrum with tone characteristics.

また、例えば、スケール値補正部１０８は、最大周波数スペクトルを検出すると、当該最大周波数スペクトルを量子化することによって得られる量子化値が、量子化値の最大値である「ＭＡＸ＿ＱＵＡＮＴ」となるようなスケール値「Ｓ‘ｂ」を算出し（図９に示す例では、「補正スケール値算出」が該当する）、トーン検出部１０３によって検出されたトーン性のある周波数スペクトルが含まれる帯域のスケール値とする。例えば、図１１の（２）に示す式に従って、スケール値「Ｓｂ」を、算出したスケール値「Ｓ‘ｂ」に置き換える（スケール値「Ｓｂ」をスケール値「Ｓ‘ｂ」に補正する）。なお、量子化値の最大値とは、符号化技術の規格で定められている値であり、ＡＡＣ方式では、ＭＡＸ＿ＱＵＡＮＴ＝８１９１である。 For example, when the scale value correcting unit 108 detects the maximum frequency spectrum, the quantized value obtained by quantizing the maximum frequency spectrum becomes “MAX_QUANT” that is the maximum value of the quantized value. A scale value “S′b” is calculated (in the example shown in FIG. 9, “corrected scale value calculation” corresponds), and a scale value of a band including a frequency spectrum having a tone characteristic detected by the tone detection unit 103 is included. And For example, the scale value “Sb” is replaced with the calculated scale value “S′b” in accordance with the equation shown in (2) of FIG. 11 (the scale value “Sb” is corrected to the scale value “S′b”). Note that the maximum value of the quantized value is a value determined by the encoding technology standard, and MAX_QUANT = 8191 in the AAC scheme.

また、スケール値補正部１０８は、量子化する帯域を識別する情報と許容誤差電力と周波数スペクトルと帯域ごとのスケール値（スケール値補正部１０８が算出したトーン性のある周波数スペクトルが含まれる帯域のスケール値を含む）とを、後述する量子化部１０９に伝達する。 The scale value correcting unit 108 also includes information for identifying a band to be quantized, allowable error power, frequency spectrum, and a scale value for each band (for a band including a frequency spectrum having a tone characteristic calculated by the scale value correcting unit 108). (Including the scale value) is transmitted to the quantization unit 109 described later.

量子化部１０９は、量子化する帯域を識別する情報と許容誤差電力と周波数スペクトルと帯域ごとのスケール値とがスケール値補正部１０８から伝達されると、許容誤差電力より大きな電力の周波数スペクトル各々を量子化する。具体的には、量子化部１０９は、量子化帯域検出部１０６によって検出された帯域（量子化する帯域を識別する情報によって識別される帯域）各々について、周波数スペクトルのダイナミックレンジをスケール値によって一意に特定されるダイナミックレンジに縮小し、縮小したダイナミックレンジにおける帯域各々を構成する周波数スペクトルそれぞれを量子化する。 When the quantizing unit 109 receives information for identifying the band to be quantized, allowable error power, frequency spectrum, and scale value for each band from the scale value correcting unit 108, each frequency spectrum of power greater than the allowable error power Quantize Specifically, the quantization unit 109 uniquely determines the dynamic range of the frequency spectrum by the scale value for each of the bands (bands identified by information for identifying the band to be quantized) detected by the quantization band detection unit 106. And the frequency spectrum constituting each band in the reduced dynamic range is quantized.

さらに詳細には、量子化部１０９は、スケール値補正部１０８によってスケール値が算出された帯域に含まれる周波数スペクトル各々を、スケール値補正部１０８によって算出されたスケール値を用いて量子化し、スケール値補正部１０８によってスケール値が算出されていない帯域に含まれる周波数スペクトル各々を、スケール値算出部１０８によって算出されたスケール値を用いて量子化する。 More specifically, the quantization unit 109 quantizes each frequency spectrum included in the band for which the scale value is calculated by the scale value correction unit 108 using the scale value calculated by the scale value correction unit 108, Each frequency spectrum included in the band for which the scale value is not calculated by the value correction unit 108 is quantized using the scale value calculated by the scale value calculation unit 108.

なお、ここで、量子化部１０９は、スケール値算出部１０７やスケール値補正部１０８によって算出されたスケール値を用いて、帯域単位に（帯域ごとに）ダイナミックレンジを変更するが、その後、量子化を実行する際には、帯域単位に行うのではなく、当該帯域各々を構成する周波数スペクトル単位ごと（周波数スペクトルごとに）に量子化を行う。つまり、量子化部１０９は、周波数スペクトル一つ一つに対して、量子化値を一つずつ得る。 Here, the quantization unit 109 changes the dynamic range in units of bands (for each band) using the scale values calculated by the scale value calculation unit 107 and the scale value correction unit 108. When performing the quantization, the quantization is not performed for each band but for each frequency spectrum unit (for each frequency spectrum) constituting each band. That is, the quantization unit 109 obtains one quantization value for each frequency spectrum.

また、量子化部１０９は、量子化して得られた量子化値とスケール値とを、後述する符号化部１１０に伝達する。 Further, the quantization unit 109 transmits the quantized value and the scale value obtained by the quantization to the encoding unit 110 described later.

符号化部１１０は、量子化値とスケール値とが量子化部１０９から伝達されると、量子化値とスケール値とを符号化する。例えば、符号化部１１０は、ハフマン符号化を用いて量子化値とスケール値とをそれぞれ符号化する。また、符号化部１１０は、符号化した情報を、後述する出力部１１１に伝達する。 When the quantization value and the scale value are transmitted from the quantization unit 109, the encoding unit 110 encodes the quantization value and the scale value. For example, the encoding unit 110 encodes the quantized value and the scale value using Huffman coding. Also, the encoding unit 110 transmits the encoded information to the output unit 111 described later.

出力部１１１は、符号化部１１０から符号化した情報が伝達されると、入力部１０１によって入力されたオーディオ信号を符号化した情報として、符号化部１１０から伝達された情報を出力する。 When the encoded information is transmitted from the encoding unit 110, the output unit 111 outputs the information transmitted from the encoding unit 110 as information obtained by encoding the audio signal input by the input unit 101.

なお、この符号化装置は、既知のパーソナルコンピュータ、ワークステーション、携帯電話、ＰＨＳ端末、移動体通信端末またはＰＤＡなどの情報処理装置に、上記したＭＤＣＴ部１０２、トーン検出部１０３、心理聴覚分析部１０４、許容誤差電力補正部１０５、量子化帯域検出部１０６、スケール値算出部１０７、スケール値補正部１０８、量子化部１０９の各機能を搭載することによって実現することもできる。 Note that this encoding apparatus is an information processing apparatus such as a known personal computer, workstation, mobile phone, PHS terminal, mobile communication terminal, or PDA, and the above-described MDCT section 102, tone detection section 103, psychoacoustic analysis section. It can also be realized by mounting the functions of 104, allowable error power correction unit 105, quantization band detection unit 106, scale value calculation unit 107, scale value correction unit 108, and quantization unit 109.

［符号化装置による処理］
次に、図１２と図１３とを用いて、符号化装置による処理を説明する。ここでは、まず、図１２を用いて符号化装置による全体処理の流れを説明し、その後、図１３を用いて、スケール値補正部１０８による処理の流れを説明する。なお、図１２は、実施例１に係る符号化装置の処理全体の流れを示すフローチャートであり、図１３は、実施例１に係るスケール値補正部の処理の流れを示すフローチャートである。 [Processing by encoding device]
Next, processing performed by the encoding device will be described with reference to FIGS. 12 and 13. Here, first, the flow of overall processing by the encoding apparatus will be described with reference to FIG. 12, and then the flow of processing by the scale value correction unit 108 will be described with reference to FIG. FIG. 12 is a flowchart showing the overall process flow of the encoding apparatus according to the first embodiment, and FIG. 13 is a flowchart showing the process flow of the scale value correcting unit according to the first embodiment.

［符号化装置による全体処理］
図１２に示すように、開示の符号化装置では、オーディオ信号があると（ステップＳ１０１肯定）、つまり、入力部１０１によってオーディオ信号が受付けられると、ＭＤＣＴ部１０２は、ＭＤＣＴ変換を行う（ステップＳ１０２）。つまり、ＭＤＣＴ部１０２は、入力部１０１から伝達されたオーディオ信号を周波数スペクトルに変換する。そして、ＭＤＣＴ編幹部１０２は、帯域を分割し（ステップＳ１０３）、帯域電力を算出する（ステップＳ１０４）。つまり、ＭＤＣＴ部１０２は、周波数スペクトル電力を算出し、さらに所定の幅で分割した帯域ごとに、周波数スペクトル電力の和を算出する。 [Entire processing by the encoding device]
As shown in FIG. 12, in the disclosed encoding apparatus, when there is an audio signal (Yes at Step S101), that is, when the audio signal is received by the input unit 101, the MDCT unit 102 performs MDCT conversion (Step S102). ). That is, the MDCT unit 102 converts the audio signal transmitted from the input unit 101 into a frequency spectrum. Then, the MDCT trunk unit 102 divides the band (step S103) and calculates the band power (step S104). That is, the MDCT unit 102 calculates the frequency spectrum power and further calculates the sum of the frequency spectrum power for each band divided by a predetermined width.

そして、トーン検出部１０３は、トーン性のある周波数スペクトルが含まれる帯域を検出する（ステップＳ１０５）。つまり、トーン検出部１０３は、全帯域における周波数スペクトルの平均電力を閾値として、閾値より大きい周波数スペクトルの電力を持つ帯域がある場合に、当該帯域をトーン性が高い帯域であるとして検出する。 Then, the tone detection unit 103 detects a band including a frequency spectrum with tone characteristics (step S105). That is, the tone detection unit 103 detects the band as a band having high tone characteristics when there is a band having a frequency spectrum power larger than the threshold with the average power of the frequency spectrum in all bands as a threshold.

そして、心理聴覚分析部１０４は、許容誤差電力を算出する（ステップＳ１０６）。つまり、入力部１０１からオーディオ信号が伝達されると、当該オーディオ信号に応じて、許容誤差電力を算出する。 Then, the psychoacoustic analysis unit 104 calculates allowable error power (step S106). That is, when an audio signal is transmitted from the input unit 101, an allowable error power is calculated according to the audio signal.

ここで、トーンがある場合には（ステップＳ１０７肯定）、言い換えると、上記したステップＳ１０５において、トーン検出部１０３によってトーン性のある帯域が検出された場合には、許容誤差電力補正部１０５は、許容誤差電力を補正する（ステップＳ１０８）。つまり、許容誤差電力補正部１０５は、トーン検出部１０３から検出結果が伝達された場合には、近傍する帯域についての許容誤差電力を補正する。例えば、許容誤差電力補正部１０５は、近傍する帯域についての許容誤差電力を、近傍の帯域についての帯域電力よりも小さい許容誤差電力に補正する。 Here, when there is a tone (Yes in Step S107), in other words, in the above-described Step S105, when the tone detecting unit 103 detects a band having tone characteristics, the allowable error power correcting unit 105 The allowable error power is corrected (step S108). That is, when the detection result is transmitted from the tone detection unit 103, the allowable error power correction unit 105 corrects the allowable error power for the adjacent bands. For example, the allowable error power correction unit 105 corrects the allowable error power for the adjacent band to an allowable error power smaller than the band power for the adjacent band.

そして、許容誤差電力補正部１０５によって許容誤差電力が補正され（ステップＳ１０８）、または、トーンがない場合には（ステップＳ１０７否定）、量子化帯域検出部１０６は、量子化帯域を検出する（ステップＳ１０９）。つまり、量子化帯域検出部１０６は、ＭＤＣＴ部１０２から周波数スペクトルが伝達され、許容誤差電力補正部１０５から許容誤差電力が伝達されると、周波数スペクトルの帯域の内、量子化する帯域を検出する。 Then, the allowable error power is corrected by the allowable error power correction unit 105 (step S108), or when there is no tone (No in step S107), the quantization band detection unit 106 detects the quantization band (step S108). S109). That is, when the frequency spectrum is transmitted from the MDCT unit 102 and the allowable error power is transmitted from the allowable error power correcting unit 105, the quantization band detecting unit 106 detects a band to be quantized among the frequency spectrum bands. .

そして、スケール値算出部１０７は、スケール値を算出する（ステップＳ１１０）。つまり、量子化する帯域を識別する情報と許容誤差電力と周波数スペクトルとが量子化帯域検出部１０６から伝達されると、量子化誤差電力が許容誤差電力未満になるようなスケール値を、帯域ごとに算出する。 Then, the scale value calculation unit 107 calculates a scale value (step S110). That is, when the information for identifying the band to be quantized, the allowable error power, and the frequency spectrum are transmitted from the quantization band detection unit 106, a scale value that makes the quantization error power less than the allowable error power is determined for each band. To calculate.

ここで、トーンがある場合には（ステップＳ１１１肯定）、スケール値補正部１０８は、スケール値を補正する（ステップＳ１１２）。つまり、トーン性のある周波数スペクトルが含まれる帯域がトーン検出部１０３から伝達され、かつ、量子化する帯域を識別する情報と許容誤差電力と周波数スペクトルと帯域ごとのスケール値とがスケール値算出部１０７から伝達されると、スケール値補正部１０８は、トーン性のある周波数スペクトルが含まれる帯域のスケール値を補正する。 If there is a tone (Yes at Step S111), the scale value correcting unit 108 corrects the scale value (Step S112). That is, a band including a frequency spectrum having tone characteristics is transmitted from the tone detection unit 103, and information for identifying the band to be quantized, allowable error power, frequency spectrum, and scale value for each band are scale value calculation units. When transmitted from 107, the scale value correcting unit 108 corrects the scale value of the band including the frequency spectrum with tone characteristics.

そして、スケール値補正部１０８によってスケール値が補正され（ステップＳ１１２）、または、トーンがない場合には（ステップＳ１１１否定）、量子化部１０９は、周波数スペクトルを量子化する（ステップＳ１１３）。つまり、量子化部１０９は、量子化する帯域を識別する情報と許容誤差電力と周波数スペクトルと帯域ごとのスケール値とがスケール値補正部１０８から伝達されると、量子化帯域検出部１０６によって検出された帯域各々の周波数スペクトル各々を量子化する。 Then, the scale value is corrected by the scale value correcting unit 108 (step S112), or when there is no tone (No at step S111), the quantizing unit 109 quantizes the frequency spectrum (step S113). In other words, when the quantization unit 109 receives information for identifying a band to be quantized, allowable error power, frequency spectrum, and scale value for each band from the scale value correction unit 108, the quantization unit 109 detects the quantization band. Each frequency spectrum of each of the bands thus obtained is quantized.

そして、符号化部１１０は、符号化する（ステップＳ１１４）。つまり、符号化部１１０は、量子化して得られた量子化値が量子化部１０９から伝達されると、量子化値を符号化する。 Then, the encoding unit 110 performs encoding (step S114). That is, when the quantization value obtained by the quantization is transmitted from the quantization unit 109, the encoding unit 110 encodes the quantized value.

［スケール値補正部による処理］
図１３に示すように、開示の符号化装置では、スケール値が補正される場合には（ステップＳ２０１肯定）、つまり、トーン性のある周波数スペクトルが含まれる帯域がトーン検出部１０３から伝達され、かつ、量子化する帯域を識別する情報と許容誤差電力と周波数スペクトルと帯域ごとのスケール値とがスケール値算出部１０７から伝達されると、スケール値補正部１０８は、最大周波数スペクトルを検出する（ステップＳ２０２）。 [Processing by scale value correction unit]
As shown in FIG. 13, in the disclosed encoding device, when the scale value is corrected (Yes in step S201), that is, a band including a frequency spectrum with tone characteristics is transmitted from the tone detection unit 103, When the information for identifying the band to be quantized, the allowable error power, the frequency spectrum, and the scale value for each band are transmitted from the scale value calculation unit 107, the scale value correction unit 108 detects the maximum frequency spectrum ( Step S202).

そして、例えば、スケール値補正部１０８は、量子化値が最大になる場合のスケール値を算出する（ステップＳ２０３）。つまり、スケール値補正部１０８は、トーン検出部１０３によって検出されたトーン性のある周波数スペクトルが含まれる帯域を構成する当該周波数スペクトル各々の内、最も大きな当該周波数スペクトルから得られる値が最大の値となるようなスケール値を算出する。 Then, for example, the scale value correcting unit 108 calculates a scale value when the quantized value is maximized (step S203). That is, the scale value correction unit 108 has the maximum value obtained from the largest frequency spectrum among the frequency spectra constituting the band including the frequency spectrum having the tone characteristic detected by the tone detection unit 103. Calculate a scale value such that

そして、スケール値補正部１０８は、スケール値を補正する（ステップＳ２０４）。つまり、トーン性がある周波数スペクトルが検出された帯域について算出したスケール値に、スケール値算出部１０７によって算出されたスケール値を補正する。 Then, the scale value correction unit 108 corrects the scale value (step S204). That is, the scale value calculated by the scale value calculation unit 107 is corrected to the scale value calculated for the band in which the frequency spectrum having tone characteristics is detected.

［実施例１の効果］
上記したように、実施例１によれば、開示の符号化装置は、オーディオ信号を周波数スペクトルに変換し、オーディオ信号の周波数を所定の幅で分割した帯域それぞれについて、許容誤差電力を算出し、周波数スペクトルから、トーン性のある周波数スペクトルを検出するとともに当該周波数スペクトルが含まれる帯域を検出し、検出結果と許容誤差電力とを用いて、検出部によって検出された帯域に近傍する帯域について算出された許容誤差電力が、当該近傍する帯域についての周波数スペクトルの電力より小さくなるように補正し、補正された許容誤差電力より大きな電力の周波数スペクトル各々を量子化するので、トーン性のあるオーディオ信号の符号化音質を改善することが可能である。 [Effect of Example 1]
As described above, according to the first embodiment, the disclosed encoding device converts the audio signal into a frequency spectrum, calculates the allowable error power for each band obtained by dividing the frequency of the audio signal by a predetermined width, A frequency spectrum with tone characteristics is detected from the frequency spectrum, a band including the frequency spectrum is detected, and a band close to the band detected by the detection unit is calculated using the detection result and the allowable error power. The allowable error power is corrected to be smaller than the power of the frequency spectrum for the neighboring band, and each frequency spectrum of the power larger than the corrected allowable error power is quantized. It is possible to improve the encoded sound quality.

つまり、従来の手法では、トーン性のあるオーディオ信号を符号化する際に、トーン性のある周波数スペクトル近傍の周波数スペクトルを確実に量子化することができず、近傍の周波数スペクトルが失われ、その結果、図１４の（１−１）に示すような原音が、図１４の（１−２）に示すように、符号化音の位相特性が歪むことによって、振幅が変動し、音が震えることがあった。 In other words, in the conventional method, when encoding a tonal audio signal, the frequency spectrum in the vicinity of the tonal frequency spectrum cannot be reliably quantized, and the nearby frequency spectrum is lost. As a result, the original sound as shown in (1-1) of FIG. 14 is distorted in the phase characteristics of the encoded sound as shown in (1-2) of FIG. was there.

また、例えば、従来の手法では、振幅が変化して、オーバーフローすることによって（例えば、ＰＣＭの最大値である（１６ｂｉｔ）を超えることによって）、クリップが発生してしまった。これにより、図１４の（２）に示すように、異音（チリチリ）が発生していた。また、図１４の（１−２）に示すように、振幅が変化することにより、ふるえ感が発生していた。 Further, for example, in the conventional method, the clip is generated due to the amplitude changing and overflowing (for example, exceeding the maximum value (16 bits) of PCM). As a result, as shown in (2) of FIG. Further, as shown in FIG. 14 (1-2), a feeling of trembling occurred due to the change in amplitude.

このような従来の手法と比較して、開示の符号化装置によれば、図１５に示すように、トーン性のある周波数スペクトル近傍の周波数スペクトルを確実に量子化することができるので、図１６の（１）に示す原音を符号化する際に、図１６の（２）に示すように、従来の手法による量子化では発生していた異音（チリチリ）の発生やふるえ感が、図１６の（３）に示すように、改善され、トーン性のあるオーディオ信号の符号化音質を改善することが可能である。 Compared to such a conventional method, according to the disclosed encoding apparatus, as shown in FIG. 15, the frequency spectrum in the vicinity of the frequency spectrum with tone characteristics can be reliably quantized. When the original sound shown in (1) of FIG. 16 is encoded, as shown in (2) of FIG. As shown in (3), it is possible to improve the encoded sound quality of an audio signal having tone characteristics.

また、実施例１によれば、開示の符号化装置は、スケール値補正部１０８が、検出されたトーン性のある周波数スペクトルが含まれる帯域に近傍する帯域について、当該近傍する帯域に含まれる周波数スペクトルを量子化する際に発生する誤差である量子化誤差から算出される量子化誤差電力が、当該近傍する帯域について許容誤差電力補正部１０５によって当該近傍する帯域について算出された許容誤差電力未満となるようなスケール値を、当該近傍する帯域についての当該スケール値として算出し、量子化部１０９が、スケール値補正部１０８によってスケール値が算出された帯域に含まれる周波数スペクトル各々を、スケール値補正部１０８によって算出されたスケール値を用いて量子化するので、許容誤差電力を補正した場合にも、適切なスケール値を用いて量子化を行うことが可能である。 Further, according to the first embodiment, the disclosed encoding apparatus is configured such that the scale value correcting unit 108 uses the frequency included in the adjacent band for the band adjacent to the band including the detected frequency spectrum with tone characteristics. The quantization error power calculated from the quantization error that is an error that occurs when the spectrum is quantized is less than the allowable error power calculated for the adjacent band by the allowable error power correction unit 105 for the adjacent band. And the quantization unit 109 performs scale value correction on each frequency spectrum included in the band for which the scale value is calculated by the scale value correction unit 108. Since the quantization is performed using the scale value calculated by the unit 108, even when the allowable error power is corrected, It is possible to perform quantization using the switching scale value.

また、実施例１によれば、開示の符号化装置は、トーン検出部１０３によって検出されたトーン性のある周波数スペクトルが含まれる帯域を構成する当該周波数スペクトル各々の内、最も大きな当該周波数スペクトルから得られる値が最大の値となるようなスケール値を、当該帯域についての当該スケール値として算出するので、量子化誤差を最低限に抑えることが可能である。具体的には、トーン性のあるピークから得られる量子化値が、規格上設定されている最大値をとることにより、量子化誤差を最小限に最低限に抑えることができる。 In addition, according to the first embodiment, the disclosed encoding device uses the largest frequency spectrum among the frequency spectrums constituting the band including the frequency spectrum having the tone property detected by the tone detection unit 103. Since the scale value at which the obtained value is the maximum value is calculated as the scale value for the band, it is possible to minimize the quantization error. Specifically, the quantization value obtained from the tone-like peak takes the maximum value set according to the standard, whereby the quantization error can be minimized.

また、実施例１によれば、開示の符号化装置は、所定の帯域数を記憶し、検出されたトーン性のある周波数スペクトルが含まれる帯域を中心として、記憶する所定の帯域数の範囲に位置する帯域各々を、近傍の帯域として許容誤差電力を補正するので、簡単に許容誤差電力を補正する帯域を検出することが可能である。 Also, according to the first embodiment, the disclosed encoding device stores a predetermined number of bands, and the range of the predetermined number of bands to be stored is centered on a band including the detected frequency spectrum with tone characteristics. Since the permissible error power is corrected using each of the positioned bands as neighboring bands, it is possible to easily detect the band for correcting the allowable error power.

ところで、実施例１では、開示の符号化装置は、スケール値補正部１０８が、トーン検出部１０３によって検出された帯域のスケール値を、当該帯域を構成する周波数スペクトル各々の内、最も大きな周波数スペクトルを量子化して得られる値が規格上の最大値となるようなスケール値にする手法について説明したが、本発明はこれに限定されるものではない。例えば、実施例２に係る符号化装置は、発生する量子化誤差電力の少ないスケール値を探索し、探索により得られたスケール値を用いてもよい。 By the way, in the first embodiment, in the disclosed encoding device, the scale value correction unit 108 uses the scale value of the band detected by the tone detection unit 103 as the largest frequency spectrum among the frequency spectra constituting the band. Although the method of setting the scale value so that the value obtained by quantizing the value becomes the maximum value in the standard has been described, the present invention is not limited to this. For example, the encoding apparatus according to the second embodiment may search for a scale value with less generated quantization error power and use the scale value obtained by the search.

そこで、実施例２では、トーン検出部１０３によって検出された帯域のスケール値として、量子化誤差電力の少ないスケール値を探索し、探索により得られたスケール値を用いる符号化装置について説明する。すなわち、実施例２における符号化装置は、スケール値として、スケール値算出部１０７によって算出されたスケール値と、スケール値を所定の値変更した当該スケール値である変更スケール値の内、量子化する際に発生する量子化誤差（または、量子化誤差電力）を少なくするスケール値を用いるものである。なお、以下では、実施例１に係る符号化装置と同様の点については、簡単に説明し、または、説明を省略する。 Therefore, in the second embodiment, an encoding apparatus that searches for a scale value with a small quantization error power as a band scale value detected by the tone detection unit 103 and uses the scale value obtained by the search will be described. That is, the encoding apparatus according to the second embodiment quantizes the scale value calculated by the scale value calculation unit 107 as the scale value and the changed scale value that is the scale value obtained by changing the scale value by a predetermined value. A scale value that reduces the quantization error (or quantization error power) that occurs at the time is used. In the following, the same points as those of the encoding apparatus according to the first embodiment will be described briefly or the description thereof will be omitted.

実施例２に係る符号化装置では、スケール値補正部１０８（特許請求の範囲に記載の「誤差算出部」に対応する）は、帯域に含まれる当該周波数スペクトルを量子化する際に発生する量子化誤差電力を、当該帯域についてスケール値算出部１０７によって算出されたスケール値を用いて算出し、かつ、スケール値算出部１０７によって算出されたスケール値を所定の値変更した当該スケール値である変更スケール値を用いて算出する。 In the encoding apparatus according to the second embodiment, the scale value correcting unit 108 (corresponding to the “error calculating unit” recited in the claims) is configured to generate a quantum generated when the frequency spectrum included in the band is quantized. The change error power is calculated by using the scale value calculated by the scale value calculation unit 107 for the band and the scale value calculated by the scale value calculation unit 107 is changed to a predetermined value. Calculate using the scale value.

このため、実施例２におけるスケール値補正部１０８は、図１７に示すように、実施例１におけるスケール値補正部１０８とは異なり（図１１の（１）参照）、許容誤差電力（実施例２では、許容誤差電力補正部１０５によって補正された許容誤差電力と、補正される前の許容誤差電力と）を用いるものである。 Therefore, as shown in FIG. 17, the scale value correction unit 108 in the second embodiment is different from the scale value correction unit 108 in the first embodiment (see (1) in FIG. 11), and the allowable error power (second embodiment). Then, the allowable error power corrected by the allowable error power correction unit 105 and the allowable error power before correction) are used.

具体的には、実施例２に係る符号化装置では、トーン検出部１０３によって検出された帯域について、スケール値補正部１０８が、当該帯域を構成する周波数スペクトル各々を、スケール値算出部１０７によって算出されたスケール値を用いて量子化し、その際に発生する量子化誤差電力を算出する（前提技術参照）。 Specifically, in the encoding apparatus according to the second embodiment, for the band detected by the tone detection unit 103, the scale value correction unit 108 calculates each frequency spectrum constituting the band by the scale value calculation unit 107. Quantization is performed using the scale value thus obtained, and quantization error power generated at that time is calculated (see the base technology).

ここで、トーン検出部１０３によって帯域「ｂ」が検出され、帯域「ｂ」についてのスケール値「Ｓｂ」がスケール値算出部１０７によって算出され、帯域「ｂ」を構成する周波数スペクトルの本数が「Ｎｂ」である場合を例に、具体的に説明する。 Here, the tone detection unit 103 detects the band “b”, the scale value “Sb” for the band “b” is calculated by the scale value calculation unit 107, and the number of frequency spectra constituting the band “b” is “ Nb ”will be specifically described as an example.

まず、実施例２に係る符号化装置では、スケール値補正部１０８が、帯域「ｂ」を構成する周波数スペクトル各々をスケール値「Ｓｂ」を用いて量子化して量子化値を算出し、算出した量子化値とスケール値「Ｓｂ」とを用いて逆量子化を行い、逆量子化スペクトルを算出する。例えば、ＡＡＣ符号化方式においては、帯域「ｂ」に含まれるｉ番目のスペクトル「ｓｐｅｃｉ」から得られる量子化値「ｑｕａｎｔｉ」と、逆量子化スペクトル「ｉｓｐｅｃｉ」とを、スケール値補正部１０８は、図１８の（１）と（２）に示す式によって算出する。 First, in the encoding apparatus according to the second embodiment, the scale value correction unit 108 quantizes each frequency spectrum constituting the band “b” using the scale value “Sb” to calculate a quantization value, and calculates Inverse quantization is performed using the quantization value and the scale value “Sb” to calculate an inverse quantization spectrum. For example, in the AAC encoding method, the scale value correction unit 108 uses the quantized value “quanti” obtained from the i-th spectrum “speci” included in the band “b” and the inverse quantized spectrum “isspeci”. These are calculated by the equations shown in (1) and (2) of FIG.

そして、実施例２に係る符号化装置では、スケール値補正部１０８が、量子化前の周波数スペクトルと逆量子化スペクトルとから、当該帯域の量子化誤差電力を算出する。例えば、実施例２に係る符号化装置では、スケール値補正部１０８が、帯域「ｂ」の量子化誤差電力「ｅｒｒｏｒ＿ｅｂ」を、図１８の（３）に示す式によって算出する。なお、図１８の（３）に示す式中の「Ｎｂ」は、帯域「ｂ」に含まれる周波数スペクトルの数を示す。 In the encoding apparatus according to the second embodiment, the scale value correction unit 108 calculates the quantization error power of the band from the frequency spectrum before quantization and the inverse quantization spectrum. For example, in the encoding apparatus according to the second embodiment, the scale value correcting unit 108 calculates the quantization error power “error_eb” of the band “b” using the equation shown in (3) of FIG. Note that “Nb” in the equation shown in (3) of FIG. 18 indicates the number of frequency spectra included in the band “b”.

また、具体的には、実施例２に係る符号化装置では、スケール値補正部１０８が、スケール値算出部１０７によって算出されたスケール値を所定の値に変更し、トーン検出部１０３によって検出された帯域について、変更したスケール値（変更スケール値）を用いて量子化した際に発生する量子化誤差電力を算出する。 Specifically, in the encoding apparatus according to the second embodiment, the scale value correction unit 108 changes the scale value calculated by the scale value calculation unit 107 to a predetermined value and is detected by the tone detection unit 103. The quantization error power generated when quantizing the changed band using the changed scale value (changed scale value) is calculated.

例えば、スケール値補正部１０８は、スケール値「Ｓｂ」を所定の値「Ａ」に変更し、得られた変更スケール値「Ｓ‘ｂ（例えば、「Ｓ‘ｂ」＝「Ｓｂ」＋「Ａ」）」を用いて、帯域「ｂ」を量子化した際に発生する量子化誤差電力を算出する。 For example, the scale value correcting unit 108 changes the scale value “Sb” to a predetermined value “A”, and the obtained changed scale value “S′b (for example,“ S′b ”=“ Sb ”+“ A ” ")" Is used to calculate the quantization error power generated when the band "b" is quantized.

また、具体的には、実施例２に係る符号化装置では、スケール値補正部１０８が、スケール値算出部１０７によって算出されたスケール値を用いた際に発生する量子化誤差電力（量子化誤差電力「甲」とする）と、変更スケール値を用いた際に発生する量子化誤差電力（量子化誤差電力「乙」とする）とを比較し、量子化誤差電力「乙」の方が小さいかを判別する。ここで、スケール値補正部１０８は、量子化誤差電力「乙」が量子化誤差電力「甲」よりも小さい場合には、トーン検出部１０３によって検出された帯域のスケール値（例えば、「Ｓｂ」）を、変更スケール値（例えば、「Ｓ‘ｂ」）に補正する。一方、スケール値補正部１０８は、量子化誤差電力「乙」が量子化誤差電力「甲」よりも小さくない場合には、スケール値を補正しない。 Specifically, in the encoding apparatus according to the second embodiment, the quantization error power (quantization error) generated when the scale value correction unit 108 uses the scale value calculated by the scale value calculation unit 107. Compared with the power of “O”) and the quantization error power generated when the changed scale value is used (the quantization error power “O”), the quantization error power “O” is smaller Is determined. Here, when the quantization error power “O” is smaller than the quantization error power “A”, the scale value correction unit 108 determines the scale value of the band detected by the tone detection unit 103 (for example, “Sb”). ) To the changed scale value (for example, “S′b”). On the other hand, the scale value correction unit 108 does not correct the scale value when the quantization error power “O” is not smaller than the quantization error power “E”.

また、実施例２に係る符号化装置では、スケール値補正部１０８が、様々な「Ａ」を用いることによって複数のスケール値について量子化誤差電力を算出し、その中で、発生した量子化誤差電力が最も小さいスケール値に補正する。なお、例えば、スケール値補正部１０８は、変更スケール値として、一回目に「Ｓｂ１」、二回目に「Ｓｂ２」を用いた場合であって、一回目の変更スケール値「Ｓｂ１」を用いてスケール値算出部１０７によって算出されたスケール値「Ｓｂ」を補正した場合には、その後、スケール値補正部１０８は、「Ｓｂ１」を用いた際に発生する量子化誤差電力と、「Ｓｂ２」を用いた際に発生する量子化誤差電力とを比較する。 In the encoding apparatus according to the second embodiment, the scale value correcting unit 108 calculates quantization error power for a plurality of scale values by using various “A”, and among them, the generated quantization error is generated. Correct to the scale value with the lowest power. For example, the scale value correction unit 108 uses “Sb1” for the first time and “Sb2” for the second time as the changed scale value, and uses the first changed scale value “Sb1” for the scale. When the scale value “Sb” calculated by the value calculation unit 107 is corrected, the scale value correction unit 108 uses the quantization error power generated when “Sb1” is used and “Sb2”. Compared with the quantization error power generated when the error occurs.

また、例えば、スケール値補正部１０８は、所定の変更スケール値すべてについて（例えば、所定の「Ａ」から算出される変更スケール値各々（変更スケール値候補）について）、比較したかを判別する。そして、スケール値補正部１０８は、すべての変更スケール値について比較を行うまで、スケール値補正処理を継続する。 Further, for example, the scale value correcting unit 108 determines whether or not all predetermined changed scale values (for example, changed scale values calculated from predetermined “A” (changed scale value candidates)) are compared. Then, the scale value correction unit 108 continues the scale value correction process until all the changed scale values are compared.

なお、実施例２に係る符号化装置では、スケール値補正部１０８が、量子化誤差電力を、一対一で比較する手法について説明したが、本発明はこれに限定されるものではなく、複数のスケール値について、それぞれ量子化誤差電力を算出し、算出した複数（例えば、三つ以上）の量子化誤差電力について同時に比較し、発生した量子化誤差電力が最も小さい一つのスケール値を用いてもよい。 In the encoding apparatus according to the second embodiment, the scale value correction unit 108 has described the method of comparing the quantization error power on a one-to-one basis. However, the present invention is not limited to this, and a plurality of methods are used. Quantization error power is calculated for each scale value, and the calculated multiple (for example, three or more) quantization error powers are compared at the same time, and one scale value with the smallest generated quantization error power is used. Good.

また、「Ａ」の値は任意であり、「Ａ」として「０」より大きな値を用いるだけでなく、「Ａ」として「０」より小さな値を用いてもよい。また、スケール値補正部１０８は、変更スケール値として用いる値の個数（量子化誤差電力を算出して比較する回数）に関する設定を予め保持し、当該設定に基づいてスケール値補正処理を実行してもよい。 The value of “A” is arbitrary, and not only a value larger than “0” may be used as “A”, but a value smaller than “0” may be used as “A”. In addition, the scale value correction unit 108 stores in advance settings relating to the number of values used as the changed scale values (the number of times the quantization error power is calculated and compared), and executes scale value correction processing based on the settings. Also good.

なお、本発明は、様々な「Ａ」を用いる（複数の変更スケール値を用いる）手法に限定されるものではなく、例えば、スケール値算出部１０７によって算出されたスケール値を、一つの変更スケール値のみと比較してもよい。例えば、変更スケール値として予め量子化誤差が少なくなると予測される値を一つ選択して用いてもよい。これにより、スケール値補正処理を迅速に実行することが可能である。 Note that the present invention is not limited to a method using various “A” (using a plurality of changed scale values). For example, the scale value calculated by the scale value calculating unit 107 is changed to one changed scale. You may compare only with the value. For example, one value that is predicted to reduce the quantization error in advance as the change scale value may be selected and used. Thereby, the scale value correction process can be executed quickly.

なお、実施例２に係る符号化装置におけるスケール値補正部１０８による詳細な処理の流れの一例については、ここでは説明せず、後述する。 Note that an example of a detailed flow of processing performed by the scale value correction unit 108 in the encoding apparatus according to the second embodiment will not be described here but will be described later.

実施例２に係る符号化装置では、量子化部１０９は、スケール値補正部１０８によって算出された量子化誤差電力の内、最も少ない当該量子化誤差電力が算出された際に用いられたスケール値（または変更スケール値）を用いて、帯域に含まれる当該周波数スペクトル各々を量子化する。例えば、スケール値補正部１０８によって、スケール値算出部１０７によって算出されたスケール値「Ｓｂ」と、当該スケール値を「Ａ」に変更した値「Ｓ‘ｂ」とについて量子化誤差電力が算出され、「Ｓ‘ｂ」を用いた際に発生した量子化誤差電力が最も少なかった場合に、スケール値「Ｓ‘ｂ」を用いて、トーン検出部１０３によって検出された帯域を構成する周波数スペクトル各々を量子化する。 In the encoding apparatus according to the second embodiment, the quantization unit 109 uses the scale value used when the smallest quantization error power among the quantization error powers calculated by the scale value correction unit 108 is calculated. Each frequency spectrum included in the band is quantized using (or the changed scale value). For example, the quantization error power is calculated by the scale value correction unit 108 for the scale value “Sb” calculated by the scale value calculation unit 107 and the value “S′b” obtained by changing the scale value to “A”. , Each frequency spectrum constituting the band detected by the tone detector 103 using the scale value “S′b” when the quantization error power generated when using “S′b” is the smallest. Quantize

［実施例２におけるスケール値補正部による処理］
次に、図１９を用いて、実施例２におけるスケール値補正部による処理を説明する。なお、図１９は、実施例２に係る符号化装置のスケール値補正処理の流れを示すフローチャートである。 [Processing by Scale Value Correction Unit in Embodiment 2]
Next, processing performed by the scale value correction unit according to the second embodiment will be described with reference to FIG. FIG. 19 is a flowchart illustrating the flow of the scale value correction process of the encoding device according to the second embodiment.

なお、以下では、特に言及しない場合には、トーン検出部１０３によって帯域「ｂ」が検出され、帯域「ｂ」についてのスケール値「Ｓｂ」がスケール値算出部１０７によって算出され、帯域「ｂ」を構成する周波数スペクトルの本数が「Ｎｂ」である場合を例として用いて説明する。 In the following, unless otherwise specified, the band “b” is detected by the tone detection unit 103, the scale value “Sb” for the band “b” is calculated by the scale value calculation unit 107, and the band “b”. A case where the number of frequency spectrums constituting “Nb” is “Nb” will be described as an example.

図１９に示すように、開示の符号化装置では、スケール値補正部１０８は、スケール値を補正する場合には（ステップＳ３０１肯定）、量子化誤差電力を算出する（ステップＳ３０２）。つまり、スケール値補正部１０８は、スケール値算出部１０７によって算出されたスケール値「Ｓｂ」を用いて量子化し、帯域「ｂ」を量子化した際に発生する量子化誤差電力を算出する。 As shown in FIG. 19, in the disclosed encoding device, the scale value correction unit 108 calculates quantization error power (step S302) when correcting the scale value (Yes in step S301). That is, the scale value correction unit 108 quantizes using the scale value “Sb” calculated by the scale value calculation unit 107, and calculates quantization error power generated when the band “b” is quantized.

そして、スケール値補正部１０８は、スケール値を変更する（ステップＳ３０３）。つまり、例えば、スケール値補正部１０８は、スケール値「Ｓｂ」を所定の値「Ａ」に変更する。そして、スケール値補正部１０８は、変更したスケール値を用いて、量子化誤差電力を算出する（ステップＳ３０４）。つまり、例えば、スケール値補正部１０８は、得られた変更スケール値「Ｓ‘ｂ」を用いて、帯域「ｂ」を量子化した際に発生する量子化誤差電力を算出する。 Then, the scale value correction unit 108 changes the scale value (step S303). That is, for example, the scale value correction unit 108 changes the scale value “Sb” to the predetermined value “A”. Then, the scale value correction unit 108 calculates quantization error power using the changed scale value (step S304). That is, for example, the scale value correction unit 108 calculates the quantization error power generated when the band “b” is quantized using the obtained changed scale value “S′b”.

そして、スケール値補正部１０８は、量子化誤差電力を比較する（ステップＳ３０５）。つまり、例えば、スケール値補正部１０８は、帯域「ｂ」を量子化した際に発生する量子化誤差電力について、スケール値算出部１０７によって算出されたスケール値「Ｓｂ」を用いた際の量子化誤差電力（量子化誤差電力「甲」）と、変更スケール値「Ｓ‘ｂ」を用いた際の量子化誤差電力（量子化誤差電力「乙」）とを、比較する。 Then, the scale value correction unit 108 compares the quantization error power (step S305). That is, for example, the scale value correction unit 108 uses the scale value “Sb” calculated by the scale value calculation unit 107 for the quantization error power generated when the band “b” is quantized. The error power (quantization error power “E”) is compared with the quantization error power (quantization error power “O”) when the changed scale value “S′b” is used.

そして、スケール値補正部１０８は、スケール値算出部１０７によって算出されたスケール値を用いた際の量子化誤差電力と比較して、変更スケール値を用いた際の量子化誤差電力が小さいかを判別する（ステップＳ３０６）。つまり、例えば、スケール値補正部１０８は、量子化誤差電力「乙」が量子化誤差電力「甲」よりも小さいかを判別する。ここで、スケール値補正部１０８は、量子化誤差電力「乙」が量子化誤差電力「甲」よりも小さい場合には（ステップＳ３０６肯定）、スケール値を補正する（ステップＳ３０７）。つまり、例えば、スケール値補正部１０８は、スケール値「Ｓｂ」を変更スケール値「Ｓ‘ｂ」に補正する。 Then, the scale value correction unit 108 determines whether the quantization error power when the changed scale value is used is smaller than the quantization error power when the scale value calculated by the scale value calculation unit 107 is used. A determination is made (step S306). That is, for example, the scale value correction unit 108 determines whether the quantization error power “O” is smaller than the quantization error power “E”. Here, when the quantization error power “O” is smaller than the quantization error power “E” (Yes at Step S306), the scale value correction unit 108 corrects the scale value (Step S307). That is, for example, the scale value correction unit 108 corrects the scale value “Sb” to the changed scale value “S′b”.

そして、スケール値補正部１０８は、スケール値を補正すると（ステップＳ３０７）、または、量子化誤差電力「乙」が量子化誤差電力「甲」よりも小さくなかった場合には（ステップＳ３０６否定）、変更スケール値の候補について、すべて比較したかを判別する（ステップＳ３０８）。ここで、すべての変更スケール値の候補について比較した場合には（ステップＳ３０８肯定）、処理を終了する。一方、すべての変更スケール値の候補について比較していない場合には（ステップＳ３０８否定）、すべての変更スケール値の候補について比較するまで、上記したステップＳ３０３からＳ３０７までの処理を繰り返す。 Then, the scale value correcting unit 108 corrects the scale value (step S307), or when the quantization error power “O” is not smaller than the quantization error power “E” (No at step S306). It is determined whether all the candidates for the change scale value have been compared (step S308). If all the change scale value candidates are compared (Yes at step S308), the process is terminated. On the other hand, if all the change scale value candidates are not compared (No in step S308), the above-described processing from step S303 to S307 is repeated until all the change scale value candidates are compared.

［実施例２の効果］
上記したように、実施例２によれば、開示の符号化装置は、スケール値補正部１０８が、量子化誤差電力を、スケール値算出部１０７によって算出されたスケール値を用いて算出し、かつ、変更スケール値を用いて算出し、算出した量子化誤差電力の内、最も少ない量子化誤差電力が算出された際に用いられたスケール値（または変更スケール値）を用いて量子化するので、量子化誤差を最低限に抑えることが可能である。 [Effect of Example 2]
As described above, according to the second embodiment, in the disclosed encoding device, the scale value correction unit 108 calculates the quantization error power using the scale value calculated by the scale value calculation unit 107, and , Using the changed scale value, and quantizing using the scale value (or changed scale value) that was used when the smallest quantization error power was calculated, It is possible to minimize the quantization error.

さて、これまで本発明の実施例１と実施例２とについて説明したが、本発明は上述した実施例１と実施例２と以外にも、種々の異なる形態にて実施されてよいものである。そこで、以下では、その他の実施例について説明する。 Although the first and second embodiments of the present invention have been described so far, the present invention may be implemented in various different forms other than the first and second embodiments described above. . Therefore, other embodiments will be described below.

［スケール値］
例えば、上記した実施例１と実施例２とでは、スケール値補正部１０８は、トーン検出部１０３によって検出された帯域についてのみ、スケール値を補正する手法について説明したが、本発明はこれに限定されるものではなく、すべての帯域についてスケール値を補正してもよい。これにより、実施例３に係る符号化装置では、他の帯域についても、量子化誤差を少なくすることが可能である。 [Scale value]
For example, in the first embodiment and the second embodiment described above, the scale value correction unit 108 has described the method of correcting the scale value only for the band detected by the tone detection unit 103, but the present invention is not limited to this. However, the scale value may be corrected for all bands. Thereby, in the encoding apparatus according to the third embodiment, it is possible to reduce the quantization error for other bands.

また、例えば、量子化部１０９は、スケール値補正部１０８によって算出されたスケール値を用いて、すべての帯域に含まれる周波数スペクトル各々を量子化してもよい。具体的な一例をあげて説明すると、スケール値補正部１０８は、トーン検出部１０３によって検出された帯域についてのみ、スケール値を算出して補正し、さらに、トーン検出部１０３によって検出された帯域について算出したスケール値に、他の帯域についてのスケール値を補正する。そして、量子化部１０９は、トーン検出部１０３によって検出された帯域についてトーン検出部１０３によって算出したスケール値を用いて、すべての帯域の周波数スペクトル各々を量子化する。 For example, the quantization unit 109 may quantize each frequency spectrum included in all bands using the scale value calculated by the scale value correction unit 108. To explain with a specific example, the scale value correction unit 108 calculates and corrects the scale value only for the band detected by the tone detection unit 103, and further, for the band detected by the tone detection unit 103. The scale values for other bands are corrected to the calculated scale value. Then, the quantizing unit 109 quantizes each frequency spectrum of all bands using the scale value calculated by the tone detecting unit 103 for the band detected by the tone detecting unit 103.

これにより、実施例３に係る符号化装置では、スケール値を符号化する際に使用するビット数を、抑えることが可能である。具体的には、符号化する際には、スケール値は、隣接するスケール値との差で表される。ここで、すべてのスケール値を同一のスケール値にすることによって、帯域各々に異なるスケール値が設定されている手法と比較して、帯域各々に設定されるスケール値を符号化する際に必要なビット数を抑えることが可能である。 Thereby, in the encoding apparatus according to the third embodiment, the number of bits used when encoding the scale value can be suppressed. Specifically, when encoding, the scale value is represented by a difference from an adjacent scale value. Here, by setting all scale values to the same scale value, it is necessary to encode the scale values set for each band as compared with the method in which different scale values are set for each band. It is possible to reduce the number of bits.

［近傍の帯域］
上記した実施例１では、近傍の帯域として、トーン検出部１０３によって検出された帯域を中心として所定の帯域幅にある帯域を用いる手法について説明したが、本発明はこれに限定されるものではなく、ピークの電力から所定の電力幅にある帯域を用いてもよい。 [Nearby band]
In the above-described first embodiment, the technique of using a band having a predetermined bandwidth centered on the band detected by the tone detection unit 103 as the nearby band has been described, but the present invention is not limited to this. A band having a predetermined power width from the peak power may be used.

言い換えると、図２０に示すように、まず、トーン検出部１０３からの検出結果と帯域電力とから、予め設定されている電力幅を用いて、許容誤差電力を補正する帯域幅を決定し、その後、許容誤差電力を補正する。 In other words, as shown in FIG. 20, first, the bandwidth for correcting the allowable error power is determined from the detection result from the tone detection unit 103 and the bandwidth power using a preset power width, and then Correct the allowable error power.

具体的には、実施例３に係る符号化装置では、許容誤差電力補正部１０５が、電力幅記憶部に、所定の電力幅を記憶する電力幅記憶部を備える。許容誤差電力補正部１０５は、電力幅記憶部に、例えば、「Ｇ」を記憶する。 Specifically, in the encoding apparatus according to the third embodiment, the allowable error power correction unit 105 includes a power width storage unit that stores a predetermined power width in the power width storage unit. The allowable error power correction unit 105 stores, for example, “G” in the power width storage unit.

そして、実施例３に係る符号化装置では、図２１に示すように、許容誤差電力補正部１０５は、トーン検出部１０３によって検出されたトーン性のある周波数スペクトルが含まれる帯域を含む一つまたは複数の連続している当該帯域であって、トーン検出部１０３によって検出された帯域の電力値から、電力幅記憶部に記憶されている所定の電力幅に減衰した電力値以上の電力値を有する帯域各々を、近傍の帯域として、許容誤差電力を補正する。 In the encoding apparatus according to the third embodiment, as illustrated in FIG. 21, the allowable error power correction unit 105 includes one band including a frequency spectrum having a tone characteristic detected by the tone detection unit 103 or A plurality of consecutive bands having a power value equal to or higher than a power value attenuated to a predetermined power width stored in the power width storage unit from the power value of the band detected by the tone detection unit 103 The allowable error power is corrected with each band as a nearby band.

例えば、図２１に示す例を用いて、具体的に説明する。なお、ここでは、トーン性のある帯域の周波数スペクトルの電力が「Ｅｐｅａｋ」であり、電力幅記憶部が「Ｇ」を記憶し、７つの帯域があるものとして説明する。実施例３に係る符号化装置では、許容誤差電力補正部１０５が、「Ｅｐｅａｋ」から「Ｇ」を減衰させた電力である「Ｅｔｈｒ」を算出し、許容誤差電力を補正する帯域を決定するための電力閾値とする。 For example, a specific description will be given using the example shown in FIG. In the following description, it is assumed that the power of the frequency spectrum of the band with tone characteristics is “Epeak”, the power width storage unit stores “G”, and there are seven bands. In the encoding apparatus according to the third embodiment, the allowable error power correction unit 105 calculates “Ethr”, which is power obtained by attenuating “G” from “Epeak”, and determines a band for correcting the allowable error power. Power threshold.

また、例えば、実施例３に係る符号化装置では、許容誤差電力補正部１０５が、トーン帯域に隣接する帯域において、電力閾値より大きい電力を持つ帯域を調べる。例えば、図２１に示す例では、許容誤差電力補正部１０５は、帯域「２」と「４」とが電力閾値よりも大きい電力を示しているため、許容誤差電力を補正する帯域幅は、「Ｂ１（トーン帯域に隣接する帯域より低周波数側にある帯域の数）＝１」、「Ｂ２（トーン帯域に隣接する帯域より高周波数側にある帯域の数）＝１」とする。 Further, for example, in the encoding apparatus according to the third embodiment, the allowable error power correction unit 105 checks a band having power larger than the power threshold in a band adjacent to the tone band. For example, in the example illustrated in FIG. 21, the allowable error power correcting unit 105 indicates power whose bands “2” and “4” are larger than the power threshold, and thus the bandwidth for correcting the allowable error power is “ B1 (the number of bands on the lower frequency side than the band adjacent to the tone band) = 1 ”and“ B2 (the number of bands on the higher frequency side of the band adjacent to the tone band) = 1 ”.

このように、実施例３に係る符号化装置は、簡単に許容誤差電力を補正する帯域を検出することが可能である。 Thus, the encoding apparatus according to the third embodiment can easily detect a band for correcting the allowable error power.

［システム構成］
また、本実施例において説明した各処理のうち、自動的におこなわれるものとして説明した処理の全部または一部を手動的におこなうこともでき、あるいは、手動的におこなわれるものとして説明した処理の全部または一部を公知の方法で自動的におこなうこともできる。この他、上記文書中や図面中で示した処理手順、制御手順、具体的名称、各種のデータやパラメータを含む情報（例えば、図５〜図１３図１７〜図２１）については、特記する場合を除いて任意に変更することができる。 [System configuration]
In addition, among the processes described in this embodiment, all or part of the processes described as being performed automatically can be performed manually, or the processes described as being performed manually can be performed. All or a part can be automatically performed by a known method. In addition, the processing procedure, control procedure, specific name, and information including various data and parameters (for example, FIGS. 5 to 13, FIGS. It can be changed arbitrarily except for.

［実施例の組み合わせ］
また、上記した実施例１においては、例えば、許容誤差電力を補正する際に、（１）スケール値を補正する手法と、（２）量子化値が最大値になるようなスケール値を用いる手法と、（３）近傍の帯域を検出する際に、所定の帯域幅を用いる手法と、を併せて実施する場合について説明した。しかし、本発明の実施はこれに限定されるものではなく、許容誤差電力を補正する際に、（１）〜（３）を併せて実施しなくてもよく、また、（１）〜（３）の内いずれか一つまたは複数のみを実施してもよい。 [Combination of Examples]
In the first embodiment, for example, when correcting the allowable error power, (1) a method of correcting the scale value, and (2) a method of using a scale value that maximizes the quantized value. And (3) a method of using a predetermined bandwidth when detecting a nearby band has been described. However, the implementation of the present invention is not limited to this, and when correcting the allowable error power, (1) to (3) may not be performed together, and (1) to (3 ) May be implemented.

また、同様に、実施例２と、上記した実施例３において説明した手法についても、同様に、これらの手法の内、いずれか一つを実施する場合に本発明が限定されるものではなく、複数の手法を併せて実施してもよい。 Similarly, for the methods described in the second embodiment and the third embodiment, the present invention is not limited to the case where any one of these methods is performed. A plurality of methods may be implemented together.

［プログラム］
ところで、上記実施例１では、ハードウェアロジックによって各種の処理を実現する場合を説明したが、本発明はこれに限定されるものではなく、予め用意されたプログラムをコンピュータで実行することによって実現するようにしてもよい。そこで、以下では、図２２を用いて、上記の実施例１に示した符号化装置と同様の機能を有する符号化プログラムを実行するコンピュータの一例を説明する。なお、図２２は、実施例１に係る符号化装置のプログラムを説明するための図である。 [program]
In the first embodiment, the case where various processes are realized by hardware logic has been described. However, the present invention is not limited to this, and is realized by executing a program prepared in advance by a computer. You may do it. Therefore, in the following, an example of a computer that executes an encoding program having the same function as that of the encoding apparatus described in the first embodiment will be described with reference to FIG. FIG. 22 is a diagram for explaining the program of the encoding apparatus according to the first embodiment.

同図に示すように、実施例１における符号化装置３０００は、操作部３００１、マイク３００２、スピーカ３００３、ディスプレイ３００５、通信部３００６、ＣＰＵ３０１０、ＲＯＭ３０１１、ＨＤＤ３０１２、ＲＡＭ３０１３をバス３００９などで接続して構成されている。 As shown in the figure, the encoding device 3000 according to the first embodiment is configured by connecting an operation unit 3001, a microphone 3002, a speaker 3003, a display 3005, a communication unit 3006, a CPU 3010, a ROM 3011, an HDD 3012, and a RAM 3013 via a bus 3009 or the like. Has been.

ＲＯＭ３０１１には、上記の実施例１で示した入力部１０１と、ＭＤＣＴ部１０２と、トーン検出部１０３と、心理聴覚分析部１０４と、許容誤差電力補正部１０５と、量子化帯域検出部１０６と、スケール値算出部１０７と、スケール値補正部１０８と、量子化部１０９と、符号化部１１０と、出力部１１１と同様の機能を発揮する制御プログラム、つまり、同図に示すように、入力プログラム３０１１ａと、ＭＤＣＴプログラム３０１１ｂと、トーン検出プログラム３０１１ｃと、心理聴覚分析プログラム３０１１ｄと、許容誤差電力補正プログラム３０１１ｅと、量子化帯域検出プログラム３０１１ｆと、スケール値算出プログラム３０１１ｇと、スケール値補正プログラム３０１１ｈと、量子化プログラム３０１１ｉと、符号化プログラム３０１１ｊと、出力プログラム３０１１ｋとが予め記憶されている。なお、これらのプログラム３０１１ａ〜３０１１ｋについては、図６に示した符号化装置の各構成要素と同様、適宜統合または分離してもよい。 The ROM 3011 includes the input unit 101, the MDCT unit 102, the tone detection unit 103, the psychoacoustic analysis unit 104, the allowable error power correction unit 105, and the quantization band detection unit 106 described in the first embodiment. , A scale value calculation unit 107, a scale value correction unit 108, a quantization unit 109, an encoding unit 110, and a control program that exhibits the same functions as the output unit 111, that is, as shown in FIG. Program 3011a, MDCT program 3011b, tone detection program 3011c, psychoacoustic analysis program 3011d, allowable error power correction program 3011e, quantization band detection program 3011f, scale value calculation program 3011g, scale value correction program 3011h A quantization program 3011i and an encoding program A ram 3011J, and an output program 3011k stored in advance. Note that these programs 3011a to 3011k may be appropriately integrated or separated in the same manner as each component of the encoding apparatus shown in FIG.

そして、ＣＰＵ３０１０が、これらのプログラム３０１１ａ〜３０１１ｋをＲＯＭ３０１１から読み出して実行することにより、図２２に示すように、各プログラム３０１１ａ〜３０１１ｋについては、入力プロセス３０１０ａと、ＭＤＣＴプロセス３０１０ｂと、トーン検出プロセス３０１０ｃと、心理聴覚分析プロセス３０１０ｄと、許容誤差電力補正プロセス３０１０ｅと、量子化帯域検出プロセス３０１０ｆと、スケール値算出プロセス３０１０ｇと、スケール値補正プロセス３０１０ｈと、量子化プロセス３０１０ｉと、符号化プロセス３０１０ｊと、出力プロセス３０１０ｋとして機能するようになる。なお、各プロセス３０１０ａ〜３０１０ｋは、図６に示した、入力部１０１と、ＭＤＣＴ部１０２と、トーン検出部１０３と、心理聴覚分析部１０４と、許容誤差電力補正部１０５と、量子化帯域検出部１０６と、スケール値算出部１０７と、スケール値補正部１０８と、量子化部１０９と、符号化部１１０と、出力部１１１とにそれぞれ対応する。 Then, the CPU 3010 reads out these programs 3011a to 3011k from the ROM 3011 and executes them, and as shown in FIG. 22, for each program 3011a to 3011k, an input process 3010a, an MDCT process 3010b, and a tone detection process 3010c. A psychoacoustic analysis process 3010d, an allowable error power correction process 3010e, a quantization band detection process 3010f, a scale value calculation process 3010g, a scale value correction process 3010h, a quantization process 3010i, and an encoding process 3010j And function as an output process 3010k. Each of the processes 3010a to 3010k includes the input unit 101, the MDCT unit 102, the tone detection unit 103, the psychoacoustic analysis unit 104, the allowable error power correction unit 105, and the quantization band detection illustrated in FIG. Corresponds to the unit 106, the scale value calculation unit 107, the scale value correction unit 108, the quantization unit 109, the encoding unit 110, and the output unit 111, respectively.

［その他］
なお、本実施例で説明した符号化装置は、あらかじめ用意されたプログラムをパーソナルコンピュータやワークステーションなどのコンピュータで実行することによって実現することができる。このプログラムは、インターネットなどのネットワークを介して配布することができる。また、このプログラムは、ハードディスク、フレキシブルディスク（ＦＤ）、ＣＤ−ＲＯＭ、ＭＯ、ＤＶＤなどのコンピュータで読み取り可能な記録媒体に記録され、コンピュータによって記録媒体から読み出されることによって実行することもできる。 [Others]
Note that the encoding apparatus described in the present embodiment can be realized by executing a program prepared in advance on a computer such as a personal computer or a workstation. This program can be distributed via a network such as the Internet. The program can also be executed by being recorded on a computer-readable recording medium such as a hard disk, a flexible disk (FD), a CD-ROM, an MO, and a DVD and being read from the recording medium by the computer.

以上の各実施例を含む実施形態に関し、さらに以下の付記を開示する。 The following supplementary notes are further disclosed with respect to the embodiments including the above examples.

（付記１）オーディオ信号を周波数スペクトルに変換し、当該周波数スペクトルを量子化して符号化する符号化装置において、前記周波数スペクトルからトーン性のある周波数スペクトルを検出された場合、前記オーディオ信号に応じて算出された許容誤差電力を補正する電力補正部と、
前記電力補正部によって補正された許容誤差電力より大きな電力の前記周波数スペクトル各々を量子化する量子化部と、を有することを特徴とする符号化装置。 (Supplementary note 1) In an encoding device that converts an audio signal into a frequency spectrum and quantizes and encodes the frequency spectrum, when a frequency spectrum having tone characteristics is detected from the frequency spectrum, A power correction unit that corrects the calculated allowable error power;
And a quantization unit that quantizes each of the frequency spectra having a power larger than the allowable error power corrected by the power correction unit.

（付記２）オーディオ信号を符号化する符号化装置において、
前記オーディオ信号を周波数スペクトルに変換する周波数変換部と、
前記オーディオ信号に応じて、許容誤差電力を算出する電力算出部と、
前記周波数変換部によって変換された前記周波数スペクトルから、トーン性のある周波数スペクトルを検出する検出部と、
前記検出部での検出結果と前記電力算出部によって算出された前記許容誤差電力とを用いて、前記許容誤差電力を補正する電力補正部と、
前記電力補正部によって補正された前記許容誤差電力より大きな電力の前記周波数スペクトル各々を量子化する量子化部と、
を有することを特徴とする符号化装置。 (Supplementary Note 2) In an encoding apparatus for encoding an audio signal,
A frequency converter for converting the audio signal into a frequency spectrum;
A power calculating unit for calculating an allowable error power according to the audio signal;
From the frequency spectrum converted by the frequency conversion unit, a detection unit for detecting a frequency spectrum with tone characteristics;
A power correction unit that corrects the allowable error power using the detection result of the detection unit and the allowable error power calculated by the power calculation unit;
A quantization unit that quantizes each of the frequency spectrums having a power larger than the allowable error power corrected by the power correction unit;
An encoding device comprising:

（付記３）前記電力算出部は、前記オーディオ信号の周波数を所定の幅で分割した帯域それぞれについて、前記許容誤差電力を算出し、
前記検出部は、前記トーン性のある周波数スペクトルを検出するとともに当該周波数スペクトルが含まれる前記帯域を検出し、
前記電力補正部は、前記検出部によって検出された前記帯域に近傍する帯域について前記電力算出部によって算出された前記許容誤差電力が、当該近傍する帯域についての前記周波数スペクトルの電力より小さくなるように補正することを特徴とする付記２に記載の符号化装置。 (Supplementary Note 3) The power calculation unit calculates the allowable error power for each band obtained by dividing the frequency of the audio signal by a predetermined width,
The detection unit detects the frequency spectrum having the tone property and detects the band in which the frequency spectrum is included,
The power correction unit is configured such that the allowable error power calculated by the power calculation unit for a band close to the band detected by the detection unit is smaller than the power of the frequency spectrum for the adjacent band. The encoding apparatus according to appendix 2, wherein correction is performed.

（付記４）前記量子化部は、前記周波数スペクトルのダイナミックレンジを、スケール値によって一意に特定されるダイナミックレンジに縮小し、縮小したダイナミックレンジにおける当該周波数スペクトル各々を量子化するものであって、
前記帯域に含まれる前記周波数スペクトルを量子化する際に発生する誤差である量子化誤差から算出される量子化誤差電力が、当該帯域について電力算出部によって算出された前記許容誤差電力未満になるようなスケール値を、前記帯域ごとに算出する第１のスケール値算出部と、
前記検出部によって検出された前記トーン性のある前記周波数スペクトルが含まれる前記帯域に近傍する帯域について、当該近傍する帯域に含まれる前記周波数スペクトルを量子化する際に発生する誤差である量子化誤差から算出される量子化誤差電力が、当該近傍する帯域について前記電力補正部によって当該近傍する帯域について算出された前記許容誤差電力未満となるようなスケール値を、当該近傍する帯域についての当該スケール値として算出する第２のスケール値算出部と、
をさらに備え、
前記量子化部は、前記第２のスケール値算出部によって前記スケール値が算出された前記帯域に含まれる前記周波数スペクトル各々を、当該第２のスケール値算出部によって算出された前記スケール値を用いて量子化し、当該第２のスケール値算出部によって当該スケール値が算出されていない当該帯域に含まれる前記周波数スペクトル各々を、前記第１のスケール値算出部によって算出されたスケール値を用いて量子化することを特徴とする付記２または３に記載の符号化装置。 (Supplementary Note 4) The quantization unit reduces the dynamic range of the frequency spectrum to a dynamic range uniquely specified by a scale value, and quantizes each frequency spectrum in the reduced dynamic range,
A quantization error power calculated from a quantization error that is an error that occurs when the frequency spectrum included in the band is quantized is less than the allowable error power calculated by the power calculation unit for the band. A first scale value calculation unit for calculating a specific scale value for each band;
Quantization error, which is an error that occurs when the frequency spectrum included in the adjacent band is quantized with respect to the band adjacent to the band including the tone-specific frequency spectrum detected by the detection unit The quantization error power calculated from the scale value that is less than the allowable error power calculated for the neighboring band by the power correction unit for the neighboring band is the scale value for the neighboring band. A second scale value calculation unit for calculating as
Further comprising
The quantization unit uses the scale value calculated by the second scale value calculating unit for each frequency spectrum included in the band in which the scale value is calculated by the second scale value calculating unit. And quantizing each frequency spectrum included in the band for which the scale value is not calculated by the second scale value calculation unit using the scale value calculated by the first scale value calculation unit. 4. The encoding device according to appendix 2 or 3, characterized in that:

（付記５）前記周波数スペクトルを前記量子化部が量子化して得られる値には、当該値として得ることが可能である最大の値が設定されているものであって、
前記帯域に含まれる前記周波数スペクトルを量子化する際に発生する誤差である量子化誤差から算出される量子化誤差電力が、当該帯域について電力算出部によって算出された前記許容誤差電力未満になるようなスケール値を、前記帯域ごとに算出する第１のスケール値算出部と、
前記検出部によって検出された前記トーン性のある前記周波数スペクトルが含まれる前記帯域を構成する当該周波数スペクトル各々の内、最も大きな当該周波数スペクトルから得られる値が前記最大の値となるような前記スケール値を、当該帯域についての当該スケール値として算出する第３のスケール値算出部と、
をさらに備え、
前記量子化部は、前記第３のスケール値算出部によって前記スケール値が算出された前記帯域に含まれる前記周波数スペクトル各々を、当該第３のスケール値算出部によって算出された前記スケール値を用いて量子化し、当該第３のスケール値算出部によって当該スケール値が算出されていない当該帯域に含まれる前記周波数スペクトル各々を、前記第１のスケール値算出部によって算出されたスケール値を用いて量子化することを特徴とする付記２〜４のいずれか一つに記載の符号化装置。 (Supplementary Note 5) The value obtained by quantizing the frequency spectrum by the quantization unit is set to the maximum value that can be obtained as the value,
A quantization error power calculated from a quantization error that is an error that occurs when the frequency spectrum included in the band is quantized is less than the allowable error power calculated by the power calculation unit for the band. A first scale value calculation unit for calculating a specific scale value for each band;
The scale such that a value obtained from the largest frequency spectrum among the frequency spectra constituting the band in which the frequency spectrum having the tone property detected by the detection unit is included is the maximum value. A third scale value calculation unit for calculating a value as the scale value for the band;
Further comprising
The quantization unit uses the scale value calculated by the third scale value calculating unit for each frequency spectrum included in the band in which the scale value is calculated by the third scale value calculating unit. And quantizing each frequency spectrum included in the band for which the scale value is not calculated by the third scale value calculation unit using the scale value calculated by the first scale value calculation unit. The encoding device according to any one of appendices 2 to 4, characterized in that:

（付記６）前記帯域に含まれる前記周波数スペクトルを量子化する際に発生する誤差である量子化誤差から算出される量子化誤差電力が、当該帯域について電力算出部によって算出された前記許容誤差電力未満になるようなスケール値を、前記帯域ごとに算出する第１のスケール値算出部と、
前記帯域に含まれる前記周波数スペクトルを量子化する際に発生する前記量子化誤差電力を、当該帯域について前記第１のスケール値算出部によって算出された前記スケール値を用いて算出し、かつ、当該第１のスケール値算出部によって算出された前記スケール値を所定の値に変更したスケール値である変更スケール値を用いて算出する誤差算出部と、
をさらに備え、
前記量子化部は、前記誤差算出部によって算出された前記量子化誤差電力の内、最も小さい当該量子化誤差電力が算出された際に用いられた前記スケール値または変更スケール値を用いて、前記帯域に含まれる当該周波数スペクトルを量子化することを特徴とする付記２〜４のいずれか一つに記載の符号化装置。 (Supplementary Note 6) The allowable error power calculated by a power calculation unit for a quantization error power calculated from a quantization error that is an error generated when the frequency spectrum included in the band is quantized. A first scale value calculation unit that calculates a scale value that is less than each band,
The quantization error power generated when quantizing the frequency spectrum included in the band is calculated using the scale value calculated by the first scale value calculation unit for the band, and An error calculating unit that calculates using a changed scale value that is a scale value obtained by changing the scale value calculated by the first scale value calculating unit to a predetermined value;
Further comprising
The quantization unit uses the scale value or the changed scale value used when the smallest quantization error power is calculated among the quantization error powers calculated by the error calculation unit, The encoding apparatus according to any one of appendices 2 to 4, wherein the frequency spectrum included in the band is quantized.

（付記７）前記量子化部は、前記第３のスケール値算出部によって算出されたスケール値を用いて、すべての前記帯域に含まれる前記周波数スペクトル各々を量子化することを特徴とする付記４〜６のいずれか一つに記載の符号化装置。 (Supplementary note 7) The quantization unit quantizes each of the frequency spectra included in all the bands using the scale value calculated by the third scale value calculation unit. The encoding apparatus as described in any one of -6.

（付記８）所定の帯域数を記憶する帯域数記憶部をさらに備え、
前記電力補正部は、前記検出部によって検出された前記トーン性のある前記周波数スペクトルが含まれる前記帯域を中心として、前記帯域数記憶部によって記憶された前記所定の帯域数の範囲に位置する帯域各々を、前記近傍の帯域として前記許容誤差電力を補正することを特徴とする付記２〜７のいずれか一つに記載の符号化装置。 (Supplementary note 8) A band number storage unit for storing a predetermined number of bands is further provided,
The power correction unit is a band located in the range of the predetermined number of bands stored by the number-of-bands storage unit, with the band including the frequency spectrum having the tone characteristic detected by the detection unit as a center. The encoding device according to any one of appendices 2 to 7, wherein the allowable error power is corrected with each of the adjacent bands as the band.

（付記９）所定の電力幅を記憶する電力幅記憶部をさらに備え、
前記電力補正部は、前記検出部によって検出された前記トーン性のある前記周波数スペクトルが含まれる前記帯域を含む一つまたは複数の連続している当該帯域であって、前記検出部によって検出された前記帯域の電力値から、電力幅記憶部に記憶されている前記所定の電力幅減衰した電力値以上の電力値を有する当該帯域各々を、前記近傍の帯域として前記許容誤差電力を補正することを特徴とする付記２〜８のいずれか一つに記載の符号化装置。 (Supplementary Note 9) A power width storage unit that stores a predetermined power width is further provided.
The power correction unit is one or a plurality of continuous bands including the band including the frequency spectrum with the tone characteristic detected by the detection unit, and is detected by the detection unit. Correcting the allowable error power with each band having a power value equal to or greater than the power value attenuated by the predetermined power width stored in the power width storage unit from the power value of the band as the neighboring band. The encoding device according to any one of supplementary notes 2 to 8, which is characterized.

（付記１０）オーディオ信号を周波数スペクトルに変換し、符号化する符号化方法において、
前記周波数スペクトルからトーン性のある周波数スペクトルを検出し、前記オーディオ信号に応じた許容誤差電力を補正するステップと、
補正された許容誤差電力を用いて、前記周波数スペクトルを量子化するステップと、
を含むことを特徴とする符号化方法。 (Supplementary note 10) In an encoding method for converting and encoding an audio signal into a frequency spectrum,
Detecting a tonal frequency spectrum from the frequency spectrum and correcting an allowable error power according to the audio signal;
Quantizing the frequency spectrum using the corrected tolerance power;
The encoding method characterized by including.

（付記１１）オーディオ信号を周波数スペクトルに変換し、符号化する処理をコンピュータに実行させる符号化プログラムであって、
前記周波数スペクトルからトーン性のある周波数スペクトルを検出し、前記オーディオ信号に応じた許容誤差電力を補正する手順と、
補正された許容誤差電力を用いて、前記周波数スペクトルを量子化する手順と、
をコンピュータに実行させることを特徴とする符号化プログラム。 (Supplementary note 11) An encoding program for causing a computer to execute a process of converting an audio signal into a frequency spectrum and encoding the audio signal,
Detecting a frequency spectrum having a tone characteristic from the frequency spectrum and correcting an allowable error power according to the audio signal;
Using the corrected allowable error power to quantize the frequency spectrum;
An encoding program for causing a computer to execute.

実施例１に係る符号化装置の前提技術を説明するための図である。It is a figure for demonstrating the premise technique of the encoding apparatus which concerns on Example 1. FIG. 実施例１に係る符号化装置の前提技術を説明するための図である。It is a figure for demonstrating the premise technique of the encoding apparatus which concerns on Example 1. FIG. 実施例１に係る符号化装置の前提技術を説明するための図である。It is a figure for demonstrating the premise technique of the encoding apparatus which concerns on Example 1. FIG. 実施例１に係る符号化装置の前提技術を説明するための図である。It is a figure for demonstrating the premise technique of the encoding apparatus which concerns on Example 1. FIG. 実施例１に係る符号化装置の概要および特徴を示すための図である。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a diagram for illustrating an overview and characteristics of an encoding apparatus according to a first embodiment. 実施例１に係る符号化装置の構成を示すためのブロック図である。1 is a block diagram for illustrating a configuration of an encoding apparatus according to Embodiment 1. FIG. 実施例１におけるトーン検出部を説明するための図である。FIG. 3 is a diagram for explaining a tone detection unit according to the first embodiment. 実施例１における心理聴覚分析部を説明するための図である。It is a figure for demonstrating the psychoacoustic analysis part in Example 1. FIG. 実施例１における許容誤差電力補正部を説明するための図である。FIG. 6 is a diagram for explaining an allowable error power correction unit according to the first embodiment. 実施例１における許容誤差電力補正部を説明するための図である。FIG. 6 is a diagram for explaining an allowable error power correction unit according to the first embodiment. 実施例１におけるスケール値補正部を説明するための図である。FIG. 6 is a diagram for explaining a scale value correction unit according to the first embodiment. 実施例１に係る符号化装置の処理全体の流れを示すフローチャートである。6 is a flowchart illustrating an overall process flow of the encoding apparatus according to the first embodiment. 実施例１に係るスケール値補正部の処理の流れを示すフローチャートである。6 is a flowchart illustrating a process flow of a scale value correction unit according to the first embodiment. 実施例１に係る符号化装置の効果を説明するための図である。It is a figure for demonstrating the effect of the encoding apparatus which concerns on Example 1. FIG. 実施例１に係る符号化装置の効果を説明するための図である。It is a figure for demonstrating the effect of the encoding apparatus which concerns on Example 1. FIG. 実施例１に係る符号化装置の効果を説明するための図である。It is a figure for demonstrating the effect of the encoding apparatus which concerns on Example 1. FIG. 実施例２に係る符号化装置を説明するための図である。FIG. 10 is a diagram for explaining an encoding apparatus according to a second embodiment. 実施例２に係る符号化装置を説明するための図である。FIG. 10 is a diagram for explaining an encoding apparatus according to a second embodiment. 実施例２に係る符号化装置のスケール値補正処理の流れを示すフローチャートである。12 is a flowchart illustrating a flow of a scale value correction process of the encoding device according to the second embodiment. 実施例３に係る符号化装置を説明するための図である。FIG. 10 is a diagram for explaining an encoding apparatus according to a third embodiment. 実施例３に係る符号化装置を説明するための図である。FIG. 10 is a diagram for explaining an encoding apparatus according to a third embodiment. 実施例１に係る符号化装置のプログラムを説明するための図である。FIG. 3 is a diagram for explaining a program of the encoding apparatus according to the first embodiment. 従来技術を説明するための図である。It is a figure for demonstrating a prior art.

符号の説明Explanation of symbols

１０１入力部
１０２ＭＤＣＴ部
１０３トーン検出部
１０４心理聴覚分析部
１０５許容誤差電力補正部
１０６量子化帯域検出部
１０７スケール値算出部
１０８スケール値補正部
１０９量子化部
１１０符号化部
１１１出力部 DESCRIPTION OF SYMBOLS 101 Input part 102 MDCT part 103 Tone detection part 104 Psychological auditory analysis part 105 Allowable error electric power correction part 106 Quantization zone | band detection part 107 Scale value calculation part 108 Scale value correction part 109 Quantization part 110 Encoding part 111 Output part

Claims

オーディオ信号を周波数スペクトルに変換し、当該周波数スペクトルを量子化して符号化する符号化装置において、
前記周波数スペクトルからトーン性のある周波数スペクトルを検出された場合、前記オーディオ信号の周波数を所定の幅で分割した帯域のうち、トーン性のある周波数スペクトルにおいてトーン性のある前記帯域に近傍する帯域について前記オーディオ信号に応じて前記帯域ごとに算出される許容誤差電力が、当該近傍する帯域についての前記周波数スペクトルの電力より小さくなるように補正する電力補正部と、
前記電力補正部によって補正された許容誤差電力より大きな電力の前記周波数スペクトル各々を量子化する量子化部と、を有することを特徴とする符号化装置。 In an encoding device that converts an audio signal into a frequency spectrum and quantizes and encodes the frequency spectrum.
When a frequency spectrum having a tone characteristic is detected from the frequency spectrum, among bands obtained by dividing the frequency of the audio signal by a predetermined width, a band close to the band having the tone characteristic in the frequency spectrum having the tone characteristic A power correction unit that corrects an allowable error power calculated for each band according to the audio signal so as to be smaller than a power of the frequency spectrum for the adjacent band;
And a quantization unit that quantizes each of the frequency spectra having a power larger than the allowable error power corrected by the power correction unit.

オーディオ信号を符号化する符号化装置において、
前記オーディオ信号を周波数スペクトルに変換する周波数変換部と、
前記オーディオ信号に応じて、前記オーディオ信号の周波数を所定の幅で分割した帯域それぞれについて許容誤差電力を算出する電力算出部と、
前記周波数変換部によって変換された前記周波数スペクトルから、トーン性のある周波数スペクトルを検出するとともに、トーン性のある周波数スペクトルにおいてトーン性のある前記帯域を検出する検出部と、
前記検出部での検出結果と前記電力算出部によって算出された前記許容誤差電力とを用いて、前記検出部によって検出された前記帯域に近傍する帯域について前記電力算出部によって算出された前記許容誤差電力が、当該近傍する帯域についての前記周波数スペクトルの電力より小さくなるように補正する電力補正部と、
前記電力補正部によって補正された前記許容誤差電力より大きな電力の前記周波数スペクトル各々を量子化する量子化部と、
を有することを特徴とする符号化装置。 In an encoding device for encoding an audio signal,
A frequency converter for converting the audio signal into a frequency spectrum;
A power calculator that calculates an allowable error power for each of the bands obtained by dividing the frequency of the audio signal by a predetermined width in accordance with the audio signal;
A detection unit for detecting a frequency spectrum having a tone characteristic from the frequency spectrum converted by the frequency conversion unit, and detecting the band having a tone characteristic in the frequency spectrum having a tone characteristic;
The allowable error calculated by the power calculation unit for a band close to the band detected by the detection unit, using the detection result of the detection unit and the allowable error power calculated by the power calculation unit. A power correction unit that corrects the power to be smaller than the power of the frequency spectrum for the adjacent band; and
A quantization unit that quantizes each of the frequency spectrums having a power larger than the allowable error power corrected by the power correction unit;
An encoding device comprising:

前記量子化部は、前記周波数スペクトルのダイナミックレンジを、スケール値によって一意に特定されるダイナミックレンジに縮小し、縮小したダイナミックレンジにおける当該周波数スペクトル各々を量子化するものであって、
前記帯域に含まれる前記周波数スペクトルを量子化する際に発生する誤差である量子化誤差から算出される量子化誤差電力が、当該帯域について前記電力算出部によって算出された前記許容誤差電力未満になるようなスケール値を、前記帯域ごとに算出する第１のスケール値算出部と、
前記検出部によって検出された前記トーン性のある前記周波数スペクトルが含まれる前記帯域に近傍する帯域について、当該近傍する帯域に含まれる前記周波数スペクトルを量子化する際に発生する誤差である量子化誤差から算出される量子化誤差電力が、当該近傍する帯域について前記電力補正部によって当該近傍する帯域について算出された前記許容誤差電力未満となるようなスケール値を、当該近傍する帯域についての当該スケール値として算出する第２のスケール値算出部と、
をさらに備え、
前記量子化部は、前記第２のスケール値算出部によって前記スケール値が算出された前記帯域に含まれる前記周波数スペクトル各々を、当該第２のスケール値算出部によって算出された前記スケール値を用いて量子化し、当該第２のスケール値算出部によって当該スケール値が算出されていない当該帯域に含まれる前記周波数スペクトル各々を、前記第１のスケール値算出部によって算出されたスケール値を用いて量子化することを特徴とする請求項２に記載の符号化装置。 The quantization unit reduces the dynamic range of the frequency spectrum to a dynamic range uniquely specified by a scale value, and quantizes each frequency spectrum in the reduced dynamic range,
The quantization error power calculated from the quantization error that is an error that occurs when the frequency spectrum included in the band is quantized is less than the allowable error power calculated by the power calculation unit for the band. A first scale value calculation unit for calculating such a scale value for each band;
Quantization error, which is an error that occurs when the frequency spectrum included in the adjacent band is quantized with respect to the band adjacent to the band including the tone-specific frequency spectrum detected by the detection unit The quantization error power calculated from the scale value that is less than the allowable error power calculated for the neighboring band by the power correction unit for the neighboring band is the scale value for the neighboring band. A second scale value calculation unit for calculating as
Further comprising
The quantization unit uses the scale value calculated by the second scale value calculating unit for each frequency spectrum included in the band in which the scale value is calculated by the second scale value calculating unit. And quantizing each frequency spectrum included in the band for which the scale value is not calculated by the second scale value calculation unit using the scale value calculated by the first scale value calculation unit. The encoding apparatus according to claim 2, wherein:

前記周波数スペクトルを前記量子化部が量子化して得られる値には、当該値として得ることが可能である最大の値が設定されているものであって、
前記帯域に含まれる前記周波数スペクトルを量子化する際に発生する誤差である量子化誤差から算出される量子化誤差電力が、当該帯域について前記電力算出部によって算出された前記許容誤差電力未満になるようなスケール値を、前記帯域ごとに算出する第１のスケール値算出部と、
前記検出部によって検出された前記トーン性のある前記周波数スペクトルが含まれる前記帯域を構成する当該周波数スペクトル各々の内、最も大きな当該周波数スペクトルから得られる値が前記最大の値となるような前記スケール値を、当該帯域についての当該スケール値として算出する第３のスケール値算出部と、
をさらに備え、
前記量子化部は、前記第３のスケール値算出部によって前記スケール値が算出された前記帯域に含まれる前記周波数スペクトル各々を、当該第３のスケール値算出部によって算出された前記スケール値を用いて量子化し、当該第３のスケール値算出部によって当該スケール値が算出されていない当該帯域に含まれる前記周波数スペクトル各々を、前記第１のスケール値算出部によって算出されたスケール値を用いて量子化することを特徴とする請求項２又は３に記載の符号化装置。 The value obtained by quantizing the frequency spectrum by the quantization unit is set to the maximum value that can be obtained as the value,
The quantization error power calculated from the quantization error that is an error that occurs when the frequency spectrum included in the band is quantized is less than the allowable error power calculated by the power calculation unit for the band. A first scale value calculation unit for calculating such a scale value for each band;
The scale such that a value obtained from the largest frequency spectrum among the frequency spectra constituting the band in which the frequency spectrum having the tone property detected by the detection unit is included is the maximum value. A third scale value calculation unit for calculating a value as the scale value for the band;
Further comprising
The quantization unit uses the scale value calculated by the third scale value calculating unit for each frequency spectrum included in the band in which the scale value is calculated by the third scale value calculating unit. And quantizing each frequency spectrum included in the band for which the scale value is not calculated by the third scale value calculation unit using the scale value calculated by the first scale value calculation unit. The encoding apparatus according to claim 2 or 3, wherein

前記帯域に含まれる前記周波数スペクトルを量子化する際に発生する誤差である量子化誤差から算出される量子化誤差電力が、当該帯域について前記電力算出部によって算出された前記許容誤差電力未満になるようなスケール値を、前記帯域ごとに算出する第１のスケール値算出部と、
前記帯域に含まれる前記周波数スペクトルを量子化する際に発生する前記量子化誤差電力を、当該帯域について前記第１のスケール値算出部によって算出された前記スケール値を用いて算出し、かつ、当該第１のスケール値算出部によって算出された前記スケール値を所定の値に変更したスケール値である変更スケール値を用いて算出する誤差算出部と、
をさらに備え、
前記量子化部は、前記誤差算出部によって算出された前記量子化誤差電力の内、最も小さい当該量子化誤差電力が算出された際に用いられた前記スケール値または変更スケール値を用いて、前記帯域に含まれる当該周波数スペクトルを量子化することを特徴とする請求項２又は３に記載の符号化装置。 The quantization error power calculated from the quantization error that is an error that occurs when the frequency spectrum included in the band is quantized is less than the allowable error power calculated by the power calculation unit for the band. A first scale value calculation unit for calculating such a scale value for each band;
The quantization error power generated when quantizing the frequency spectrum included in the band is calculated using the scale value calculated by the first scale value calculation unit for the band, and An error calculating unit that calculates using a changed scale value that is a scale value obtained by changing the scale value calculated by the first scale value calculating unit to a predetermined value;
Further comprising
The quantization unit uses the scale value or the changed scale value used when the smallest quantization error power is calculated among the quantization error powers calculated by the error calculation unit, The encoding apparatus according to claim 2 or 3, wherein the frequency spectrum included in the band is quantized.

前記量子化部は、前記第３のスケール値算出部によって算出されたスケール値を用いて、すべての前記帯域に含まれる前記周波数スペクトル各々を量子化することを特徴とする請求項４に記載の符号化装置。 The quantization unit, using the scale factor determined by the third scale factor determining unit, the frequency spectrum, each contained in all of the bands according to claim 4, characterized in that the quantizing Encoding device.

所定の帯域数を記憶する帯域数記憶部をさらに備え、
前記電力補正部は、前記検出部によって検出された前記トーン性のある前記周波数スペクトルが含まれる前記帯域を中心として、前記帯域数記憶部によって記憶された前記所定の帯域数の範囲に位置する帯域各々を、前記近傍の帯域として前記許容誤差電力を補正す
ることを特徴とする請求項２〜６のいずれか一つに記載の符号化装置。 A band number storage unit for storing a predetermined number of bands;
The power correction unit is a band located in the range of the predetermined number of bands stored by the number-of-bands storage unit, with the band including the frequency spectrum having the tone characteristic detected by the detection unit as a center. The encoding apparatus according to claim 2, wherein the allowable error power is corrected using each of the adjacent bands as a band.

オーディオ信号を周波数スペクトルに変換し、符号化する符号化方法において、
前記周波数スペクトルからトーン性のある周波数スペクトルを検出し、前記オーディオ信号の周波数を所定の幅で分割した帯域のうち、トーン性のある周波数スペクトルにおいてトーン性のある前記帯域に近傍する帯域について前記オーディオ信号に応じて前記帯域ごとに算出される許容誤差電力が、当該近傍する帯域についての前記周波数スペクトルの電力より小さくなるように補正するステップと、
補正された許容誤差電力を用いて、前記周波数スペクトルを量子化するステップと、
を含むことを特徴とする符号化方法。 In an encoding method for converting and encoding an audio signal into a frequency spectrum,
A frequency spectrum having a tone characteristic is detected from the frequency spectrum, and a band of the frequency spectrum having a tone characteristic that is close to the band having a tone characteristic in a band obtained by dividing the frequency of the audio signal by a predetermined width is used. Correcting the permissible error power calculated for each band according to the signal to be smaller than the power of the frequency spectrum for the neighboring bands;
Quantizing the frequency spectrum using the corrected tolerance power;
The encoding method characterized by including.

オーディオ信号を周波数スペクトルに変換し、符号化する処理をコンピュータに実行させる符号化プログラムであって、
前記周波数スペクトルからトーン性のある周波数スペクトルを検出し、前記オーディオ信号の周波数を所定の幅で分割した帯域のうち、トーン性のある周波数スペクトルにおいてトーン性のある前記帯域に近傍する帯域について前記オーディオ信号に応じて前記帯域ごとに算出される許容誤差電力が、当該近傍する帯域についての前記周波数スペクトルの電力より小さくなるように補正する手順と、
補正された許容誤差電力を用いて、前記周波数スペクトルを量子化する手順と、
をコンピュータに実行させることを特徴とする符号化プログラム。 An encoding program for converting an audio signal into a frequency spectrum and causing a computer to execute an encoding process,
A frequency spectrum having a tone characteristic is detected from the frequency spectrum, and a band of the frequency spectrum having a tone characteristic that is close to the band having a tone characteristic in a band obtained by dividing the frequency of the audio signal by a predetermined width is used. A procedure for correcting the allowable error power calculated for each band according to a signal to be smaller than the power of the frequency spectrum for the neighboring band;
Using the corrected allowable error power to quantize the frequency spectrum;
An encoding program for causing a computer to execute.