JP3930596B2

JP3930596B2 - Audio signal encoding method

Info

Publication number: JP3930596B2
Application number: JP02869697A
Authority: JP
Inventors: 直樹村山
Original assignee: 株式会社タイトー
Priority date: 1997-02-13
Filing date: 1997-02-13
Publication date: 2007-06-13
Anticipated expiration: 2017-02-13
Also published as: JPH10228298A

Description

【０００１】
【発明の属する技術分野】
本発明は、音声信号を高度に圧縮して符号化を行う音声信号符号化方法に関する。
【０００２】
【従来の技術】
音声信号を符号化するため、従来においては特開昭６３−１４８７１６号公報に示すようなＡＤＰＣＭ方式が広く用いられている。
この方式は、入力信号の値から次の入力信号の値を予測して、その予測値と実際の入力信号との差をとり、この差の振幅の大きさに応じたステップサイズで量子化して符号化するものであり、量子化のステップアップサイズを可変することができることを特徴としている。
【０００３】
【発明が解決しようとする課題】
しかしながら、上記した従来のＡＤＰＣＭ方式においては、音声信号を符号化する場合に、符号化した音声信号を復号化／再生する場合に音質を確保する上で重要度の高いスペクトルも重要度の低いスペクトルも一律に符号化されるので、符号化圧縮率を高くすることができず、例えば通信カラオケ等に使用した場合等は、通信データの中に占める音声信号の占有率が大きく、他の必要なデータを入れることができなかったり、或いは、この方式で録音／再生を行うためには大容量のメモリを必要とし、また、それを録音する記憶媒体の大きさが限られるので、必要な情報の量を記憶できなくなるという問題があった。
【０００４】
本発明は、上記した従来の問題点を解決すべく創案されたもので、音声信号を高度に圧縮して符号化し、しかも音声信号を復号して再生した場合の音質低下を抑えることのできる音声信号符号化方法を提供するものである。
【０００５】
【課題を解決するための手段】
上記課題を解決するため、本発明は、
フレーム化直交変換部と、レベル計算部と、スペクトル選択部と、スペクトル符号化部と、を備える計算機が音声信号を符号化する音声信号符号化方法において、
前記フレーム化直交変換部が、デジタル変換された音声信号データを所定の大きさのフレームでフレーム化した後に直交変換してスペクトル化し、
前記レベル計算部が、前記スペクトル化した全ての音声信号データのスペクトルの信号レベルの総和を第１レベルとして求め、
前記スペクトル選択部が、第１スペクトル選択処理、第２スペクトル選択処理、レベル計算処理、および、レベル判断処理を順に実行することで、前記スペクトル化した音声信号データのスペクトルを選択し、
前記スペクトル符号化部が、前記スペクトル選択部により選択されたスペクトルを可逆符号化し、
前記第１スペクトル選択処理は、スペクトルの全体について、任意のスペクトルから一定の距離内にそのスペクトルの大きさ以上の大きさを持つ別のスペクトルが存在するかどうかを調べ、そのようなスペクトルが存在しなければ、そのスペクトルを第１スペクトルの一つとして選択し、
前記第２スペクトル選択処理は、前記第１スペクトル選択処理で選択された各々の第１スペクトルから、第１スペクトル間距離とは異なる一定距離内で最大の大きさを有するスペクトルを第２スペクトルの一つとして選択し、
前記レベル計算処理は、前記第１スペクトル選択処理および前記第２スペクトル選択処理で選択された全てのスペクトルの信号レベルの総和を第２レベルとして計算し、
前記レベル判断処理は、前記第１レベルに対して前記第２レベルが所定の比率以下なら、前記一定の距離を前記第１スペクトル間距離より少し短縮して設定し、前記第１スペクトル選択処理に処理を戻す
構成としたものである。
【０００７】
【発明の実施の形態】
以下、本発明の実施の形態を図１乃至図４に基づいて説明する。
図１は、本発明の音声信号符号化方法を行うための符号化装置のブロック図、図２は、図１の符号化装置で行われる音声信号符号化方法の構成図、図３は、図２のスペクトル選択部の処理手順を示すフローチャート、図４は、図２のスペクトル選択部で行われるピークの選択を示す説明図である。
【０００８】
本発明の音声信号符号化方法は、図１に示すような符号化装置１で音声信号が符号化されるものである。
この符号化装置１は、外部のマイクロホン２等で音声をアナログ信号に変換した入力信号を受け、アナログ信号をデジタル信号に変換するＡ／Ｄコンバータ３と、このデジタル変換した信号を圧縮符号化する符号処理部４、とにより構成されている。
また、符号処理部４は、公知のマイクロコンピュータにより構成されており、内部はＩ／Ｏ装置４ａ、ＣＰＵ４ｂ、ＲＯＭ４ｃ、ＲＡＭ４ｄ等からなり、予めＲＯＭ４ｃ内に格納されているプログラムに従って効率よく符号化されるものである。
【０００９】
この符号処理部４の機能構成を図２に示す。図２において、符号処理部４は、フレーム化直交変換部５、レベル計算部６、スペクトル選択部７、レベル補償部８、およびスペクトル符号化部９の各機能ブロックから構成されている。
この、符号処理部４の動作機能を図２に基づいて説明する。
まず、図２において、前記Ａ／Ｄコンバータ３（図１参照）によりＰＣＭ変換によりデジタル信号に変換されて符号処理部４に入力された音声信号データは、フレーム化直交変換部５で、例えば、１フレームのデータ数が１０２４個というように、適切な個数にまとめてフレーム化され、このフレームの全てに対して直交変換を行って音声信号のデータがスペクトル化される。
【００１０】
次いで、レベル計算部６では、スペクトル化した音声信号のデータの全てについて、大きさ（絶対値）の総和を計算し、その値をスペクトルの全体レベルとする。また、これらのスペクトルを、例えば２５６個ずつといった周波数のサブバンドに分割し、この分割したサブバンドの各々ついて、スペクトルの大きさの絶対値の総和を計算してスペクトルの部分レベルとする。
次いで、スペクトル選択部７において、スペクトル全体の中から再生時の音質を保つ上で重要度の高いスペクトルの選択を行う。但し、この選択の際には、選択後のスペクトルの総和である選択後の全体レベルが、前記レベル計算部６にて計算しておいた選択前の全体レベルに対して例えば５０％以上となるようにスペクトルの選択を行う。
【００１１】
このスペクトル選択部７におけるスペクトルの選択手順を、図３乃至図４に基づいて説明する。
まず、図３におけるステップＳ１で、第１ピーク選択処理を行う。この第１ピーク選択処理はスペクトルの全体について、図４（ａ）に示すように、任意のスペクトルから一定の距離内にそのスペクトルの大きさ（絶対値）以上の大きさを持つ別のスペクトルが存在するかどうかを調べ、そのようなスペクトルが存在しなければ、そのスペクトルを第１ピークの一つとして選択する。
この第１ピーク選択処理の結果として、図４（ｂ）に示すような、ピーク間の一定距離（以下第１ピーク間距離という）以上の間隔をおいた飛び飛びのスペクトルだけが選択される。
【００１２】
次いで、ステップＳ２において第ｎ（ｎは整数）ピーク選択処理を行う。この第ｎピーク選択処理は、図４（ｂ）に示すように、ステップＳ１で選択された各々の第１ピークから、第１ピーク間距離とは異なる一定距離内で最大の大きさ（絶対値）を選択すると、図４（ｃ）に示すような２番目の大きさのスペクトル、即ち第２ピークとして選択される。
さらに、第２ピークの選択が終了すれば、同様にして、３番目に大きい第３ピークの選択を行い、以下順次第４ピーク、・・・第ｎピークの選択を行う。
次いで、ステップＳ３で選択後のレベル計算を行う。この選択後のレベル計算は、ステップＳ１、ステップＳ２で選択された全てのスペクトル、即ち第１ピーク、第２ピーク、第３ピーク、・・・第ｎピークの大きさの総和を計算し、選択後の全体レベルとする。
【００１３】
次いで、ステップＳ４にて、ステップＳ３で計算された選択後の全体レベルが、予め計算されていた選択前の全体レベルに対して５０％以上であるか、どうかを判断する。このステップＳ４の判断結果、５０％以上であれば処理完了となり、次のステップであるレベル補償部８（図２参照）へ進む。
もし５０％以下であれば、ステップＳ５に進む。このステップＳ５ではステップＳ１に戻って再度第１ピーク選択処理からやり直す場合に、前記第１ピーク間距離より少し短縮して選択処理を行うようにするため、ピーク間距離の縮小を行うものである。その結果、選択されるスペクトルの数は増え、ある段階で５０％以上の条件を満足するようになるので、その段階で処理が終了し、次のステップであるレベル補償部８（図２参照）へ進み、スペクトル選択を完了する。
【００１４】
スペクトル選択が完了すると、レベル補償部８（図２参照）では、選択後のスペクトルから各サブバンドの部分レベルを計算し、サブバンド内の前スペクトルに対して、次式で計算された補償値αをかけてやり、選択後の部分レベルが選択前の部分レベルと等しくなるように調整を行うものである。
但し、αは、
α＝（選択前の部分レベル）／（選択後の部分レベル）
の式で与えられるものである。
レベル補償部８で大きさを調整されたスペクトルは、スペクトル符号化部９において、カラオケ等の通信用データに組み込んだり、記録用データに組み込みをするため可逆的に符号化され、復号化器に向けて出力される。
【００１５】
上述した実施の形態によれば、スペクトル選択部７において、直交変換で得られたスペクトルの中で、再生時の音質を保ために重要度の高いスペクトルのみ選択して符号化するため、少ない情報量で音声信号の符号化が可能となる。
また、本発明の実施の形態においては、フレーム化直交変換部５において１フレームの大きさをデータ数１０２４個にしたが、この数に限定されないことは勿論である。
さらに、スペクトル選択部７における選択前と選択後の全体レベルの割合を５０％以上としたが、本発明はこれに限定されるものではなく、圧縮度を上げる場合にはこの割合を下げ、再生した時の忠実度を上げる場合にはこの割合を上げることで調整ができるので、適切な選択度を選ぶことができる。
【００１６】
【発明の効果】
以上に述べたごとく本発明は、
フレーム化直交変換部と、レベル計算部と、スペクトル選択部と、スペクトル符号化部と、を備える計算機が音声信号を符号化する音声信号符号化方法において、
前記フレーム化直交変換部が、デジタル変換された音声信号データを所定の大きさのフレームでフレーム化した後に直交変換してスペクトル化し、
前記レベル計算部が、前記スペクトル化した全ての音声信号データのスペクトルの信号レベルの総和を第１レベルとして求め、
前記スペクトル選択部が、第１スペクトル選択処理、第２スペクトル選択処理、レベル計算処理、および、レベル判断処理を順に実行することで、前記スペクトル化した音声信号データのスペクトルを選択し、
前記スペクトル符号化部が、前記スペクトル選択部により選択されたスペクトルを可逆符号化し、
前記第１スペクトル選択処理は、スペクトルの全体について、任意のスペクトルから一定の距離内にそのスペクトルの大きさ以上の大きさを持つ別のスペクトルが存在するかどうかを調べ、そのようなスペクトルが存在しなければ、そのスペクトルを第１スペクトルの一つとして選択し、
前記第２スペクトル選択処理は、前記第１スペクトル選択処理で選択された各々の第１スペクトルから、第１スペクトル間距離とは異なる一定距離内で最大の大きさを有するスペクトルを第２スペクトルの一つとして選択し、
前記レベル計算処理は、前記第１スペクトル選択処理および前記第２スペクトル選択処理で選択された全てのスペクトルの信号レベルの総和を第２レベルとして計算し、
前記レベル判断処理は、前記第１レベルに対して前記第２レベルが所定の比率以下なら、前記一定の距離を前記第１スペクトル間距離より少し短縮して設定し、前記第１スペクトル選択処理に処理を戻す
ことにより、音声信号を高度に圧縮して符号化することができるので、少ない情報量で音声信号を符号化することが可能になり、音声の情報を蓄積する場合には蓄積スペースが少なくてすみ、また、音声の情報を通信する場合には通信時間を短縮できるという効果を奏する。また、音声信号を復号して再生した場合の音質低下を抑えることのできるという効果を奏するものである。
【図面の簡単な説明】
【図１】本発明の音声信号符号化方法を行うための符号化装置のブロック図である。
【図２】図１の符号化装置で行われる音声信号符号化方法の構成図である。
【図３】図２のスペクトル選択部の処理手順を示すフローチャートである。
【図４】図２のスペクトル選択部で行われるピーク選択を示す説明図である。
【符号の説明】
１符号化装置
２マイクロホン
３Ａ／Ｄコンバータ
４符号処理部
４ａＩ／Ｏ装置
４ｂＣＰＵ
４ｃＲＯＭ
４ｄＲＡＭ
５フレーム化直交変換部
６レベル計算部
７スペクトル選択部
８レベル補償部
９スペクトル符号化部[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an audio signal encoding method that performs encoding by highly compressing an audio signal.
[0002]
[Prior art]
In order to encode an audio signal, an ADPCM system as disclosed in Japanese Patent Laid-Open No. 63-148716 has been widely used.
This method predicts the value of the next input signal from the value of the input signal, takes the difference between the predicted value and the actual input signal, and quantizes it with a step size corresponding to the amplitude of this difference. The encoding is performed, and the step-up size of the quantization can be varied.
[0003]
[Problems to be solved by the invention]
However, in the above-described conventional ADPCM system, when a speech signal is encoded, a spectrum having a high importance in securing a sound quality when decoding / reproducing the encoded speech signal is also a spectrum having a low importance. Are uniformly encoded, so the encoding compression rate cannot be increased. For example, when used for communication karaoke, etc., the occupancy rate of the audio signal in the communication data is large. In order to record data or to record / playback with this method, a large amount of memory is required and the size of the storage medium for recording it is limited. There was a problem that the amount could not be memorized.
[0004]
The present invention was devised to solve the above-mentioned conventional problems, and is capable of suppressing deterioration in sound quality when an audio signal is highly compressed and encoded, and the audio signal is decoded and reproduced. A signal encoding method is provided.
[0005]
[Means for Solving the Problems]
In order to solve the above problems, the present invention provides:
In a speech signal encoding method in which a computer including a framing orthogonal transform unit, a level calculation unit, a spectrum selection unit, and a spectrum encoding unit encodes a speech signal,
The framed orthogonal transform unit framing the digitally converted audio signal data into frames of a predetermined size, and then orthogonally transforming into spectrum,
The level calculation section calculates the total sum of the signal levels of the spectra of all of the audio signal data said spectral as the first level,
The spectrum selection unit sequentially executes a first spectrum selection process, a second spectrum selection process, a level calculation process, and a level determination process, thereby selecting a spectrum of the spectralized audio signal data ,
The spectrum coding section, and lossless encoding the spectrum selected by the spectrum selection portion,
The first spectrum selection process checks whether there is another spectrum having a magnitude greater than or equal to the spectrum within a certain distance from an arbitrary spectrum. Otherwise, select that spectrum as one of the first spectra,
In the second spectrum selection process, a spectrum having a maximum size within a certain distance different from the first inter-spectrum distance is selected from each first spectrum selected in the first spectrum selection process. Select as one
The level calculation process calculates a sum of signal levels of all the spectra selected in the first spectrum selection process and the second spectrum selection process as a second level,
In the level determination process, if the second level is less than or equal to a predetermined ratio with respect to the first level, the fixed distance is set slightly shorter than the distance between the first spectra, and the first spectrum selection process is performed. Return processing
It is a configuration.
[0007]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, embodiments of the present invention will be described with reference to FIGS.
FIG. 1 is a block diagram of an encoding apparatus for performing the speech signal encoding method of the present invention, FIG. 2 is a configuration diagram of the speech signal encoding method performed by the encoding apparatus of FIG. 1, and FIG. FIG. 4 is an explanatory diagram illustrating peak selection performed by the spectrum selection unit in FIG. 2.
[0008]
In the audio signal encoding method of the present invention, an audio signal is encoded by an encoding apparatus 1 as shown in FIG.
The encoding device 1 receives an input signal obtained by converting sound into an analog signal by an external microphone 2 or the like, and compresses and encodes the digitally converted signal and an A / D converter 3 that converts the analog signal into a digital signal. And a code processing unit 4.
The code processing unit 4 is composed of a known microcomputer, and includes an I / O device 4a, CPU 4b, ROM 4c, RAM 4d, etc., and is efficiently encoded according to a program stored in the ROM 4c in advance. Is.
[0009]
A functional configuration of the code processing unit 4 is shown in FIG. In FIG. 2, the code processing unit 4 includes functional blocks of a framed orthogonal transform unit 5, a level calculation unit 6, a spectrum selection unit 7, a level compensation unit 8, and a spectrum coding unit 9.
The operation function of the code processing unit 4 will be described with reference to FIG.
First, in FIG. 2, the audio signal data converted into a digital signal by PCM conversion by the A / D converter 3 (see FIG. 1) and inputted to the code processing unit 4 is converted into, for example, a framed orthogonal conversion unit 5. The number of data of one frame is made into a suitable number such as 1024 frames, and orthogonal transformation is performed on all of the frames, and the audio signal data is spectrumized.
[0010]
Next, the level calculation unit 6 calculates the sum of the magnitudes (absolute values) of all of the spectralized audio signal data, and sets that value as the overall level of the spectrum. Further, these spectra are divided into subbands of frequencies such as 256, and the sum of absolute values of spectrum sizes is calculated for each of the divided subbands to obtain a partial level of the spectrum.
Next, the spectrum selection unit 7 selects a spectrum having high importance in maintaining sound quality during reproduction from the entire spectrum. However, in this selection, the total level after selection, which is the total sum of the spectrum after selection, is, for example, 50% or more with respect to the total level before selection calculated by the level calculation unit 6. The spectrum is selected as follows.
[0011]
The spectrum selection procedure in the spectrum selector 7 will be described with reference to FIGS.
First, in step S1 in FIG. 3, a first peak selection process is performed. In the first peak selection process, as shown in FIG. 4A, another spectrum having a magnitude equal to or larger than the magnitude (absolute value) of the spectrum within a certain distance from an arbitrary spectrum is obtained. It is checked whether it exists, and if such a spectrum does not exist, that spectrum is selected as one of the first peaks.
As a result of the first peak selection process, only a spectrum with a distance greater than or equal to a certain distance between peaks (hereinafter referred to as a first peak-to-peak distance) as shown in FIG. 4B is selected.
[0012]
Next, in step S2, an nth (n is an integer) peak selection process is performed. As shown in FIG. 4B, the n-th peak selection process has a maximum magnitude (absolute value) within a certain distance different from the distance between the first peaks from each first peak selected in step S1. ) Is selected as the second magnitude spectrum as shown in FIG. 4C, that is, the second peak.
When the selection of the second peak is completed, the third largest third peak is selected in the same manner, and the fourth peak,.
Subsequently, the level calculation after selection is performed at step S3. The level calculation after this selection is performed by calculating the sum of the sizes of all the spectra selected in step S1 and step S2, that is, the first peak, the second peak, the third peak,... The whole level will be later.
[0013]
Next, in step S4, it is determined whether or not the overall level after selection calculated in step S3 is 50% or more with respect to the overall level before selection calculated in advance. If the determination result in step S4 is 50% or more, the processing is completed, and the process proceeds to the next step, the level compensation unit 8 (see FIG. 2).
If it is 50% or less, the process proceeds to step S5. In step S5, when returning to step S1 and starting again from the first peak selection process, the distance between the peaks is reduced in order to perform the selection process with a slightly shorter distance than the first peak distance. . As a result, the number of spectra to be selected increases, and the condition of 50% or more is satisfied at a certain stage. Therefore, the processing is terminated at that stage, and the level compensation unit 8 (see FIG. 2) as the next step. To complete the spectrum selection.
[0014]
When spectrum selection is completed, the level compensator 8 (see FIG. 2) calculates the partial level of each subband from the selected spectrum, and the compensation value calculated by the following equation for the previous spectrum in the subband. The adjustment is performed by multiplying α so that the partial level after selection becomes equal to the partial level before selection.
Where α is
α = (partial level before selection) / (partial level after selection)
It is given by the following formula.
The spectrum whose size has been adjusted by the level compensator 8 is reversibly encoded by the spectrum encoder 9 so as to be incorporated into communication data such as karaoke or to be incorporated into recording data. Is output.
[0015]
According to the above-described embodiment, the spectrum selection unit 7 selects and encodes only a high importance spectrum from the spectrum obtained by the orthogonal transform to maintain the sound quality at the time of reproduction. The audio signal can be encoded by the amount.
In the embodiment of the present invention, the framed orthogonal transform unit 5 sets the size of one frame to 1024 data. However, it is needless to say that the number is not limited to this number.
Further, the ratio of the total level before and after selection in the spectrum selection unit 7 is set to 50% or more. However, the present invention is not limited to this. When the fidelity is increased, adjustment can be made by increasing this ratio, so that an appropriate selectivity can be selected.
[0016]
【The invention's effect】
As described above, the present invention
In a speech signal encoding method in which a computer including a framing orthogonal transform unit, a level calculation unit, a spectrum selection unit, and a spectrum encoding unit encodes a speech signal,
The framed orthogonal transform unit framing the digitally converted audio signal data into frames of a predetermined size, and then orthogonally transforming into spectrum,
The level calculation section calculates the total sum of the signal levels of the spectra of all of the audio signal data said spectral as the first level,
The spectrum selection unit sequentially executes a first spectrum selection process, a second spectrum selection process, a level calculation process, and a level determination process, thereby selecting a spectrum of the spectralized audio signal data ,
The spectrum coding section, and lossless encoding the spectrum selected by the spectrum selection portion,
The first spectrum selection process checks whether there is another spectrum having a magnitude greater than or equal to the spectrum within a certain distance from an arbitrary spectrum. Otherwise, select that spectrum as one of the first spectra,
In the second spectrum selection process, a spectrum having a maximum size within a certain distance different from the first inter-spectrum distance is selected from each first spectrum selected in the first spectrum selection process. Select as one
The level calculation process calculates a sum of signal levels of all the spectra selected in the first spectrum selection process and the second spectrum selection process as a second level,
In the level determination process, if the second level is less than or equal to a predetermined ratio with respect to the first level, the fixed distance is set slightly shorter than the distance between the first spectra, and the first spectrum selection process is performed. Return processing
As a result, since the audio signal can be highly compressed and encoded, it is possible to encode the audio signal with a small amount of information, and the storage space can be reduced when storing audio information. In addition, when voice information is communicated, the communication time can be shortened. In addition, there is an effect that it is possible to suppress deterioration in sound quality when an audio signal is decoded and reproduced.
[Brief description of the drawings]
FIG. 1 is a block diagram of an encoding apparatus for performing a speech signal encoding method of the present invention.
FIG. 2 is a configuration diagram of a speech signal encoding method performed by the encoding device of FIG. 1;
FIG. 3 is a flowchart showing a processing procedure of a spectrum selection unit in FIG. 2;
4 is an explanatory diagram showing peak selection performed by a spectrum selection unit in FIG. 2; FIG.
[Explanation of symbols]
DESCRIPTION OF SYMBOLS 1 Encoding apparatus 2 Microphone 3 A / D converter 4 Code processing part 4a I / O apparatus 4b CPU
4c ROM
4d RAM
5 Frame Orthogonal Transformer 6 Level Calculator 7 Spectrum Selector 8 Level Compensator 9 Spectrum Encoder

Claims

フレーム化直交変換部と、レベル計算部と、スペクトル選択部と、スペクトル符号化部と、を備える計算機が音声信号を符号化する音声信号符号化方法において、
前記フレーム化直交変換部が、デジタル変換された音声信号データを所定の大きさのフレームでフレーム化した後に直交変換してスペクトル化し、
前記レベル計算部が、前記スペクトル化した全ての音声信号データのスペクトルの信号レベルの総和を第１レベルとして求め、
前記スペクトル選択部が、第１スペクトル選択処理、第２スペクトル選択処理、レベル計算処理、および、レベル判断処理を順に実行することで、前記スペクトル化した音声信号データのスペクトルを選択し、
前記スペクトル符号化部が、前記スペクトル選択部により選択されたスペクトルを可逆符号化し、
前記第１スペクトル選択処理は、スペクトルの全体について、任意のスペクトルから一定の距離内にそのスペクトルの大きさ以上の大きさを持つ別のスペクトルが存在するかどうかを調べ、そのようなスペクトルが存在しなければ、そのスペクトルを第１スペクトルの一つとして選択し、
前記第２スペクトル選択処理は、前記第１スペクトル選択処理で選択された各々の第１スペクトルから、第１スペクトル間距離とは異なる一定距離内で最大の大きさを有するスペクトルを第２スペクトルの一つとして選択し、
前記レベル計算処理は、前記第１スペクトル選択処理および前記第２スペクトル選択処理で選択された全てのスペクトルの信号レベルの総和を第２レベルとして計算し、
前記レベル判断処理は、前記第１レベルに対して前記第２レベルが所定の比率以下なら、前記一定の距離を前記第１スペクトル間距離より少し短縮して設定し、前記第１スペクトル選択処理に処理を戻すことを特徴とする
音声信号符号化方法。In a speech signal encoding method in which a computer including a framing orthogonal transform unit, a level calculation unit, a spectrum selection unit, and a spectrum encoding unit encodes a speech signal,
The framed orthogonal transform unit framing the digitally converted audio signal data into frames of a predetermined size, and then orthogonally transforming into spectrum,
The level calculation section calculates the total sum of the signal levels of the spectra of all of the audio signal data said spectral as the first level,
The spectrum selection unit sequentially executes a first spectrum selection process, a second spectrum selection process, a level calculation process, and a level determination process, thereby selecting a spectrum of the spectralized audio signal data ,
The spectrum coding section, and lossless encoding the spectrum selected by the spectrum selection portion,
The first spectrum selection process checks whether there is another spectrum having a magnitude greater than or equal to the spectrum within a certain distance from an arbitrary spectrum. Otherwise, select that spectrum as one of the first spectra,
In the second spectrum selection process, a spectrum having a maximum size within a certain distance different from the first inter-spectrum distance is selected from each first spectrum selected in the first spectrum selection process. Select as one
The level calculation process calculates a sum of signal levels of all the spectra selected in the first spectrum selection process and the second spectrum selection process as a second level,
In the level determination process, if the second level is less than or equal to a predetermined ratio with respect to the first level, the fixed distance is set slightly shorter than the distance between the first spectra, and the first spectrum selection process is performed. Characterized by returning processing
Audio signal encoding method.