JP2002006841A

JP2002006841A - Acoustic signal encoding system

Info

Publication number: JP2002006841A
Application number: JP2000190391A
Authority: JP
Inventors: Toshio Motegi; 敏雄茂出木
Original assignee: Dai Nippon Printing Co Ltd
Current assignee: Dai Nippon Printing Co Ltd
Priority date: 2000-06-26
Filing date: 2000-06-26
Publication date: 2002-01-11
Anticipated expiration: 2020-06-26
Also published as: JP4220108B2

Abstract

PROBLEM TO BE SOLVED: To provide an acoustic signal encoding system which is capable of obtaining the encoding data corresponding to a musical instrument part by extracting the waveforms approximate to the distortion waveforms previously prepared by a MIDI sound source from acoustic signals and encoding the same. SOLUTION: A section defining section 2 defines analysis sections to the digital acoustic signals of PCM, etc. Next, a signal analyzing section 3 performs the sinusoidal wave analysis of the acoustic signal in the respective analysis sections. The analyzed sinusoidal waves (the left side in Figure) are grouped according to prescribed rules and the sinusoidal waves in the groups are synthesized to form the distortion waves (the right side in Figure). On the other hand, the patterns corresponding to the inputted acoustic signals are extracted from the distortion waveform patterns accumulated in a distortion waveform pattern accumulating section 4 and are inputted as the basic distortion waveform patterns. In succession, the basic distortion waveform patterns are deformed to be analogous with the synthesized distortion waves to form the distortion waveform variations. A code forming section 6 encodes the frequencies and amplitude of the distortion waveform variations as code sets together with the start time and end time of the analysis sections.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、放送メディア（ラジ
オ、テレビ）、通信メディア（ＣＳ映像・音声配信、イ
ンターネット音楽配信、通信カラオケ）、パッケージメ
ディア（ＣＤ、ＭＤ、カセット、ビデオ、ＬＤ、ＣＤ−
ＲＯＭ、ゲームカセット、携帯音楽プレーヤ向け固体メ
モリ媒体）などで提供する各種オーディオコンテンツの
制作、並びに、専用携帯音楽プレーヤ、携帯電話・ＰＨ
Ｓ・ポケベルなどに向けたボーカルを含む音楽コンテン
ツ、歌舞伎・能・読経・詩歌など文芸作品の音声素材ま
たは語学教育音声教材のＭＩＤＩ伝送に利用するのに好
適な音響信号の符号化技術に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to broadcast media (radio, television), communication media (CS video / audio distribution, Internet music distribution, communication karaoke), package media (CD, MD, cassette, video, LD, CD). −
Production of various audio contents provided by ROMs, game cassettes, solid-state memory media for portable music players, etc., and dedicated portable music players, mobile phones and PHs
The present invention relates to an audio signal encoding technique suitable for use in MIDI transmission of music contents including vocals for S. pagers, audio materials of literary works such as kabuki, noh, chanting and poetry, or language teaching audio teaching materials.

【０００２】[0002]

【従来の技術】音響信号に代表される時系列信号には、
その構成要素として複数の周期信号が含まれている。こ
のため、与えられた時系列信号にどのような周期信号が
含まれているかを解析する手法は、古くから知られてい
る。例えば、フーリエ解析は、与えられた時系列信号に
含まれる周波数成分を解析するための方法として広く利
用されている。2. Description of the Related Art Time-series signals represented by acoustic signals include:
The components include a plurality of periodic signals. For this reason, a method of analyzing what periodic signal is included in a given time-series signal has been known for a long time. For example, Fourier analysis is widely used as a method for analyzing frequency components included in a given time-series signal.

【０００３】このような時系列信号の解析方法を利用す
れば、音響信号を符号化することも可能である。コンピ
ュータの普及により、原音となるアナログ音響信号を所
定のサンプリング周波数でサンプリングし、各サンプリ
ング時の信号強度を量子化してデジタルデータとして取
り込むことが容易にできるようになってきており、こう
して取り込んだデジタルデータに対してフーリエ解析な
どの手法を適用し、原音信号に含まれていた周波数成分
を抽出すれば、各周波数成分を示す符号によって原音信
号の符号化が可能になる。[0003] If such a time-series signal analysis method is used, it is possible to encode an audio signal. With the spread of computers, it has become easier to sample analog audio signals as original sounds at a predetermined sampling frequency, quantize the signal strength at each sampling, and take in as digital data. If a method such as Fourier analysis is applied to the data and frequency components included in the original sound signal are extracted, the original sound signal can be encoded by a code indicating each frequency component.

【０００４】一方、電子楽器による楽器音を符号化しよ
うという発想から生まれたＭＩＤＩ（Musical Instrume
nt Digital Interface）規格も、パーソナルコンピュー
タの普及とともに盛んに利用されるようになってきてい
る。このＭＩＤＩ規格による符号データ（以下、ＭＩＤ
Ｉデータという）は、基本的には、楽器のどの鍵盤キー
を、どの程度の強さで弾いたか、という楽器演奏の操作
を記述したデータであり、このＭＩＤＩデータ自身に
は、実際の音の波形は含まれていない。そのため、実際
の音を再生する場合には、楽器音の波形（歪み波形パタ
ーン）を記憶したＭＩＤＩ音源が別途必要になるが、そ
の符号化効率の高さが注目を集めており、ＭＩＤＩ規格
による符号化および復号化の技術は、現在、パーソナル
コンピュータを用いて楽器演奏、楽器練習、作曲などを
行うソフトウェアに広く採り入れられている。On the other hand, MIDI (Musical Instrume) was born from the idea of encoding musical instrument sounds by electronic musical instruments.
The Digital Interface (nt Digital Interface) standard has also been actively used with the spread of personal computers. Code data according to the MIDI standard (hereinafter, MID)
I data) is basically data describing an operation of playing a musical instrument, such as which keyboard key of the musical instrument was played and at what strength, and the MIDI data itself contains the actual sound. No waveform is included. Therefore, when an actual sound is reproduced, a MIDI sound source storing a waveform (distortion waveform pattern) of the musical instrument sound is required separately. Encoding and decoding techniques are now widely adopted in software for performing musical instruments, practicing musical instruments, composing music, and the like using a personal computer.

【０００５】そこで、音響信号に代表される時系列信号
に対して、所定の手法で解析を行うことにより、その構
成要素となる周期信号を抽出し、抽出した周期信号をＭ
ＩＤＩデータを用いて符号化しようとする提案がなされ
ている。例えば、特開平１０−２４７０９９号公報、特
開平１１−７３１９９号公報、特開平１１−７３２００
号公報、特開平１１−９５７５３号公報、特開平１２−
９９００９号公報、特開平１２−９９０９３号公報、特
願平１１−５８４３１号明細書、特願平１１−１７７８
７５号明細書、特願平１１−３２９２９７号明細書に
は、任意の時系列信号について、構成要素となる周波数
を解析し、その解析結果からＭＩＤＩデータを作成する
ことができる種々の方法が提案されている。Therefore, by analyzing a time-series signal represented by an acoustic signal by a predetermined method, a periodic signal as a component of the signal is extracted, and the extracted periodic signal is converted to an M signal.
There have been proposals to encode using IDI data. For example, JP-A-10-247099, JP-A-11-73199, JP-A-11-73200
JP-A-11-95753, JP-A-11-95753
No. 99009, Japanese Patent Application Laid-Open No. 12-99093, Japanese Patent Application No. 11-58431, Japanese Patent Application No. 11-1778.
Japanese Patent Application No. 75 and Japanese Patent Application No. 11-329297 propose various methods for analyzing frequency as a component element of an arbitrary time-series signal and creating MIDI data from the analysis result. Have been.

【０００６】[0006]

【発明が解決しようとする課題】上記各公報または明細
書において提案してきたＭＩＤＩ符号化方式により、ボ
ーカルを含む音楽の再現が可能となったが、オーディオ
符号化一般に適用するためには、複数の楽器音色を再生
できるようなマルチトラック形式符号化を実現する必要
がある。これを実現するには、「音源分離技術」、「単
音楽器のＭＩＤＩ符号化」の２つの問題があり、楽器音
から演奏された音符を認識してＭＩＤＩ楽器で再現する
という理想的なＭＩＤＩ符号化形式の実現は現状極めて
困難である。The MIDI encoding scheme proposed in each of the above publications and specifications enables reproduction of music including vocals. However, in order to apply the technique to audio encoding in general, a plurality of pieces of music must be reproduced. It is necessary to realize multi-track format encoding capable of reproducing musical instrument sounds. In order to realize this, there are two problems of "sound source separation technology" and "MIDI encoding of a single musical instrument", and an ideal MIDI code for recognizing a note played from an instrumental sound and reproducing it with a MIDI instrument. The realization of a generalized form is currently extremely difficult.

【０００７】音源分離技術に関しては、ミキシングされ
た演奏録音から楽器パート、ボーカルパート別に音源分
離し、パートごとにＭＩＤＩ符号化を行う必要がある
が、これは現状では技術的に不可能といわれ、ある程度
実現できるようになっても不完全さが残るという問題が
ある。単音楽器のＭＩＤＩ符号化に関しては、ピアノ、
ギター、バイオリンなど弦楽器系の楽器が倍音の割合が
少なく、ＭＩＤＩ符号化を行い易いが、倍音を多く含む
ブラスなど金管楽器系やノイズ音源である打楽器系に対
してはＭＩＤＩ符号化が難しいという問題がある。With regard to the sound source separation technique, it is necessary to separate the sound source for each instrument part and vocal part from the mixed performance recording and to perform MIDI coding for each part, but this is said to be technically impossible at present. There is a problem that imperfectness remains even if it can be realized to some extent. For MIDI encoding of single music instruments, piano,
String instruments such as guitars and violins have a low proportion of overtones and are easy to perform MIDI encoding. However, MIDI encoding is difficult for brass instruments such as brass containing many overtones and percussion instruments which are noise sources. There is.

【０００８】一方、音響信号を正弦波にスペクトル分解
し、スペクトル情報を符号化する分析合成符号化方式が
既にＭＰ３（MPEG-1 layer3）などで実用化されてい
る。ＰＣＭやＡＤＰＣＭといった波形符号化方式に比べ
情報量が少なく、ＭＩＤＩと同様に再生速度やピッチを
変更できるという特徴がある。しかし、この方式には次
の３つの問題がある。On the other hand, an analysis / synthesis coding method for spectrally decomposing an acoustic signal into a sine wave and coding the spectrum information has already been put to practical use in MP3 (MPEG-1 layer3) or the like. The amount of information is smaller than that of a waveform encoding method such as PCM or ADPCM, and the reproduction speed and pitch can be changed similarly to MIDI. However, this method has the following three problems.

【０００９】第１に、圧縮率をあまり上げられず、ＭＰ
３でも実用的圧縮率は１／１０程度であるという問題が
ある。その理由として楽器音などは無限に続く倍音成分
を可聴周波数の最大まで忠実に符号化する必要があるた
めで、信号を正弦波に分解する考え方のデメリットであ
る。第２に、分解した正弦波の合成処理負荷の問題があ
る。復号側では、スペクトル分解された１００以上の正
弦波をリアルタイムに合成することが必要となってい
る。この技術については、既に確立されているが、復号
処理は煩雑なものとなっている。第３に、ハイパーソニ
ック領域の音再現の問題がある。人間の最大可聴周波数
は２２ｋＨｚであり、そのためＣＤのサンプリング周波
数は４４．１ｋＨｚと決められている。しかし、最近で
は２２ｋＨｚ以上のハイパーソニック領域もある程度知
覚できるという説が濃厚になり、次世代のＤＶＤ−ａｕ
ｄｉｏなどではサンプリング周波数を９６ｋＨｚに上げ
る方向にある。（アナログのＬＰレコードでは２２ｋＨ
ｚ以上も再現される。）そうなるとスペクトル符号量は
４倍になる。First, the compression ratio cannot be increased so much that the MP
Even with 3, there is a problem that the practical compression ratio is about 1/10. The reason for this is that instrumental sounds and the like need to faithfully encode infinite harmonic components to the maximum audible frequency, which is a disadvantage of the idea of decomposing a signal into a sine wave. Secondly, there is a problem of a combined processing load of the decomposed sine waves. On the decoding side, it is necessary to synthesize 100 or more sine waves that have been spectrally decomposed in real time. This technique has already been established, but the decoding process is complicated. Third, there is a problem of sound reproduction in the hypersonic region. The maximum human audible frequency is 22 kHz, and therefore the sampling frequency of a CD is determined to be 44.1 kHz. However, recently, the theory that a hypersonic region of 22 kHz or more can be perceived to some extent has been enriched, and the next generation DVD-au
In the case of Dio, etc., the sampling frequency tends to be increased to 96 kHz. (22 kHz for analog LP records)
z and more are also reproduced. Then, the amount of spectrum code is quadrupled.

【００１０】一方、ＭＩＤＩ音源には、ＧＭ（General
MIDI）標準で１２８種、拡張音源では数千種類以上の歪
み波形が蓄積され、１６〜３２種の歪み波形をリアルタ
イムに合成できる。そこで、本発明は上記のような点に
鑑み、音響信号から、ＭＩＤＩ音源があらかじめ用意し
ている歪み波形に近いものを抽出して符号化することに
より、楽器パートに対応した符号化データを得ることが
可能な音響信号符号化システムを提供することを課題と
する。On the other hand, MIDI sound sources include GM (General
The MIDI) standard accumulates 128 types of distortion waveforms and the extended sound source accumulates thousands or more types of distortion waveforms, and can synthesize 16 to 32 types of distortion waveforms in real time. In view of the above, the present invention obtains encoded data corresponding to a musical instrument part by extracting from a sound signal a signal close to a distortion waveform prepared in advance by a MIDI sound source and encoding the extracted signal. It is an object to provide an audio signal encoding system capable of performing the above.

【００１１】[0011]

【課題を解決するための手段】上記課題を解決するた
め、本発明では、音響信号符号化システムを、複数の歪
み波形パターンを蓄積する歪み波形パターン蓄積手段
と、与えられた音響信号に対して所定の解析区間を複数
個定義するための区間定義手段と、前記歪み波形パター
ン蓄積手段から前記音響信号に対応する歪み波形パター
ンを検索して、基本歪み波形パターンとして入力する歪
み波形パターン検索手段と、前記解析区間の１つの信号
切片波形に対して、前記基本歪み波形パターンの周波数
と振幅を変化させた歪み波形バリエーションを複数作成
し、前記複数の歪み波形バリエーションの合成波形が前
記信号切片波形に類似するように、前記複数の歪み波形
バリエーションを決定する信号解析手段と、前記信号解
析手段により決定された各々の歪み波形バリエーション
の周波数と振幅、および前記解析区間の開始時刻と終了
時刻の情報を所定の符号セットで符号化することにより
音響符号化データを作成する符号形成手段を有する構成
としたことを特徴とする。In order to solve the above-mentioned problems, according to the present invention, an audio signal encoding system includes a distortion waveform pattern accumulating means for accumulating a plurality of distortion waveform patterns, Section definition means for defining a plurality of predetermined analysis sections; a distortion waveform pattern search means for searching for a distortion waveform pattern corresponding to the acoustic signal from the distortion waveform pattern storage means and inputting the distortion waveform pattern as a basic distortion waveform pattern; For one signal intercept waveform in the analysis section, a plurality of distortion waveform variations in which the frequency and amplitude of the basic distortion waveform pattern are changed are created, and the composite waveform of the plurality of distortion waveform variations is converted to the signal intercept waveform. Similarly, the signal analyzing means for determining the plurality of distortion waveform variations and the signal analyzing means The frequency and amplitude of each distortion waveform variation, and information of the start time and end time of the analysis section is encoded by a predetermined code set to have a code forming means for creating acoustic encoded data. Features.

【００１２】本発明によれば、上記のような構成とし、
音響信号に対応した基本歪み波形パターンを検索して入
力し、この基本歪み波形パターンの周波数と振幅を変化
させた複数の歪み波形バリエーションを作成し、この歪
み波形バリエーションの中から、音響信号から得られる
信号切片波形に最も近いものを選択して符号化するよう
にしたので、あらかじめ音源として用意している楽器パ
ートに近い成分ごとに符号化を行うことが可能となる。According to the present invention, the above-mentioned configuration is provided,
A basic distortion waveform pattern corresponding to an acoustic signal is searched for and input, and a plurality of distortion waveform variations in which the frequency and the amplitude of the basic distortion waveform pattern are changed are created. Since the closest signal intercept waveform is selected and encoded, it is possible to perform encoding for each component close to the musical instrument part prepared in advance as a sound source.

【００１３】[0013]

【発明の実施の形態】以下、本発明の実施形態について
図面を参照して詳細に説明する。まず、最初に本発明に
よる音響信号符号化システムにおける基本的な考え方に
ついて説明する。ＰＣＭなどの手法によりデジタル化さ
れた音響信号は、周知の手法である短時間フーリエ変
換、一般化調和解析などにより以下の（数式１）に示す
ように正弦波分解できる。Embodiments of the present invention will be described below in detail with reference to the drawings. First, the basic concept of the audio signal encoding system according to the present invention will be described. An acoustic signal digitized by a method such as PCM can be decomposed into a sine wave by a well-known method such as short-time Fourier transform or generalized harmonic analysis as shown in the following (Formula 1).

【００１４】（数式１）ｇ（ｔ） ≒ Σ_i｛Ａ_iｓｉｎ（２πｆ_iｔ）＋Ｂ_iｃｏｓ
（２πｆ_iｔ）｝[0014] (Equation 1) g (t) ≒ Σ i {A i sin (2πf i t) + B i cos
(2πf _i t)}

【００１５】この（数式１）における［ｆ_i，Ａ_i，
Ｂ_i］によりｇ（ｔ）を再現するのが、従来より行なわ
れている分析合成符号化方式の原理である。上記（数式
１）を更に整理すると、以下の（数式２）のように変形
される。In this (Equation 1), [f _i , A _i ,
Reproducing g (t) by B _i ] is the principle of the conventional analysis-synthesis coding method. When the above (Equation 1) is further arranged, it is transformed into the following (Equation 2).

【００１６】（数式２）ｇ（ｔ）≒Σ_nα_n［Σ_j｛Ｐ_njｓｉｎ(２πｊｆ_nｔ)＋Ｑ
_njｃｏｓ(２πｊｆ_nｔ)｝］(Equation 2) g (t) ≒ Σ _n α _n [Σ _j ｛P _nj sin (2πjf _n t) + Q
_nj cos (2πjf _n t)｝]

【００１７】この（数式２）において、ｊは周波数を整
数倍することを示す整数であり、ある周波数ｆ_nの整数
倍の周波数ｊｆ_nを有する正弦波がグループ化された状
態を示している。上記（数式２）を更に整理すると、以
下の（数式３）のように変形される。In this (Formula 2), j is an integer indicating that the frequency is multiplied by an integer, and indicates a state in which sine waves having a frequency jf _n that is an integer multiple of a certain frequency f _n are grouped. When the above (Equation 2) is further arranged, it is transformed into the following (Equation 3).

【００１８】（数式３）ｇ（ｔ） ≒ Σ_nα_nｕ_n（２πｆ_nｔ）[0018] (Equation 3) g (t) ≒ Σ n α n u n (2πf n t)

【００１９】この（数式３）において、ｕ_n（２πｆ
_nｔ）は基本周波数ｆ_nをもつ歪み波であり、この歪み波
としてＭＩＤＩ音源に用意されているものの中から最も
形状が近いものを選べば、理論上分析合成符号化方式と
同様にｇ（ｔ）を再現することができる。本発明による
利点は、（数式１）におけるｉでΣ計算を行うのに比
べ、（数式３）におけるｎでΣ計算を行う方が、計算回
数が圧倒的に少なく、符号化するパラメータ［ｆ_n，
α_n］も少なくなる。ここで問題となるのは、この歪み
波ｕ_n（２πｆ_nｔ）の決定方式であり、入力音響信号ｇ
（ｔ）から直接歪み波ｕ _n（２πｆ_nｔ）を求めるのは困
難である。そこで、本発明では、あらかじめＭＩＤ音源
等の歪み波形パターン蓄積手段に蓄積されている歪み波
形パターンから、入力音響信号に対応するものを選択す
る手法を採る。In this (Equation 3), u_n(2πf
_nt) is the fundamental frequency f_nIs a distorted wave having
Among the ones prepared for MIDI sound sources
If you choose one with a similar shape, it is theoretically possible to use
Similarly, g (t) can be reproduced. According to the invention
The advantage is that Σ calculation is performed with i in (Equation 1).
In addition, it is better to perform Σ calculation with n in (Equation 3).
The number is overwhelmingly small, and the parameter [f_n,
α_n] Also decreases. The problem here is that this distortion
Wave u_n(2πf_nt), the input sound signal g
(T) direct distortion wave u _n(2πf_nIt is difficult to find t)
It is difficult. Therefore, in the present invention, the MID sound source is
Distortion waves stored in the distortion waveform pattern storage means
Select the one corresponding to the input sound signal from the shape patterns.
Adopt a technique that

【００２０】続いて、本システムの具体的な構成につい
て説明する。図１は、本発明による音響信号符号化シス
テムの構成を示す機能ブロック図である。図１におい
て、音響信号入力部１は、アナログ信号である音響信号
をＰＣＭ等の手法によりデジタル化したデジタル音響信
号を入力する機能を有する。区間定義部２は、音響信号
の解析のために、時系列の音響信号を所定の区間で区切
る機能を有する。信号解析部３は、区間定義部２で定義
された区間単位で音響信号の解析を行う機能を有する。
歪み波形パターン蓄積部４は、複数の歪み波形パターン
を蓄積したものであり、例えば、１２８種の歪み波形を
有するＭＩＤＩ音源で実現される。歪み波形パターン検
索部５は、歪み波形パターン蓄積部４に蓄積された歪み
波形パターンの中から、入力された音響信号に対応する
ものを検索し、基本歪み波形パターンとして入力するた
めのものである。符号形成部６は、信号解析部３により
解析された音響信号をＭＩＤＩなどのデータに符号化す
る機能を有する。Next, a specific configuration of the present system will be described. FIG. 1 is a functional block diagram showing a configuration of an audio signal encoding system according to the present invention. In FIG. 1, an audio signal input unit 1 has a function of inputting a digital audio signal obtained by digitizing an analog audio signal by a technique such as PCM. The section definition unit 2 has a function of dividing a time-series sound signal into predetermined sections in order to analyze the sound signal. The signal analysis unit 3 has a function of analyzing an audio signal in units of sections defined by the section definition unit 2.
The distortion waveform pattern storage unit 4 accumulates a plurality of distortion waveform patterns, and is realized by, for example, a MIDI sound source having 128 types of distortion waveforms. The distortion waveform pattern search unit 5 searches for a distortion waveform pattern stored in the distortion waveform pattern storage unit 4 corresponding to the input acoustic signal, and inputs it as a basic distortion waveform pattern. . The code forming section 6 has a function of coding the acoustic signal analyzed by the signal analyzing section 3 into data such as MIDI.

【００２１】次に、図１に示した音響信号符号化システ
ムの処理動作について説明する。まず、音響信号入力部
１よりＰＣＭによりデジタル化された音響信号を入力す
る。ここでは、例えば、図２に示すような波形で表され
る音響信号が入力されたものとする。図２の例では、横
軸に時間ｔ、縦軸に振幅（強度）をとって、この音響信
号を示している。Next, the processing operation of the audio signal encoding system shown in FIG. 1 will be described. First, an audio signal digitized by PCM is input from the audio signal input unit 1. Here, for example, it is assumed that an acoustic signal represented by a waveform as shown in FIG. 2 has been input. In the example of FIG. 2, the horizontal axis indicates time t, and the vertical axis indicates amplitude (intensity), and indicates this acoustic signal.

【００２２】続いて、この解析対象となる音響信号の時
間軸上に対して、区間定義部２が複数の解析区間を設定
する。図２に示す例では、時間軸ｔ上に等間隔に６つの
時刻ｔ１〜ｔ６が定義され、これら各時刻を始点および
終点とする５つの解析区間ｄ１〜ｄ５が設定されてい
る。図２の例では、全て同一の区間長をもった解析区間
が設定されているが、個々の解析区間ごとに区間長を変
えるようにしてもかまわない。あるいは、隣接する解析
区間が時間軸上で部分的に重なり合うような区間設定を
行ってもかまわない。こうして解析区間が設定された
ら、各解析区間ごとの音響信号（以下、区間信号と呼ぶ
ことにする）について、信号解析部３により解析が行わ
れる。Subsequently, the section definition unit 2 sets a plurality of analysis sections on the time axis of the audio signal to be analyzed. In the example shown in FIG. 2, six times t1 to t6 are defined at regular intervals on the time axis t, and five analysis sections d1 to d5 having these times as a start point and an end point are set. In the example of FIG. 2, the analysis sections having the same section length are all set, but the section length may be changed for each analysis section. Alternatively, a section may be set such that adjacent analysis sections partially overlap on the time axis. When the analysis section is set in this way, the signal analysis unit 3 analyzes the sound signal (hereinafter, referred to as a section signal) for each analysis section.

【００２３】信号解析部３においては、まず各解析区間
ごとの一般化調和解析が行われ、区間信号を複数の正弦
波に分解する。ここでは、分解により１２８０種の異な
る正弦波が得られる。各正弦波の振幅は、区間信号と各
周波数を有する正弦関数、余弦関数との相関に基づいて
決定される。一般化調和解析は、周知の周波数解析手法
であり、前掲の公報および明細書にも記載があるので、
詳細な説明は省略する。The signal analyzer 3 first performs a generalized harmonic analysis for each analysis section, and decomposes the section signal into a plurality of sine waves. Here, 1280 different sine waves are obtained by the decomposition. The amplitude of each sine wave is determined based on the correlation between the section signal, a sine function having each frequency, and a cosine function. Generalized harmonic analysis is a well-known frequency analysis method, and is described in the above-mentioned publication and specification,
Detailed description is omitted.

【００２４】続いて、分解された正弦波の中から振幅値
が大きいものを選択する。選択する正弦波の数は、あら
かじめその数自体を設定しておくこともできるし、振幅
値の閾値を設定しておき、閾値を超えた数とすることも
できる。ここで、選択される正弦波の数は最終的に和音
を構成する単音の数となる。正弦波が選択されたら、各
正弦波の周波数を基本周波数とし、この基本周波数の整
数倍の周波数を有する正弦波をグループ化する。次に、
同一グループに属する正弦波を合成し、歪み波を作成す
る。図３に分解された正弦波と合成された歪み波の一例
を示す。図３中、左側には周波数の異なる６つの正弦波
が示されている。このうち、上から３番目と４番目の正
弦波は、それぞれ１番上の正弦波の２倍、３倍という整
数倍の周波数となっている。このため、これら３つの正
弦波は１つのグループとして合成され、図３中、右上の
ような歪み波となる。同様に、上から５番目と６番目の
正弦波は、それぞれ上から２番目の正弦波の３倍、５倍
という整数倍の周波数となっているので、１つのグルー
プとして合成され、図３中、右下のような歪み波とな
る。これらの処理を各グループについて行うことによ
り、グループ数分の歪み波が作成されることになる。グ
ループ化の手法としては、このようなある特定の周波数
を有する正弦波の整数倍の周波数を有する正弦波でグル
ープ化する手法に限定せず、他の手法で行っても良い
が、このように、整数倍の周波数の正弦波をグループ化
することにより、上記（数式２）、（数式３）で示した
ように、演算処理の負荷を軽減することが可能となる。
また、上記手法によりグループ化する際、基本周波数と
なるべき正弦波がグループ内に含まれない場合がある。
これは基本周波数となるべき正弦波の振幅が小さいため
に最初の選択から外れてしまうためである。このような
場合は、他のグループに属する適当な正弦波の周波数を
基本周波数として、その整数倍の周波数を有する正弦波
をグループ化する。Subsequently, a signal having a large amplitude value is selected from the decomposed sine waves. The number of sine waves to be selected may be set in advance, or a threshold value of the amplitude value may be set, and the number may exceed the threshold value. Here, the number of selected sine waves is the number of single tones that eventually make up a chord. When a sine wave is selected, the frequency of each sine wave is set as a fundamental frequency, and sine waves having a frequency that is an integral multiple of the fundamental frequency are grouped. next,
A sine wave belonging to the same group is synthesized to create a distorted wave. FIG. 3 shows an example of the distorted wave combined with the decomposed sine wave. In FIG. 3, six sine waves having different frequencies are shown on the left side. Of these, the third and fourth sine waves from the top have integer multiples of twice and three times the top sine wave, respectively. For this reason, these three sine waves are combined as one group, and become a distorted wave as shown in the upper right in FIG. Similarly, since the fifth and sixth sine waves from the top have integer multiples of three times and five times the second sine wave from the top, respectively, they are combined as one group and are combined in FIG. And a distorted wave as shown in the lower right. By performing these processes for each group, distortion waves for the number of groups are created. The method of grouping is not limited to the method of grouping with a sine wave having an integer multiple of the frequency of a sine wave having a certain specific frequency, but may be performed by another method. By grouping the sine waves having a frequency of an integral multiple, it is possible to reduce the load of the arithmetic processing as shown in (Equation 2) and (Equation 3).
Further, when grouping is performed by the above method, a sine wave to be a fundamental frequency may not be included in the group.
This is because the amplitude of the sine wave to be the fundamental frequency is small and thus deviates from the first selection. In such a case, a sine wave having a frequency that is an integral multiple of an appropriate sine wave frequency belonging to another group is set as a basic frequency.

【００２５】一方、ＭＩＤＩ音源で実現される歪み波形
パターン蓄積部４からは、音響信号入力部１より入力さ
れた音響信号に対応する歪み波形パターンが、歪み波形
パターン検索部５により検索され、基本歪み波形パター
ンとして入力される。この歪み波形パターンの検索は、
解析される音響信号を別途音声として出力し、これを検
索者が聴いて、どの歪み波形パターンにするかを決定
し、歪み波形パターンを識別する情報を検索キーとして
入力することにより行なわれる。例えば、検索者がこの
音響信号を音声で聴いたときに、ピアノの音で表現する
のが最適だと感じたら、ピアノの音色を表現した歪み波
形パターンを検索する。On the other hand, a distortion waveform pattern corresponding to the acoustic signal input from the acoustic signal input unit 1 is retrieved from the distortion waveform pattern storage unit 4 realized by the MIDI sound source by the distortion waveform pattern retrieval unit 5, and It is input as a distortion waveform pattern. Searching for this distortion waveform pattern
The analysis is performed by separately outputting a sound signal to be analyzed as a voice, listening to the sound signal, deciding which distortion waveform pattern to use, and inputting information for identifying the distortion waveform pattern as a search key. For example, when the searcher listens to the sound signal by voice and finds it best to represent the sound of a piano, the searcher searches for a distortion waveform pattern expressing the tone of the piano.

【００２６】信号解析部３では、歪み波形パターン検索
部５により検索され、入力された歪み波形パターンを基
本歪み波形パターンとし、この基本歪み波形パターンの
周波数、振幅を変化させた歪み波形バリエーションを作
成する。作成する歪み波形バリエーションの数は、設定
により変更することができるが、周波数、振幅ともに１
２８通り設定したとすると、１２８×１２８通りの歪み
波形バリエーションが作成されることになる。In the signal analysis unit 3, the distortion waveform pattern retrieved by the distortion waveform pattern retrieval unit 5 is used as a basic distortion waveform pattern, and a distortion waveform variation in which the frequency and amplitude of the basic distortion waveform pattern are changed is created. I do. The number of distortion waveform variations to be created can be changed by setting.
If 28 types are set, 128 × 128 types of distortion waveform variations are created.

【００２７】この１２８×１２８通りの歪み波形バリエ
ーションの中から、上記のようにグループ化された正弦
波を合成することにより得られた各歪み波に近い波形形
状を有するものを１つずつ選択する。この結果、所定数
の歪み波形バリエーションが選択されることになる。From the 128 × 128 kinds of distortion waveform variations, one having a waveform shape close to each distortion wave obtained by synthesizing the grouped sine waves as described above is selected one by one. . As a result, a predetermined number of distortion waveform variations are selected.

【００２８】次に、符号形成部６が、選択された各歪み
波形バリエーションの周波数と振幅、および解析区間の
開始時刻と終了時刻の情報を所定の符号セットで出力す
る。ここで、符号化される周波数ｆ_nおよび振幅α_nは、
上記（数式３）に対応する。また、解析区間について
は、例えば図２に示す解析区間ｄ１の場合、開始時刻は
ｔ１、終了時刻はｔ２となる。Next, the code forming section 6 outputs information on the frequency and amplitude of each selected distortion waveform variation and the start time and end time of the analysis section in a predetermined code set. Here, the frequency f _n and the amplitude α _n to be coded are
This corresponds to the above (Equation 3). For the analysis section, for example, in the case of the analysis section d1 shown in FIG. 2, the start time is t1 and the end time is t2.

【００２９】以上の処理を音響信号の全解析区間に対し
て行い、各解析区間ごとに符号セットを出力する。この
際、前の解析区間の終了時刻が、後の解析区間の開始時
刻に近接しており、前の解析区間における符号セットの
周波数および振幅が、後の解析区間における符号セット
の周波数および振幅に類似している場合、２つの解析区
間における符号セットの統合を行う。具体的には、前の
解析区間における符号セットの終了時刻を後の解析区間
における符号セットの終了時刻に変更し、後の終了時刻
の符号セットを削除する。これは、近接する解析区間に
おける音が類似している場合、同一の音であるとみな
し、符号化の際に１つの音にまとめるために行なわれ
る。The above processing is performed for all analysis sections of the audio signal, and a code set is output for each analysis section. At this time, the end time of the previous analysis section is close to the start time of the subsequent analysis section, and the frequency and amplitude of the code set in the previous analysis section are changed to the frequency and amplitude of the code set in the subsequent analysis section. If they are similar, the code sets in the two analysis sections are integrated. Specifically, the code set end time in the previous analysis section is changed to the code set end time in the subsequent analysis section, and the code set at the later end time is deleted. This is performed so that when sounds in adjacent analysis sections are similar, they are regarded as the same sound and are combined into one sound at the time of encoding.

【００３０】符号化データとしては、特に限定する必要
はないが、ＭＩＤＩ規格を利用することが好ましい。Ｍ
ＩＤＩ規格を利用した場合、上記符号セットの周波数情
報は、ＭＩＤＩ規格で定義されているノートナンバーに
対応する１２８種の周波数から選択され、振幅情報は、
ＭＩＤＩ規格で定義されている１２８段階のベロシティ
で記述され、開始時刻および終了時刻がＳＭＦ規格のデ
ルタタイムで記述され、符号セットはノートオンイベン
トおよびノートオフイベントの１対で記述される。Although there is no particular limitation on the encoded data, it is preferable to use the MIDI standard. M
When the IDI standard is used, the frequency information of the code set is selected from 128 frequencies corresponding to the note numbers defined in the MIDI standard, and the amplitude information is
It is described by 128 levels of velocity defined by the MIDI standard, the start time and the end time are described by the delta time of the SMF standard, and the code set is described by a pair of a note-on event and a note-off event.

【００３１】このようにして得られた音響符号化データ
は、蓄積手段または伝送手段を介して音響データ復号化
システムにて復号化され、音響信号として発せられる。
図４に音響データ復号化システムの機能ブロック図を示
す。図４において、歪み波形パターン蓄積部４は、図１
に示したものと同一の機能を有するものであるため、図
１と同一符号で示している。符号化データ入力部７は音
響符号化データを入力するためのものである。歪み波形
バリエーション再現部８は、入力された音響符号化デー
タに基づいて、歪み波形パターン検索部９から得られる
基本歪み波形パターンに変更を加えて、歪み波形バリエ
ーションを再現する機能を有する。歪み波形パターン検
索部９は、歪み波形バリエーション再現部８の指示に従
い、歪み波形パターン蓄積部４より歪み波形パターンを
検索する機能を有する。信号合成部１０は符号セットご
とに再現された歪み波形バリエーションを合成して音響
信号を再現する機能を有する。The audio coded data thus obtained is decoded by the audio data decoding system via the storage means or the transmission means, and is emitted as an audio signal.
FIG. 4 shows a functional block diagram of the audio data decoding system. In FIG. 4, the distortion waveform pattern accumulating unit 4 includes
1 have the same functions as those shown in FIG. The encoded data input unit 7 is for inputting audio encoded data. The distortion waveform variation reproduction unit 8 has a function of modifying the basic distortion waveform pattern obtained from the distortion waveform pattern search unit 9 based on the input acoustic encoded data to reproduce the distortion waveform variation. The distortion waveform pattern search section 9 has a function of searching the distortion waveform pattern storage section 4 for a distortion waveform pattern according to an instruction from the distortion waveform variation reproduction section 8. The signal synthesizing unit 10 has a function of synthesizing a distortion waveform variation reproduced for each code set to reproduce an acoustic signal.

【００３２】次に、図４に示した音響データ復号化シス
テムの処理動作について説明する。音響信号符号化シス
テムにより得られた音響符号化データが符号化データ入
力部７より入力されると、歪み波形バリエーション再現
部８は、音響符号化データから符号セット単位で符号デ
ータを抽出し、歪み波形バリエーションの再現処理を行
う。歪み波形バリエーションの再現処理としては、ま
ず、符号セットから基本歪み波形パターンを識別する情
報を抽出する。続いて、この識別情報を用いて歪み波形
パターン検索部９が歪み波形パターン蓄積部４から対応
する歪み波形パターンを抽出する。歪み波形バリエーシ
ョン再現部８は、抽出された歪み波形パターンを基本歪
み波形パターンとして取得し、この基本歪み波形パター
ンに対して、符号セットが有する周波数、振幅になるよ
うに変形を行い、歪み波形バリエーションを作成する。
この歪み波形バリエーションは、１つの符号セットにつ
いて、和音を構成する単音の数だけ作成されることにな
る。Next, the processing operation of the audio data decoding system shown in FIG. 4 will be described. When audio encoded data obtained by the audio signal encoding system is input from the encoded data input unit 7, the distortion waveform variation reproducing unit 8 extracts code data from the audio encoded data in code set units, Performs waveform variation reproduction processing. In the reproduction process of the distortion waveform variation, first, information for identifying the basic distortion waveform pattern is extracted from the code set. Subsequently, the distortion waveform pattern search unit 9 extracts the corresponding distortion waveform pattern from the distortion waveform pattern storage unit 4 using the identification information. The distortion waveform variation reproducing unit 8 obtains the extracted distortion waveform pattern as a basic distortion waveform pattern, and deforms the basic distortion waveform pattern so as to have the frequency and amplitude of the code set. Create
This distortion waveform variation is created for one code set by the number of single tones constituting a chord.

【００３３】歪み波形バリエーション再現部８による処
理を全ての符号セットに対して行い、歪み波形バリエー
ションが全て再現されたら、信号合成部１０が再現され
た歪み波形バリエーションの合成を行う。この合成処理
は、各符号セットの開始時刻と終了時刻の情報を基に時
間軸上で行なわれる。このような符号セットの開始時刻
と終了時刻から、符号セットに対応する歪み波を合成す
る処理は、ＭＩＤＩ符号データをＭＩＤＩ音源を用いて
再生する際に従来より行われている手法であるので、詳
細な説明は省略する。信号合成部１０により合成される
ことにより再現された音響信号は、例えば、ＰＣＭの形
式で出力することができる。The processing by the distortion waveform variation reproducing unit 8 is performed for all code sets, and when all the distortion waveform variations are reproduced, the signal synthesizing unit 10 synthesizes the reproduced distortion waveform variations. This combining process is performed on the time axis based on information on the start time and end time of each code set. Since the process of synthesizing the distortion wave corresponding to the code set from the start time and the end time of such a code set is a method conventionally performed when playing back MIDI code data using a MIDI sound source, Detailed description is omitted. The acoustic signal reproduced by being synthesized by the signal synthesizing unit 10 can be output in, for example, a PCM format.

【００３４】以上、本発明の好適な実施形態について説
明したが、本発明は上記実施形態に限定されず、種々の
変形が可能である。例えば、上記実施形態では、音響符
号化データが１つのトラックにより構成される場合につ
いて説明したが、音響符号化データが複数のトラックに
より構成されるようにしても良い。１つのトラックは、
１つの楽器パートに対応させることができるため、複数
のトラックを用意することにより、複数の楽器で演奏さ
れた楽曲を再現することができる。複数のトラックを有
する音響符号化データを作成する場合は、図１に示した
歪み波形パターン蓄積部４より複数の歪み波形パターン
を、歪み波形パターン入力部５により検索して入力する
ようにすれば良い。これにより、信号解析部３が複数の
基本歪み波形パターンを認識し、それぞれについてトラ
ックを作成する。そして各トラックに割り当てられるべ
き符号セットが、各基本歪み波形パターンについて作成
されることになる。ＭＩＤＩ規格の場合、トラックはチ
ャンネルに対応し、基本歪みパターンを識別するための
情報は、楽器音色を示すプログラム番号、バンク番号に
対応する。Although the preferred embodiment of the present invention has been described above, the present invention is not limited to the above embodiment, and various modifications are possible. For example, in the above-described embodiment, the case where the audio encoded data is configured by one track has been described. However, the audio encoded data may be configured by a plurality of tracks. One track is
Since it is possible to correspond to one instrument part, it is possible to reproduce music played by a plurality of instruments by preparing a plurality of tracks. When creating encoded audio data having a plurality of tracks, a plurality of distortion waveform patterns may be retrieved from the distortion waveform pattern storage unit 4 shown in FIG. good. As a result, the signal analyzer 3 recognizes a plurality of basic distortion waveform patterns and creates a track for each of them. Then, a code set to be assigned to each track is created for each basic distortion waveform pattern. In the case of the MIDI standard, a track corresponds to a channel, and information for identifying a basic distortion pattern corresponds to a program number and a bank number indicating a musical instrument timbre.

【００３５】また、上記実施形態では、歪み波形パター
ン蓄積部４にあらかじめ用意されている歪み波形パター
ンを検索し、基本歪み波形パターンとして入力するよう
にしていたが、ユーザが入力された音響に最も適合する
と思われる歪み波形パターンを歪み波形パターン蓄積部
４に随時追加する方法をとることも可能である。その場
合、ＭＩＤＩ規格で定義されるバンク番号の基本歪み波
形パターン未定義領域をユーザが追加した基本歪み波形
パターンとして定義すれば、ＭＩＤＩ規格を逸脱せずに
符号化および復号化を行うことが可能であり、サンプラ
ー機能をもつＭＩＤＩ規格音源を用いてこのような波形
追加を実現できる。In the above embodiment, the distortion waveform pattern prepared in advance in the distortion waveform pattern storage unit 4 is searched and input as the basic distortion waveform pattern. It is also possible to take a method of adding a distortion waveform pattern that seems to be suitable to the distortion waveform pattern storage unit 4 as needed. In this case, if the undefined region of the basic distortion waveform pattern of the bank number defined by the MIDI standard is defined as the basic distortion waveform pattern added by the user, encoding and decoding can be performed without departing from the MIDI standard Thus, such waveform addition can be realized by using a MIDI standard sound source having a sampler function.

【００３６】[0036]

【発明の効果】以上、説明したように本発明によれば、
複数の歪み波形パターンを蓄積する歪み波形パターン蓄
積手段と、与えられた音響信号に対して所定の解析区間
を複数個定義するための区間定義手段と、前記歪み波形
パターン蓄積手段から前記音響信号に対応する歪み波形
パターンを検索して、基本歪み波形パターンとして入力
する歪み波形パターン検索手段と、前記解析区間の１つ
の信号切片波形に対して、前記基本歪み波形パターンの
周波数と振幅を変化させた歪み波形バリエーションを複
数作成し、前記複数の歪み波形バリエーションの合成波
形が前記信号切片波形に類似するように、前記複数の歪
み波形バリエーションを決定する信号解析手段と、前記
信号解析手段により決定された各々の歪み波形バリエー
ションの周波数と振幅、および前記解析区間の開始時刻
と終了時刻の情報を所定の符号セットで符号化すること
により音響符号化データを作成する符号形成手段を有す
るするようにしたので、あらかじめ音源として用意して
いる楽器パートに近い成分ごとに符号化を行うことが可
能となるという効果を奏する。As described above, according to the present invention,
Distortion waveform pattern storage means for storing a plurality of distortion waveform patterns; section definition means for defining a plurality of predetermined analysis sections for a given sound signal; A distortion waveform pattern search means for searching for a corresponding distortion waveform pattern and inputting it as a basic distortion waveform pattern, and changing the frequency and amplitude of the basic distortion waveform pattern with respect to one signal intercept waveform in the analysis section. A plurality of distortion waveform variations are created, and the signal analysis unit and the signal analysis unit determine the plurality of distortion waveform variations so that a composite waveform of the plurality of distortion waveform variations is similar to the signal intercept waveform. Information on the frequency and amplitude of each distortion waveform variation, and the start time and end time of the analysis section Since it has a code forming means for creating acoustic coded data by coding with a predetermined code set, it is possible to perform coding for each component close to a musical instrument part prepared in advance as a sound source It has the effect of becoming.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明による音響信号符号化システムの構成を
示す機能ブロック図である。FIG. 1 is a functional block diagram showing a configuration of an audio signal encoding system according to the present invention.

【図２】区間定義部２による解析区間の定義を説明する
ための図である。FIG. 2 is a diagram for explaining the definition of an analysis section by a section definition unit 2;

【図３】分解された正弦波と、正弦波を合成することに
得られる歪み波を示す図である。FIG. 3 is a diagram showing a decomposed sine wave and a distortion wave obtained by synthesizing the sine wave.

【図４】音響データ復号化システムの構成を示す機能ブ
ロック図である。FIG. 4 is a functional block diagram illustrating a configuration of an audio data decoding system.

【符号の説明】[Explanation of symbols]

１・・・音響信号入力部２・・・区間定義部３・・・信号解析部４・・・歪み波形パターン蓄積部５・・・歪み波形パターン検索部６・・・符号形成部７・・・符号化データ入力部８・・・歪み波形バリエーション再現部９・・・歪み波形パターン検索部１０・・・信号合成部 DESCRIPTION OF SYMBOLS 1 ... Sound signal input part 2 ... Section definition part 3 ... Signal analysis part 4 ... Distortion waveform pattern accumulation part 5 ... Distortion waveform pattern search part 6 ... Code formation part 7 .... -Encoded data input unit 8 ... distortion waveform variation reproduction unit 9 ... distortion waveform pattern search unit 10 ... signal synthesis unit

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｈ０４Ｂ 14/04 Ｇ１０Ｌ 9/18 Ｍ ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁷ Identification symbol FI Theme coat ゛ (Reference) H04B 14/04 G10L 9/18 M

Claims

【特許請求の範囲】[Claims]

【請求項１】複数の歪み波形パターンを蓄積する歪み波
形パターン蓄積手段と、与えられた音響信号に対して所
定の解析区間を複数個定義するための区間定義手段と、
前記歪み波形パターン蓄積手段から前記音響信号に対応
する歪み波形パターンを検索して、基本歪み波形パター
ンとして入力する歪み波形パターン検索手段と、前記解
析区間の１つの信号切片波形に対して、前記基本歪み波
形パターンの周波数と振幅を変化させた歪み波形バリエ
ーションを複数作成し、前記複数の歪み波形バリエーシ
ョンの合成波形が前記信号切片波形に類似するように、
前記複数の歪み波形バリエーションを決定する信号解析
手段と、前記信号解析手段により決定された各々の歪み
波形バリエーションの周波数と振幅、および前記解析区
間の開始時刻と終了時刻の情報を所定の符号セットで符
号化することにより音響符号化データを作成する符号形
成手段を有することを特徴とする音響信号符号化システ
ム。1. A distortion waveform pattern accumulating means for accumulating a plurality of distortion waveform patterns, a section defining means for defining a plurality of predetermined analysis sections for a given acoustic signal,
A distortion waveform pattern corresponding to the acoustic signal is retrieved from the distortion waveform pattern storage unit, and a distortion waveform pattern retrieval unit is inputted as a basic distortion waveform pattern. Create a plurality of distortion waveform variations by changing the frequency and amplitude of the distortion waveform pattern, so that the composite waveform of the plurality of distortion waveform variations is similar to the signal intercept waveform,
A signal analyzing means for determining the plurality of distortion waveform variations, and information on the frequency and amplitude of each of the distortion waveform variations determined by the signal analyzing means, and the start time and end time of the analysis section, in a predetermined code set. An acoustic signal encoding system comprising code forming means for creating encoded audio data by encoding.

【請求項２】前記歪み波形パターン検索手段からは、複
数の基本歪み波形パターンが検索され、前記音響符号化
データが複数のトラックで構成され、各符号セットが使
用された基本歪み波形パターンを基準に前記いずれかの
トラックに分離され、各トラックごとに選択された基本
歪み波形パターンの識別情報が付加されていることを特
徴とする請求項１に記載の音響信号符号化システム。2. A plurality of basic distortion waveform patterns are retrieved from the distortion waveform pattern retrieval means, and the audio encoded data is composed of a plurality of tracks, and a reference is made to a basic distortion waveform pattern in which each code set is used. 2. The acoustic signal encoding system according to claim 1, wherein identification information of a basic distortion waveform pattern separated for each of the tracks is added to each of the tracks.

【請求項３】前記符号形成手段は、前の符号セットの終
了時刻が後の符号セットの開始時刻に近接し、前の符号
セットの周波数および振幅が後の符号セットの周波数お
よび振幅に類似している場合、前の符号セットの終了時
刻を後の符号セットの終了時刻に変更し、後の符号セッ
トを削除するものであることを特徴とする請求項１に記
載の音響信号符号化システム。3. The code forming means, wherein the end time of the previous code set is close to the start time of the subsequent code set, and the frequency and amplitude of the previous code set are similar to the frequency and amplitude of the subsequent code set. 2. The audio signal encoding system according to claim 1, wherein, when the first code set is present, the end time of the previous code set is changed to the end time of the second code set, and the second code set is deleted.

【請求項４】前記信号解析手段は、前記信号切片波形に
対して周波数解析を行って正弦波の集合に分解し、前記
分解された正弦波を所定のルールに従って複数のグルー
プに分け、前記各グループに属する正弦波を合成するこ
とにより得られる歪み波形に最も類似した形状をもつ歪
み波形バリエーションを選択することにより、前記複数
の歪み波形バリエーションを決定するものであることを
特徴とする請求項１に記載の音響信号符号化システム。4. The signal analyzing means performs frequency analysis on the signal intercept waveform to decompose the signal intercept waveform into a set of sine waves, divides the decomposed sine waves into a plurality of groups according to a predetermined rule, and 2. The plurality of distortion waveform variations are determined by selecting a distortion waveform variation having a shape most similar to a distortion waveform obtained by synthesizing sine waves belonging to a group. 3. The audio signal encoding system according to claim 1.

【請求項５】前記信号解析手段におけるグループ分け
は、前記各グループに属する正弦波の周波数がグループ
内の１つまたは複数の特定の正弦波の周波数の整数倍に
なるように行われるものであることを特徴とする請求項
４に記載の音響信号符号化システム。5. The method according to claim 1, wherein the grouping is performed such that the frequency of a sine wave belonging to each group is an integral multiple of the frequency of one or more specific sine waves in the group. The audio signal encoding system according to claim 4, wherein:

【請求項６】前記信号解析手段におけるグループ分け
は、前記各グループに属する正弦波の周波数が他のグル
ープに属する１つまたは複数の特定の正弦波の周波数の
整数倍になるように行われるものであることを特徴とす
る請求項５に記載の音響信号符号化システム。6. The method according to claim 1, wherein said grouping is performed such that the frequency of a sine wave belonging to each group is an integral multiple of the frequency of one or more specific sine waves belonging to another group. The acoustic signal encoding system according to claim 5, wherein

【請求項７】前記符号セットの周波数情報がＭＩＤＩ規
格で定義されているノートナンバーに対応する１２８種
の周波数から選択され、振幅情報がＭＩＤＩ規格で定義
されている１２８段階のベロシティで記述され、開始時
刻および終了時刻がＳＭＦ規格のデルタタイムで記述さ
れ、前記符号セットがＭＩＤＩ規格のノートオンイベン
トおよびノートオフイベントの１対で記述されており、
前記歪み波形パターンをＭＩＤＩ音源で実現しているこ
とを特徴とする請求項１に記載の音響信号符号化システ
ム。7. The frequency information of the code set is selected from 128 types of frequencies corresponding to note numbers defined by the MIDI standard, and the amplitude information is described by 128-stage velocities defined by the MIDI standard. The start time and the end time are described by SMF standard delta time, and the code set is described by a pair of MIDI standard note-on event and note-off event,
The audio signal encoding system according to claim 1, wherein the distortion waveform pattern is realized by a MIDI sound source.

【請求項８】前記符号セットの周波数情報がＭＩＤＩ規
格で定義されているノートナンバーに対応する１２８種
の周波数から選択され、振幅情報がＭＩＤＩ規格で定義
されている１２８段階のベロシティで記述され、開始時
刻および終了時刻がＳＭＦ規格のデルタタイムで記述さ
れ、前記符号セットがＭＩＤＩ規格のノートオンイベン
トおよびノートオフイベントの１対で記述されており、
前記歪み波形パターンをＭＩＤＩ音源で実現していると
共に、前記トラックがＭＩＤＩ規格のチャンネルであ
り、基本歪み波形パターンの識別情報がＭＩＤＩ規格で
楽器音色を示すプログラム番号、バンク番号で記述され
ていることを特徴とする請求項２に記載の音響信号符号
化システム。8. The frequency information of the code set is selected from 128 types of frequencies corresponding to note numbers defined in the MIDI standard, and the amplitude information is described in 128 levels of velocity defined in the MIDI standard. The start time and the end time are described by SMF standard delta time, and the code set is described by a pair of MIDI standard note-on event and note-off event,
The distortion waveform pattern is realized by a MIDI sound source, the track is a MIDI standard channel, and identification information of the basic distortion waveform pattern is described by a program number and a bank number indicating an instrument tone in the MIDI standard. The audio signal encoding system according to claim 2, wherein:

【請求項９】複数の歪み波形パターンを蓄積する歪み波
形パターン蓄積手段と、音響符号化データを入力する符
号化データ入力手段と、入力された音響符号化データに
対応する歪み波形パターンを、前記歪み波形パターン蓄
積手段から検索する歪み波形パターン検索手段と、前記
音響符号化データの符号セットが有する周波数と振幅の
情報に基づいて、前記検索された基本歪み波形パターン
の周波数と振幅を変更して歪み波形バリエーションを作
成する歪み波形バリエーション再現手段と、前記符号セ
ットごとに作成することにより得られる複数の歪み波形
バリエーションに対して、前記各符号セットが有する開
始時刻と終了時刻の情報に基づいて時間軸上で合成を行
うことにより音響信号を再現する信号合成手段を備えて
いることを特徴とする音響データ復号化システム。9. A distortion waveform pattern accumulating means for accumulating a plurality of distortion waveform patterns, encoded data input means for inputting audio encoded data, and a distortion waveform pattern corresponding to the input audio encoded data, A distortion waveform pattern search unit that searches from the distortion waveform pattern storage unit, and a frequency and an amplitude of the searched basic distortion waveform pattern are changed based on information on a frequency and an amplitude included in a code set of the acoustic encoded data. A distortion waveform variation reproducing unit that creates a distortion waveform variation, and a time based on information on a start time and an end time of each of the code sets for a plurality of distortion waveform variations obtained by creating each of the code sets. It is characterized by having signal synthesizing means to reproduce the acoustic signal by performing synthesis on the axis Sound data decoding system that.