JP2000099097A

JP2000099097A - Signal reproducing device and method, voice signal reproducing device, and speed conversion method for voice signal

Info

Publication number: JP2000099097A
Application number: JP10270244A
Authority: JP
Inventors: Noboru Murabayashi (村林 昇); Takao Takahashi (高橋 孝夫)
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1998-09-24
Filing date: 1998-09-24
Publication date: 2000-04-07

Abstract

PROBLEM TO BE SOLVED: To convert the reproduction speed of an audio signal without lowering the intelligibility of its content. SOLUTION: An audio signal reproducing device 10 includes a waveform extraction circuit 12 that cuts out the signal waveform of an audio signal in predetermined time units, a level detection circuit 13 that detects the signal level of each cut-out waveform, a feature extraction circuit 14 that extracts a feature of each cut-out waveform, and a waveform processing circuit 15 that processes the waveform of the audio signal by deleting and/or adding signal waveforms in the predetermined time units, based on the signal level detected by the level detection circuit 13 and the feature extracted by the feature extraction circuit 14. By deleting silent parts and parts in which the same feature continues, the device converts the reproduction speed of the audio signal without changing its pitch.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to a signal reproducing apparatus and method that reproduce a signal recorded on a recording medium, convert the time axis of the signal, and output it. It also relates to an audio signal reproducing apparatus and an audio signal speed conversion method that perform special reproduction in which an audio signal is reproduced faster or slower than the normal reproduction speed.

[0002]

[Description of the Related Art] In general, a home video tape recorder (VTR) has a fast-forward playback function that plays a video tape faster than the normal playback speed, so that the recorded video and audio can be viewed in a short time. However, when a video tape is played with this fast-forward function, the pitch of the output audio signal changes, and the content cannot be understood by listening to the audio. Recent VTRs therefore perform speech speed conversion processing that deletes silent sections of the audio signal during fast-forward playback, so that the pitch of the output audio signal is the same as during normal playback and the content can be understood by listening to the audio.

[0003]

[Problems to Be Solved by the Invention] However, a VTR that performs speech speed conversion by deleting silent sections works effectively at higher fast-forward speeds only for audio that contains relatively many silent sections. For audio with few silent sections, voiced sections also have to be deleted, and the content cannot be understood by listening to the output audio.

[0004] The present invention has been made in view of these circumstances, and an object thereof is to provide a signal reproducing apparatus and method capable of converting the time axis without deleting the effective portion of a signal.

[0005] Another object of the present invention is to provide an audio signal reproducing apparatus and an audio signal speed conversion method capable of converting the reproduction speed of an audio signal without lowering the intelligibility of its content.

[0006]

[Means for Solving the Problems] A signal reproducing apparatus according to the present invention comprises: reproducing means for reproducing a signal from a recording medium; signal extraction means for cutting out the signal waveform of the reproduced signal in predetermined time units; level detection means for detecting the signal level of the signal waveform cut out by the signal extraction means; feature extraction means for extracting a feature of the signal waveform cut out by the signal extraction means; and time axis conversion means for processing the waveform of the reproduced signal by deleting and/or adding signal waveforms in the predetermined time units, based on the signal level detected by the level detection means and the feature extracted by the feature extraction means for each predetermined time unit, thereby converting the time axis of the reproduced signal.

[0007] This signal reproducing apparatus detects the signal level and the feature from the signal waveform of the reproduced signal cut out in predetermined time units, processes the waveform of the reproduced signal by deleting and/or adding signal waveforms in those time units, and thereby converts the time axis of the reproduced signal.

[0008] A signal reproducing method according to the present invention comprises: reproducing a signal from a recording medium; cutting out the signal waveform of the reproduced signal in predetermined time units; detecting the signal level of the cut-out signal waveform; extracting a feature of the cut-out signal waveform; and processing the waveform of the reproduced signal by deleting and/or adding signal waveforms in the predetermined time units, based on the detected signal level and the extracted feature for each predetermined time unit, thereby converting the time axis of the reproduced signal.

[0009] This signal reproducing method detects the signal level and the feature from the signal waveform of the reproduced signal cut out in predetermined time units, processes the waveform of the reproduced signal by deleting and/or adding signal waveforms in those time units, and thereby converts the time axis of the reproduced signal.

[0010] An audio signal reproducing apparatus according to the present invention comprises: signal extraction means for cutting out the signal waveform of an audio signal in predetermined time units; level detection means for detecting the signal level of the signal waveform cut out by the signal extraction means; feature extraction means for extracting a feature of the signal waveform cut out by the signal extraction means; and speed conversion means for processing the waveform of the audio signal by deleting and/or adding signal waveforms in the predetermined time units, based on the signal level detected by the level detection means and the feature extracted by the feature extraction means for each predetermined time unit, thereby converting the reproduction speed of the audio signal.

[0011] This audio signal reproducing apparatus detects the signal level and the feature from the signal waveform of the audio signal cut out in predetermined time units, processes the waveform of the audio signal by deleting and/or adding signal waveforms in those time units, and thereby converts the reproduction speed of the audio signal.

[0012] An audio signal speed conversion method according to the present invention comprises: cutting out the signal waveform of an audio signal in predetermined time units; detecting the signal level of the cut-out signal waveform; extracting a feature of the cut-out signal waveform; and processing the waveform of the audio signal by deleting and/or adding signal waveforms in the predetermined time units, based on the detected signal level and the extracted feature for each predetermined time unit, thereby converting the reproduction speed of the audio signal.

[0013] This audio signal speed conversion method detects the signal level and the feature from the signal waveform of the audio signal cut out in predetermined time units, processes the waveform of the audio signal by deleting and/or adding signal waveforms in those time units, and thereby converts the reproduction speed of the audio signal.

[0014]

[Embodiments of the Invention] First, a speech speed conversion device according to a first embodiment of the present invention will be described.

[0015] FIG. 1 is a block diagram of the speech speed conversion device according to the first embodiment of the present invention.

[0016] The speech speed conversion device 10 shown in FIG. 1 is used, for example, in the audio output stage of a video tape recorder or the like, and performs speech speed conversion of the audio signal during fast-forward playback or the like.

[0017] The speech speed conversion device 10 has an analog/digital (A/D) conversion circuit 11, a waveform extraction circuit 12, a level detection circuit 13, a feature extraction circuit 14, a waveform processing circuit 15, and a digital/analog (D/A) conversion circuit 16.

[0018] The speech speed conversion device 10 is supplied with, for example, an analog audio signal reproduced from a video tape.

[0019] The A/D conversion circuit 11 converts the input analog audio signal into a digital audio signal.

[0020] The audio signal converted into digital data by the A/D conversion circuit 11 is supplied to the waveform extraction circuit 12. The waveform extraction circuit 12 divides the temporally continuous audio signal into predetermined time units and cuts out the signal waveform of the audio signal within each time unit. The time unit is, for example, about 10 msec to 200 msec. This time unit is hereinafter referred to as an audio block.
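The division into audio blocks can be illustrated with a minimal Python sketch. The function name `split_into_blocks` and its parameters are hypothetical, and the sketch assumes the signal is already a list of digital samples; the text does not say how a final partial block is handled, so here it is simply kept as a shorter block.

```python
def split_into_blocks(samples, sample_rate_hz, block_ms):
    """Split a sample sequence into consecutive fixed-duration audio blocks.

    Assumption: a trailing partial block is kept as-is (shorter than the rest).
    """
    block_len = int(sample_rate_hz * block_ms / 1000)
    if block_len <= 0:
        raise ValueError("block duration is too short for this sample rate")
    return [samples[i:i + block_len] for i in range(0, len(samples), block_len)]


# 1000 samples at 48,000 samples/s, cut into 10 ms blocks of 480 samples each.
blocks = split_into_blocks(list(range(1000)), sample_rate_hz=48000, block_ms=10)
```

With a 10 ms unit at 48,000 samples per second each full block holds 480 samples; a 200 ms unit would hold 9,600.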

[0021] The audio signal whose signal waveform has been cut out by the waveform extraction circuit 12 is supplied to the level detection circuit 13 one audio block at a time. The level detection circuit 13 detects the signal level of each audio block and determines whether the audio block is a silent section or a voiced section. For example, the level detection circuit 13 obtains the average power P or the average level M of the audio signal in each audio block, and judges the block to be a voiced section if the average power P or the average level M is higher than a predetermined threshold and a silent section if it is lower than the threshold. The average power P and the average level M of the audio signal in an audio block can be calculated as follows.

[0022]

$$P = \frac{1}{N}\sum i^{2}, \qquad M = \frac{1}{N}\sum \lvert i \rvert$$

Here the summation is taken over the audio block, N is the number of sampling data in the audio block, and i is the signal level (amplitude) of the audio signal.
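The two level measures and the threshold decision can be sketched as follows. The function names and the threshold value are hypothetical; the text only says that a block above the threshold is voiced and a block below it is silent, so treating a block exactly at the threshold as silent is an assumption of this sketch.

```python
def average_power(block):
    """P = (1/N) * sum of squared sample amplitudes in the block."""
    return sum(x * x for x in block) / len(block)


def average_level(block):
    """M = (1/N) * sum of absolute sample amplitudes in the block."""
    return sum(abs(x) for x in block) / len(block)


def is_voiced(block, power_threshold):
    """True for a voiced section, False for a silent section.

    Assumption: a block whose power equals the threshold counts as silent.
    """
    return average_power(block) > power_threshold


example_block = [3, -4]
p = average_power(example_block)   # (9 + 16) / 2
m = average_level(example_block)   # (3 + 4) / 2
```

Either measure can drive the decision; the power emphasizes large peaks, while the average level weights all samples linearly.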

[0023] The audio signal whose signal waveform has been cut out by the waveform extraction circuit 12 is supplied to the feature extraction circuit 14 one audio block at a time. The feature extraction circuit 14 extracts a feature of the audio signal for each audio block from the signal waveform of the supplied audio signal. The feature of the audio signal is, for example, its pitch or its frequency characteristics.

[0024] The audio signal whose signal waveform has been cut out by the waveform extraction circuit 12 is supplied to the waveform processing circuit 15 one audio block at a time. The waveform processing circuit 15 is also supplied with the result of the level detection circuit 13's judgment as to whether each audio block is a voiced section or a silent section, and with the feature of the audio signal extracted for each audio block by the feature extraction circuit 14.

[0025] The waveform processing circuit 15 performs waveform processing based on the voiced/silent judgment for each audio block and on the feature of the audio signal, and thereby converts the reproduction speed of the audio signal.

[0026] The digital audio signal whose waveform has been processed by the waveform processing circuit 15 is supplied to the D/A conversion circuit 16. The D/A conversion circuit 16 converts the digital audio signal into an analog signal and outputs it.

[0027] In the speech speed conversion device 10 configured as described above, when a video tape is played in fast-forward, the waveform processing circuit 15 processes the audio signal so that the speech rate is increased without changing the pitch of the audio, which makes fast listening possible.

[0028] The processing performed by the waveform processing circuit 15 will now be described in more detail.

[0029] The waveform processing circuit 15 deletes the audio blocks of silent sections. It also deletes some of the audio blocks in runs where audio signals with similar features continue. In doing so, the waveform processing circuit 15 determines the proportion of audio blocks to delete according to the reproduction speed of the audio signal; that is, it determines the number of blocks to delete according to the fast-forward playback speed. The waveform processing circuit 15 then connects the remaining, undeleted audio blocks to one another and outputs them, thereby converting the reproduction speed of the audio signal, that is, the speech rate.
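One way to read this selection rule is sketched below. Everything here is an assumption made for illustration: the function name, the use of per-block boolean flags, the choice to keep roughly N / speed blocks, and the priority order (silent blocks first, then blocks flagged as similar to their predecessor). The text itself only states that the deletion ratio follows the playback speed.

```python
def choose_blocks_to_delete(is_silent, is_similar_to_prev, speed):
    """Return sorted indices of audio blocks to delete for a given speed factor.

    Assumptions: about N / speed blocks are kept; silent blocks are deleted
    first, then blocks similar to the preceding block; no more blocks are
    deleted than the speed requires.
    """
    n = len(is_silent)
    keep_count = int(n / speed + 0.5)          # round half up
    target_deletions = max(n - keep_count, 0)
    silent = [i for i in range(n) if is_silent[i]]
    similar = [i for i in range(n)
               if not is_silent[i] and is_similar_to_prev[i]]
    candidates = silent + similar
    return sorted(candidates[:target_deletions])


silent_flags = [False, True, True, False, False, False, False, False]
similar_flags = [False, False, False, False, True, True, False, True]
to_delete = choose_blocks_to_delete(silent_flags, similar_flags, speed=2.0)
```

If there are fewer candidates than the target, this sketch deletes only the candidates, so the requested speed is not reached.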

[0030] If the remaining, undeleted audio blocks were simply joined as they are, audible problems would arise, so the waveform processing circuit 15 connects them smoothly as follows.

[0031] For example, as shown in FIG. 2, suppose there are temporally consecutive audio blocks A, B, and C, and the middle audio block B is to be deleted. If audio block A and audio block C were simply joined, the last part of the signal waveform of audio block A (the signal at time t1) and the first part of the signal waveform of audio block C (the signal at time t2) would be discontinuous, and this discontinuity could become noise.

[0032] The waveform processing circuit 15 therefore multiplies the audio signal of audio block A by a waveform connection weighting function fa(t) that is, for example, 1 at the beginning of audio block A (time ta) and 0 at the end of audio block B (time t2). It likewise multiplies the audio signal of audio block C by a waveform connection weighting function fc(t) that is 0 at the beginning of audio block B (time t1) and 1 at the end of audio block C (time tc). The audio signals of audio block A and audio block C multiplied by these weighting functions are then connected.

[0033] Specifically, the weighting function fa(t) applied to the audio signal of audio block A and the weighting function fc(t) applied to the audio signal of audio block C are as follows.

$$f_a(t) = -\frac{t}{t_2 - t_a} + \frac{t_2}{t_2 - t_a}$$

$$f_c(t) = \frac{t}{t_c - t_1} - \frac{t_1}{t_c - t_1}$$

Letting AB(t) be the audio signal of audio blocks A to B and BC(t) be the audio signal of audio blocks B to C, the signal AC(t) after waveform connection is as follows.

$$AC(t) = AB(t) \cdot f_a(t) + BC(t) \cdot f_c(t)$$

By processing with the waveform connection weighting functions in this way, the waveform processing circuit 15 can smoothly connect the undeleted audio blocks, giving an audio signal connection that sounds relatively natural.
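The weighted connection can be sketched in Python under explicit assumptions. The text defines fa over the span of blocks A to B and fc over the span of blocks B to C but does not spell out how the two spans are aligned in the output. This sketch assumes equal-length blocks and overlays the two spans sample for sample, so that three blocks become two. The function name is hypothetical.

```python
def crossfade_delete_middle(block_a, block_b, block_c):
    """Delete block B by crossfading the A..B span into the B..C span.

    Assumptions: the three blocks have equal length; the A..B and B..C spans
    are overlaid sample for sample; fa falls linearly from 1 to 0 and fc
    rises linearly from 0 to 1 across the overlaid span.
    """
    if not (len(block_a) == len(block_b) == len(block_c)):
        raise ValueError("this sketch assumes equal-length blocks")
    ab = list(block_a) + list(block_b)   # AB(t)
    bc = list(block_b) + list(block_c)   # BC(t)
    span = len(ab)
    out = []
    for n in range(span):
        fc = n / (span - 1)              # 0 at the start, 1 at the end
        fa = 1.0 - fc                    # 1 at the start, 0 at the end
        out.append(ab[n] * fa + bc[n] * fc)
    return out


joined = crossfade_delete_middle([0.0, 0.0], [0.0, 0.0], [3.0, 3.0])
```

Because fa and fc sum to 1 at every sample in this reading, a constant signal passes through unchanged, and the output starts exactly on the first sample of A and ends exactly on the last sample of C.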

[0034] A linear (first-order) function such as f(t) = at + b has been given as an example of the waveform connection weighting function f(t), but a quadratic function such as f(t) = at² + b or an exponential function such as f(t) = a·exp(−t/τ) + b may be used instead.

[0035] As shown in FIG. 3, when connecting point a, the connection point of audio block A, to the audio signal of audio block C, the waveform processing circuit 15 may delete audio block B so that the connection is made not at point c1, where the sign of the derivative differs from that at point a, but at point c2, where the sign of the derivative is the same as that at point a.
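A rough sketch of choosing a join point by derivative sign is given below. It is an illustration only: the text does not say how point c2 is located, so the first-difference approximation of the derivative, the left-to-right search, and the fallback to index 0 are all assumptions, and the function names are hypothetical.

```python
def slope_sign(samples, i):
    """Sign (+1, 0, or -1) of the first difference ending at index i."""
    d = samples[i] - samples[i - 1]
    return (d > 0) - (d < 0)


def find_join_index(block_a, block_c):
    """Index in block C where the slope sign matches the end of block A.

    Assumptions: the derivative is approximated by a first difference, the
    earliest matching index is used, and 0 is returned if nothing matches.
    """
    target = slope_sign(block_a, len(block_a) - 1)
    for i in range(1, len(block_c)):
        if slope_sign(block_c, i) == target:
            return i
    return 0


# Block A ends while rising; block C starts falling and turns upward at index 3.
join_at = find_join_index([0, 1, 2], [5, 4, 3, 4, 5])
```

A practical implementation would probably also compare amplitudes near the join, which this sketch does not attempt.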

[0036] Next, the signal waveforms obtained when an actual audio signal is converted by the speech speed conversion device 10 will be described with reference to the waveform diagrams of FIGS. 4 to 7.

[0037] FIG. 4 shows the waveform of the input audio signal. FIG. 5 shows the waveform of the output audio signal after speech speed conversion. FIG. 6 is an enlarged view of part of the input audio signal waveform shown in FIG. 4. FIG. 7 is an enlarged view of part of the output audio signal waveform shown in FIG. 5. In FIGS. 4 and 5, one division of the horizontal axis represents 5000 samples, or about 0.104 seconds. In FIGS. 6 and 7, one division of the horizontal axis represents 1000 samples, or about 0.0208 seconds. The audio signal has a sampling frequency fs = 48 kHz (48,000 Hz) and a quantization bit depth of 16 bits.
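The tick durations quoted for the figures follow from dividing the sample count by the sampling frequency. The short check below assumes a rate of 48,000 samples per second, which is the value the stated durations imply; the names are hypothetical.

```python
SAMPLE_RATE_HZ = 48_000  # assumption: 48,000 samples per second


def samples_to_seconds(sample_count):
    """Duration in seconds of a given number of samples."""
    return sample_count / SAMPLE_RATE_HZ


tick_fig_4_5 = samples_to_seconds(5000)   # one division in FIGS. 4 and 5
tick_fig_6_7 = samples_to_seconds(1000)   # one division in FIGS. 6 and 7
```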

[0038] As these figures show, even after waveform deletion for speech speed conversion and waveform connection processing, the pitch does not change between the input and output audio signals, and the audio waveform is connected smoothly.

[0039] Here, the X part of the waveforms shown in FIGS. 8 and 9 is a consonant section of the audio signal, and the Y part is a vowel section. Comparing the consonant sections of the input and output audio signals shows that the consonant section is shorter in the output audio signal after speech speed conversion. Likewise, comparing the vowel sections shows that the vowel section is shorter in the output audio signal after speech speed conversion.

[0040] Consequently, if too large a proportion of the audio waveform is deleted in speech speed processing, the content of the conversation can no longer be understood.

[0041] According to experimental results, for example, if the pitch of the input audio signal in a vowel section is about 10λ, the conversational content is hardly degraded and remains good as long as the output audio signal retains about 1/2 to 1/3 of that pitch in the vowel section. Much the same holds for consonant sections.

[0042] As described above, the speech speed conversion device 10 according to the first embodiment of the present invention detects the signal level and the feature of the audio signal from the signal waveform cut out for each audio block, and processes the waveform of the audio signal by deleting audio blocks. It can thereby convert the reproduction speed of the audio signal without lowering the intelligibility of its content.

[0043] When the waveform processing circuit 15 deletes audio blocks, it suffices, for double-speed playback for example, to thin the number of blocks to 1/2 of the total; more blocks than necessary are not deleted merely because a run of silent blocks is very long. Alternatively, the features of the audio signal may be fed back to a controller (not shown) for the rotation speed of the video tape to perform variable-speed playback, for example very fast playback at 3x or 4x speed in portions with many silent sections and, in voiced sections, fast-forward playback only as fast as the content of the audio signal can still be understood.

[0044] The waveform processing circuit 15 may also change the method of deleting audio blocks according to the fast-forward playback speed. For example, as shown in the table below, the deletion method may differ for low speed (1x to 1.5x), medium speed (1.5x to 2.5x), and high speed (2.5x or more).

[0045]

[Table 1]

[0046] If the above processing does not achieve the prescribed speed, then, for example, vowel sections may be deleted at low speed, consonant sections at medium speed, and portions with a low audio level at high speed.

[0047] In the case of high-speed fast playback, vowel sections and consonant sections are detected by, for example, first detecting a vowel section and treating the audio section immediately preceding it as a consonant section. A vowel section is detected by, for example, performing pitch detection with an autocorrelation function and treating the section in which a pitch is detected as a vowel section. In this case, besides applying autocorrelation processing directly to the audio signal, the pitch may be detected by applying autocorrelation processing after logarithmic spectrum processing.
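Pitch detection by autocorrelation can be sketched as follows. This is a simplified illustration that works directly on the time-domain samples; the lag search range, the 0.5 periodicity threshold, and the function names are assumptions, and the logarithmic spectrum variant mentioned in the text is not covered.

```python
import math


def autocorrelation(block, lag):
    """Unnormalized autocorrelation of a block at the given lag."""
    return sum(block[i] * block[i + lag] for i in range(len(block) - lag))


def detect_pitch_lag(block, min_lag, max_lag, threshold=0.5):
    """Return the lag (in samples) of the strongest periodicity, or None.

    Assumption: a block counts as pitched (a vowel candidate) when the
    autocorrelation, normalized by the zero-lag energy, exceeds `threshold`.
    """
    energy = autocorrelation(block, 0)
    if energy == 0:
        return None
    best_lag, best_value = None, threshold
    for lag in range(min_lag, max_lag + 1):
        value = autocorrelation(block, lag) / energy
        if value > best_value:
            best_lag, best_value = lag, value
    return best_lag


# A pure tone with a 40-sample period stands in for a voiced (vowel) block.
tone = [math.sin(2 * math.pi * n / 40) for n in range(400)]
pitch_lag = detect_pitch_lag(tone, min_lag=20, max_lag=60)
```

A block for which the function returns a lag would be treated as a vowel section, and the audio section just before it as a consonant section, following the description above.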

[0048] Next, a speech speed conversion device according to a second embodiment of the present invention will be described.

[0049] FIG. 8 is a block diagram of the speech speed conversion device according to the second embodiment of the present invention. In describing the speech speed conversion device of the second embodiment, circuits identical to those of the speech speed conversion device 10 of the first embodiment are given the same reference numerals in the drawings, and their detailed description is omitted. The same applies to the third and subsequent embodiments.

[0050] The speech speed conversion device 20 shown in FIG. 8 is used, for example, in the audio output stage of a video tape recorder or the like, and performs speech speed conversion of the audio signal during fast-forward playback or the like.

[0051] The speech speed conversion device 20 has an analog/digital (A/D) conversion circuit 11, a waveform extraction circuit 12, a level detection circuit 13, a correlation detection circuit 21, a waveform processing circuit 15, and a digital/analog (D/A) conversion circuit 16. The waveform processing circuit 15 consists of a thinning circuit 22 and a waveform connection circuit 23.

[0052] The audio signal whose signal waveform has been cut out by the waveform extraction circuit 12 is supplied to the correlation detection circuit 21 one audio block at a time. The correlation detection circuit 21 obtains the autocorrelation function between the audio blocks cut out by the waveform extraction circuit 12.

[0053] The thinning circuit 22 deletes the audio blocks found to be correlated by the correlation detection circuit 21 and the audio blocks of the silent sections found by the level detection circuit 13.

[0054] The waveform connection circuit 23 performs connection processing so that the audio waveforms of the remaining, undeleted audio blocks are connected smoothly.

[0055] By detecting the correlation of the waveforms in this way, the speech speed conversion device 20 according to the second embodiment of the present invention can detect portions where the audio signal is similar, and can perform speech speed conversion by deleting the detected similar portions.
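A simplified way to flag correlated blocks is sketched below. It uses a zero-lag normalized correlation between each block and its predecessor as a stand-in for the autocorrelation function between blocks described above; that substitution, the 0.9 threshold, and the function names are assumptions for illustration.

```python
import math


def normalized_correlation(a, b):
    """Zero-lag normalized correlation of two blocks, in the range [-1, 1]."""
    n = min(len(a), len(b))
    dot = sum(a[i] * b[i] for i in range(n))
    norm = math.sqrt(sum(x * x for x in a[:n]) * sum(y * y for y in b[:n]))
    return dot / norm if norm > 0 else 0.0


def similar_to_previous(blocks, threshold=0.9):
    """Flag each block whose waveform closely matches the preceding block.

    Assumption: `threshold` is a hypothetical tuning value; the first block
    has no predecessor and is never flagged.
    """
    if not blocks:
        return []
    flags = [False]
    for prev, cur in zip(blocks, blocks[1:]):
        flags.append(normalized_correlation(prev, cur) >= threshold)
    return flags


flags = similar_to_previous([[1, 2, 3], [2, 4, 6], [3, -2, 1]])
```

Blocks flagged this way are candidates for the thinning step; a fuller implementation would also search over lags so that similar waveforms that are shifted in time are still recognized.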

[0056] Next, a speech speed conversion device according to a third embodiment of the present invention will be described.

[0057] FIG. 9 is a block diagram of the speech speed conversion device according to the third embodiment of the present invention.

[0058] The speech speed conversion device 30 shown in FIG. 9 is used, for example, in the audio output stage of a video tape recorder or the like, and performs speech speed conversion of the audio signal during fast-forward playback or the like.

[0059] The speech speed conversion device 30 has an analog/digital (A/D) conversion circuit 11, a waveform extraction circuit 12, a level detection circuit 13, a level comparison circuit 31, a waveform processing circuit 15, and a digital/analog (D/A) conversion circuit 16.

[0060] The level comparison circuit 31 receives, one audio block at a time, the audio signal whose waveform has been cut out by the waveform extraction circuit 12. The level comparison circuit 31 detects the level of the audio signal for each audio block.
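
For two blocks to count as having "the same level", the measured level has to be compared at some finite resolution. The sketch below shows one possible way to do that, assuming the level is the mean absolute amplitude expressed in decibels and rounded to a fixed step. The function names, the 3 dB step, and the -60 dB floor are all illustrative assumptions rather than values given in this document.

```python
import math

def average_level(block):
    """Mean absolute amplitude of one audio block."""
    return sum(abs(x) for x in block) / len(block)

def quantized_level(block, step_db=3.0, floor_db=-60.0):
    """Level of a block as an integer number of `step_db` steps.
    Returns None for a block with no signal at all."""
    level = average_level(block)
    if level <= 0.0:
        return None
    db = max(20.0 * math.log10(level), floor_db)  # clamp very quiet blocks
    return round(db / step_db)
```

With this quantization, blocks whose levels differ by a fraction of a decibel compare as equal, which is what a run-detection step downstream needs.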

[0061] The thinning circuit 32 deletes the audio blocks in silent sections. In addition, based on the level detection results of the level comparison circuit 31, it checks the level of each audio block, and when audio blocks of the same level continue for a predetermined number of blocks, it deletes some of them. The number of blocks to delete may be determined according to the playback speed or the like. Alternatively, within a run of consecutive same-level audio blocks, the two blocks adjacent to blocks of a different level may be kept and the other blocks deleted.

[0062] The waveform connection circuit 23 joins the audio blocks that remain after deletion so that the audio waveform connects smoothly across each junction.

[0063] For example, as shown in FIG. 10, when audio blocks A to F are input and blocks B, C, and D have the same level, the speech speed conversion device 30 deletes block C and performs waveform connection processing between B and D. When blocks B, C, D, and E have the same level, it deletes blocks C and D and performs waveform connection processing between B and E.
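
The A to F example can be reproduced with a short run-based routine. This is a minimal sketch, assuming each block has already been reduced to a comparable level value and that a run must contain at least three blocks before anything is deleted; `thin_equal_runs` and `min_run` are hypothetical names introduced only for illustration.

```python
def thin_equal_runs(labels, levels, min_run=3):
    """Keep the first and last block of every run of at least `min_run`
    equal levels and drop the blocks in between. Returns surviving labels."""
    kept = []
    i = 0
    while i < len(levels):
        j = i
        while j + 1 < len(levels) and levels[j + 1] == levels[i]:
            j += 1                         # extend the run of equal levels
        if j - i + 1 >= min_run:
            kept.extend([labels[i], labels[j]])  # keep only the two boundary blocks
        else:
            kept.extend(labels[i:j + 1])         # run too short: keep everything
        i = j + 1
    return kept

labels = list("ABCDEF")
print(thin_equal_runs(labels, [1, 2, 2, 2, 3, 4]))  # B, C, D equal: C is dropped
print(thin_equal_runs(labels, [1, 2, 2, 2, 2, 4]))  # B to E equal: C and D are dropped
```

The surviving neighbours (B and D, or B and E) are the blocks that would then be handed to the waveform connection step.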

[0064] In the speech speed conversion device 30 of the third embodiment of the present invention, detecting the audio level in this way makes it possible to find portions where the audio signal is similar, and deleting those similar portions achieves speech speed conversion.

[0065] Next, a speech speed conversion device according to a fourth embodiment of the present invention will be described.

[0066] FIG. 11 shows a block diagram of the speech speed conversion device according to the fourth embodiment of the present invention.

[0067] The speech speed conversion device 40 shown in FIG. 11 is used, for example, in the audio output stage of a video tape recorder or the like, and converts the speech speed of the audio signal during fast-forward playback and similar operations.

[0068] The speech speed conversion device 40 has an analog/digital (A/D) conversion circuit 11, a waveform extraction circuit 12, a level detection circuit 13, a frequency analysis circuit 41, a peak frequency detection circuit 42, a peak frequency continuity detection circuit 43, a waveform processing circuit 15, and a digital/analog (D/A) conversion circuit 16.

[0069] The frequency analysis circuit 41 receives, one audio block at a time, the audio signal whose waveform has been cut out by the waveform extraction circuit 12. The frequency analysis circuit 41 performs frequency analysis of the audio signal for each audio block.

[0070] Based on the analysis results of the frequency analysis circuit 41, the peak frequency detection circuit 42 detects a first peak frequency and a second peak frequency for each audio block.

[0071] Based on the detection results of the peak frequency detection circuit 42, the peak frequency continuity detection circuit 43 detects whether audio blocks with the same peak frequencies continue.

[0072] The thinning circuit 44 deletes the audio blocks in silent sections. In addition, based on the detection results of the peak frequency continuity detection circuit 43, when audio blocks with the same peak frequency continue for a predetermined number of blocks, it deletes some of them. The number of blocks to delete may be determined according to the playback speed or the like. Alternatively, within a run of consecutive audio blocks with the same peak frequency, the two blocks adjacent to blocks with a different peak frequency may be kept and the other blocks deleted.

[0073] The waveform connection circuit 23 joins the audio blocks that remain after deletion so that the audio waveform connects smoothly across each junction.

[0074] In the speech speed conversion device 40 of the fourth embodiment of the present invention, detecting the peak frequencies in this way makes it possible to find portions where the audio signal is similar, and deleting those similar portions achieves speech speed conversion.
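
The chain of frequency analysis, peak detection, and continuity detection in the fourth embodiment might be sketched as follows. Several simplifications are assumed: the frequency analysis is a naive discrete Fourier transform, the "first" and "second" peaks are simply the two strongest non-DC bins rather than true local maxima, and two blocks count as continuous only when both bins match exactly. All function names are hypothetical.

```python
import cmath
import math

def magnitude_spectrum(block):
    """Magnitudes of DFT bins 1 .. N/2 - 1 (naive O(N^2) DFT, DC excluded)."""
    n = len(block)
    mags = {}
    for k in range(1, n // 2):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(block))
        mags[k] = abs(acc)
    return mags

def two_peak_bins(block):
    """Bin indices of the strongest and second-strongest spectral components."""
    mags = magnitude_spectrum(block)
    ranked = sorted(mags, key=mags.get, reverse=True)
    return tuple(ranked[:2])

def continuity_runs(blocks):
    """Run-length encode the per-block peak pairs as [(peaks, count), ...]."""
    runs = []
    for block in blocks:
        peaks = two_peak_bins(block)
        if runs and runs[-1][0] == peaks:
            runs[-1] = (peaks, runs[-1][1] + 1)   # same peaks as previous block
        else:
            runs.append((peaks, 1))               # peaks changed: start a new run
    return runs
```

A bin index `k` corresponds to a frequency of `k * fs / N` for sampling rate `fs` and block length `N`. Any run whose count reaches the chosen threshold is a candidate for thinning in the same way as a run of equal levels.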

[0075] Next, a fifth embodiment of the present invention will be described. The fifth embodiment is a disk reproducing apparatus to which the speech speed conversion devices of the first to fourth embodiments described above are applied.

[0076] FIG. 12 shows a block diagram of the disk reproducing apparatus according to the fifth embodiment of the present invention.

[0077] The disk reproducing apparatus 50 shown in FIG. 12 plays back, for example, an optical disk on which video and audio are digitally recorded. When the user performs a fast-forward playback operation or the like, it plays back the video signal at high speed and can also convert the speech speed of the audio signal.

[0078] The disk reproducing apparatus 50 has a reproduction processing circuit 52 that reads the signal recorded on an optical disk 51 and converts it into digital data, and a demultiplexer 53 that separates the digital data read by the reproduction processing circuit 52 into a video signal and an audio signal.

[0079] The disk reproducing apparatus 50 also has a video reproduction system consisting of a video signal processing circuit 54, which receives the video signal separated by the demultiplexer 53 and performs decoding, error correction, and similar processing on it; an image processing circuit 55, which performs thinning and similar processing on the video signal during fast-forward playback and the like; and a video D/A conversion circuit 56, which converts the video signal processed by the image processing circuit 55 into an analog signal and outputs it. In this video reproduction system, when the user performs a fast-forward playback operation, the image processing circuit 55 performs frame thinning or similar processing and outputs a video signal at a predetermined playback speed.

[0080] The disk reproducing apparatus 50 also has an audio signal processing circuit 57, which receives the audio signal separated by the demultiplexer 53, performs decoding and error correction on it, and cuts its waveform into audio blocks of a predetermined time unit; the level detection circuit 13; the feature extraction circuit 14; the waveform processing circuit 15; and an audio D/A conversion circuit 16.

[0081] The disk reproducing apparatus 50 further has a system controller 58, which controls each circuit and accepts operation input from the user, and a servo control circuit 59, which performs servo control of the optical disk 51 under the control of the system controller 58.

[0082] In the disk reproducing apparatus 50 configured in this way, when the user performs a fast-forward playback operation, the image processing circuit 55 processes the image data and outputs a fast-forward playback image. The waveform processing circuit 15 deletes silent audio blocks and audio blocks whose features continue, and outputs an audio signal at a playback speed matching the fast-forward playback image. The system controller 58 controls the rotation speed of the optical disk 51 through the servo control circuit 59, based on the level detection circuit 13's determination of whether each audio block is silent or voiced and on the audio signal features from the feature extraction circuit 14. This enables variable-speed playback: for example, very fast playback at 3x or 4x speed in parts with many silent sections, and fast-forward playback slow enough for the content of the audio signal to be understood in voiced sections.
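
One way to picture this variable-speed control is a rule that picks a playback speed for each group of blocks from the share of silent blocks in it. The sketch below is purely illustrative: the window size, the 75% silence ratio, and the 4x and 1.5x speeds are assumed values, and `playback_speeds` is a hypothetical name. The document itself only gives 3x and 4x as examples of the fast case.

```python
def playback_speeds(silent_flags, window=4, silent_ratio=0.75, fast=4.0, speech=1.5):
    """Choose one playback speed per window of blocks.

    silent_flags: per-block booleans (True = block judged silent).
    Windows dominated by silence get the `fast` speed; the others get the
    slower `speech` speed at which content is assumed to stay intelligible.
    """
    speeds = []
    for start in range(0, len(silent_flags), window):
        chunk = silent_flags[start:start + window]
        ratio = sum(chunk) / len(chunk)          # share of silent blocks in the window
        speeds.append(fast if ratio >= silent_ratio else speech)
    return speeds
```

In an actual player the chosen speed would be passed to the servo control, and the feature-continuity results could be folded into the same decision.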

[0083] In the disk reproducing apparatus 50 of the fifth embodiment of the present invention, speech speed conversion can thus also be applied to the audio signal recorded on the optical disk 51.

[0084] The first to fifth embodiments of the present invention have been described above. The speech speed conversion devices and the disk reproducing apparatus of the embodiments delete silent portions and, in addition, detect features of the audio signal such as waveform correlation, peak frequency characteristics, and level changes, and delete the portions where those features continue. This makes it possible to change the playback speed of the audio signal without changing the voice pitch from that of normal playback.

[0085] Each embodiment showed an example of achieving high-speed playback by deleting part of the audio signal. Slow playback below normal speed can also be achieved, for example by adding further silence to a detected silent portion, or by adding a signal with similar features where a run of similar features is detected. Specifically, this can be done by having the waveform processing circuit 15 add audio blocks and perform connection processing on the added blocks.
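
Slow playback by block addition is the mirror image of thinning. The following is a minimal sketch, assuming a per-block flag that marks blocks that may be repeated (silent blocks, or blocks inside a run of similar features). The names `stretch_blocks`, `repeatable`, and `repeats` are hypothetical.

```python
def stretch_blocks(blocks, repeatable, repeats=1):
    """Lengthen a block sequence by inserting `repeats` copies after
    every block flagged as repeatable."""
    out = []
    for block, flag in zip(blocks, repeatable):
        out.append(block)
        if flag:
            # Copies are appended right after the original; each junction
            # would still need smooth waveform connection in a real device.
            out.extend(list(block) for _ in range(repeats))
    return out
```

With `repeats=1`, a sequence in which half of the blocks are repeatable becomes 1.5 times as long, which corresponds to playback at two thirds of normal speed.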

[0086] This embodiment described an example of converting the speed of an audio signal, but the present invention is not limited to audio signals and may be applied, for example, to image signals.

[0087]

[Effects of the Invention] In the present invention, a signal level and a feature are detected from the signal waveform of a reproduced signal cut out in predetermined time units, the waveform of the reproduced signal is processed by deleting and/or adding signal waveforms in those predetermined time units, and the time axis of the reproduced signal is thereby converted. The present invention can thus convert the time axis without deleting the effective portion of the signal.

[0088] Also in the present invention, a signal level and a feature are detected from the signal waveform of an audio signal cut out in predetermined time units, the waveform of the audio signal is processed by deleting and/or adding signal waveforms in those predetermined time units, and the playback speed of the audio signal is thereby converted. The present invention can thus eliminate fluctuation of the voice pitch with a simple configuration and convert the playback speed of the audio signal without lowering how well the content is understood.

[Brief description of the drawings]

FIG. 1 is a block diagram of a speech speed conversion device according to a first embodiment of the present invention.

FIG. 2 is a diagram for explaining connection processing of an audio signal using a weighting function, performed by the waveform processing circuit of the speech speed conversion device.

FIG. 3 is a diagram for explaining connection processing of an audio signal using the sign of the derivative, performed by the waveform processing circuit of the speech speed conversion device.

FIG. 4 is a waveform diagram showing an example of an audio signal input to the speech speed conversion device.

FIG. 5 is a waveform diagram showing the audio signal of FIG. 4 after speech speed conversion by the speech speed conversion device.

FIG. 6 is an enlarged view of the audio signal shown in FIG. 4.

FIG. 7 is an enlarged view of the audio signal shown in FIG. 5.

FIG. 8 is a block diagram of a speech speed conversion device according to a second embodiment of the present invention.

FIG. 9 is a block diagram of a speech speed conversion device according to a third embodiment of the present invention.

FIG. 10 is a diagram for explaining audio signal deletion processing performed by the waveform processing circuit of the speech speed conversion device according to the third embodiment.

FIG. 11 is a block diagram of a speech speed conversion device according to a fourth embodiment of the present invention.

FIG. 12 is a block diagram of a disk reproducing apparatus according to a fifth embodiment of the present invention.

[Explanation of symbols]

10, 20, 30, 40: speech speed conversion device; 12: waveform extraction circuit; 13: level detection circuit; 14: feature extraction circuit; 15: waveform processing circuit; 21: correlation detection circuit; 31: level comparison circuit; 41: frequency analysis circuit; 42: peak frequency detection circuit; 43: peak frequency continuity detection circuit; 50: disk reproducing apparatus

Claims

[Claims]

[Claim 1] A signal reproducing apparatus comprising: reproducing means for reproducing a signal from a recording medium; signal extraction means for cutting out the signal waveform of the reproduced signal reproduced by the reproducing means in predetermined time units; level detection means for detecting the signal level of the signal waveform cut out by the signal extraction means; feature extraction means for extracting a feature of the signal waveform cut out by the signal extraction means; and time axis conversion means for processing the waveform of the reproduced signal by deleting and/or adding signal waveforms in the predetermined time units, based on the signal level of the signal waveform of each predetermined time unit detected by the level detection means and the feature of the signal waveform of each predetermined time unit extracted by the feature extraction means, thereby converting the time axis of the reproduced signal.

[Claim 2] The signal reproducing apparatus according to claim 1, further comprising control means for controlling reproduction of the recording medium in accordance with the signal level of the signal waveform of each predetermined time unit detected by the level detection means and the feature of the signal waveform of each predetermined time unit extracted by the feature extraction means.

[Claim 3] The signal reproducing apparatus according to claim 1, wherein the time axis conversion means connects the signal waveforms before and after a deleted signal waveform by processing them with a weighting function.

[Claim 4] The signal reproducing apparatus according to claim 1, wherein the time axis conversion means connects the signal waveforms before and after a deleted signal waveform based on the differential values of those signal waveforms.

[Claim 5] The signal reproducing apparatus according to claim 1, wherein the level detection means detects the signal level from the average power and/or average level of the signal waveform of each predetermined time unit, and the time axis conversion means deletes and/or adds, in the predetermined time units, signal waveforms whose signal level is at or below a predetermined threshold.

[Claim 6] The signal reproducing apparatus according to claim 1, wherein the feature extraction means extracts the feature of the signal waveform from the waveform correlation of the signal waveform of each predetermined time unit, and the time axis conversion means deletes and/or adds, in the predetermined time units, signal waveforms whose features are similar.

[Claim 7] The signal reproducing apparatus according to claim 1, wherein the feature extraction means performs frequency analysis of the signal waveform of each predetermined time unit and extracts the feature of the signal waveform from the continuity of the peak frequency, and the time axis conversion means deletes and/or adds, in the predetermined time units, signal waveforms whose features are similar.

[Claim 8] The signal reproducing apparatus according to claim 1, wherein the feature extraction means extracts the feature of the signal waveform from the level change of the signal waveform of each predetermined time unit, and the time axis conversion means deletes and/or adds, in the predetermined time units, signal waveforms whose features are similar.

[Claim 9] A signal reproducing method comprising: reproducing a signal from a recording medium; cutting out the signal waveform of the reproduced signal in predetermined time units; detecting the signal level of the cut-out signal waveform; extracting a feature of the cut-out signal waveform; and processing the waveform of the reproduced signal by deleting and/or adding signal waveforms in the predetermined time units, based on the detected signal level of the signal waveform of each predetermined time unit and the extracted feature of the signal waveform of each predetermined time unit, thereby converting the time axis of the reproduced signal.

[Claim 10] The signal reproducing method according to claim 9, wherein reproduction of the recording medium is controlled in accordance with the detected signal level of the signal waveform of each predetermined time unit and the extracted feature of the signal waveform of each predetermined time unit.

[Claim 11] The signal reproducing method according to claim 9, wherein the signal waveforms before and after a deleted signal waveform are processed with a weighting function and connected, and the time axis of the reproduced signal is thereby converted.

[Claim 12] The signal reproducing method according to claim 9, wherein the signal waveforms before and after a deleted signal waveform are connected based on the differential values of those signal waveforms, and the time axis of the reproduced signal is thereby converted.

[Claim 13] The signal reproducing method according to claim 9, wherein the signal level is detected from the average power and/or average level of the signal waveform of each predetermined time unit, and signal waveforms whose signal level is at or below a predetermined threshold are deleted and/or added in the predetermined time units.

[Claim 14] The signal reproducing method according to claim 9, wherein the feature of the signal waveform is extracted from the waveform correlation of the signal waveform of each predetermined time unit, and signal waveforms whose features are similar are deleted and/or added in the predetermined time units.

[Claim 15] The signal reproducing method according to claim 9, wherein the signal waveform of each predetermined time unit is frequency-analyzed, the feature of the signal waveform is extracted from the continuity of the peak frequency, and signal waveforms whose features are similar are deleted and/or added in the predetermined time units.

[Claim 16] The signal reproducing method according to claim 9, wherein the feature of the signal waveform is extracted from the level change of the signal waveform of each predetermined time unit, and signal waveforms whose features are similar are deleted and/or added in the predetermined time units.

[Claim 17] An audio signal reproducing apparatus comprising: signal extraction means for cutting out the signal waveform of an audio signal in predetermined time units; level detection means for detecting the signal level of the signal waveform cut out by the signal extraction means; feature extraction means for extracting a feature of the signal waveform cut out by the signal extraction means; and speed conversion means for processing the waveform of the audio signal by deleting and/or adding signal waveforms in the predetermined time units, based on the signal level of the signal waveform of each predetermined time unit detected by the level detection means and the feature of the signal waveform of each predetermined time unit extracted by the feature extraction means, thereby converting the playback speed of the audio signal.

[Claim 18] The audio signal reproducing apparatus according to claim 17, wherein the speed conversion means connects the signal waveforms before and after a deleted signal waveform by processing them with a weighting function.

[Claim 19] The audio signal reproducing apparatus according to claim 17, wherein the speed conversion means connects the signal waveforms before and after a deleted signal waveform based on the differential values of those signal waveforms.

[Claim 20] The audio signal reproducing apparatus according to claim 17, wherein the level detection means detects the signal level from the average power and/or average level of the signal waveform of each predetermined time unit, and the speed conversion means deletes and/or adds, in the predetermined time units, signal waveforms whose signal level is at or below a predetermined threshold.

[Claim 21] The audio signal reproducing apparatus according to claim 17, wherein the feature extraction means extracts the feature of the signal waveform from the waveform correlation of the signal waveform of each predetermined time unit, and the speed conversion means deletes and/or adds, in the predetermined time units, signal waveforms whose features are similar.

[Claim 22] The audio signal reproducing apparatus according to claim 17, wherein the feature extraction means performs frequency analysis of the signal waveform of each predetermined time unit and extracts the feature of the signal waveform from the continuity of the peak frequency, and the speed conversion means deletes and/or adds, in the predetermined time units, signal waveforms whose features are similar.

[Claim 23] The audio signal reproducing apparatus according to claim 17, wherein the feature extraction means extracts the feature of the signal waveform from the level change of the signal waveform of each predetermined time unit, and the speed conversion means deletes and/or adds, in the predetermined time units, signal waveforms whose features are similar.

[Claim 24] A speed conversion method for an audio signal, comprising: cutting out the signal waveform of an audio signal in predetermined time units; detecting the signal level of the cut-out signal waveform; extracting a feature of the cut-out signal waveform; and processing the waveform of the audio signal by deleting and/or adding signal waveforms in the predetermined time units, based on the signal level of the signal waveform of each predetermined time unit and the extracted feature of the signal waveform of each predetermined time unit, thereby converting the playback speed of the audio signal.

[Claim 25] The speed conversion method for an audio signal according to claim 24, wherein the signal waveforms before and after a deleted signal waveform are processed with a weighting function and connected, and the playback speed of the audio signal is thereby converted.

[Claim 26] The speed conversion method for an audio signal according to claim 24, wherein the signal waveforms before and after a deleted signal waveform are connected based on the differential values of those signal waveforms, and the playback speed of the audio signal is thereby converted.

[Claim 27] The speed conversion method for an audio signal according to claim 24, wherein the signal level is detected from the average power and/or average level of the signal waveform of each predetermined time unit, and signal waveforms whose signal level is at or below a predetermined threshold are deleted and/or added in the predetermined time units.

[Claim 28] The speed conversion method for an audio signal according to claim 24, wherein the feature of the signal waveform is extracted from the waveform correlation of the signal waveform of each predetermined time unit, and signal waveforms whose features are similar are deleted and/or added in the predetermined time units.

[Claim 29] The speed conversion method for an audio signal according to claim 24, wherein the signal waveform of each predetermined time unit is frequency-analyzed, the feature of the signal waveform is extracted from the continuity of the peak frequency, and signal waveforms whose features are similar are deleted and/or added in the predetermined time units.

[Claim 30] The speed conversion method for an audio signal according to claim 24, wherein the feature of the signal waveform is extracted from the level change of the signal waveform of each predetermined time unit, and signal waveforms whose features are similar are deleted and/or added in the predetermined time units.