JP2962777B2 - Audio signal time-base expansion / compression device - Google Patents

Audio signal time-base expansion / compression device

Info

Publication number
JP2962777B2
JP2962777B2 JP2173376A JP17337690A JP2962777B2 JP 2962777 B2 JP2962777 B2 JP 2962777B2 JP 2173376 A JP2173376 A JP 2173376A JP 17337690 A JP17337690 A JP 17337690A JP 2962777 B2 JP2962777 B2 JP 2962777B2
Authority
JP
Japan
Prior art keywords
expansion
ratio
audio signal
audio
length
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2173376A
Other languages
Japanese (ja)
Other versions
JPH0460700A (en
Inventor
啓之 平井
正蔵 杉下
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sanyo Denki Co Ltd
Original Assignee
Sanyo Denki Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sanyo Denki Co Ltd filed Critical Sanyo Denki Co Ltd
Priority to JP2173376A priority Critical patent/JP2962777B2/en
Publication of JPH0460700A publication Critical patent/JPH0460700A/en
Application granted granted Critical
Publication of JP2962777B2 publication Critical patent/JP2962777B2/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Landscapes

  • Signal Processing Not Specific To The Method Of Recording And Reproducing (AREA)

Description

【発明の詳細な説明】 (イ) 産業上の利用分野 本発明は異なる冗長度で長時間記録された音声信号
を、録音時とは異なる速度で再生するときに適用される
音声信号の時間軸伸長圧縮装置に関するものである。
DETAILED DESCRIPTION OF THE INVENTION (A) Industrial Field of the Invention The present invention relates to a time axis of an audio signal applied when an audio signal recorded for a long time with different redundancy is reproduced at a different speed from that at the time of recording. The present invention relates to a decompression / compression device.

(ロ) 従来の技術 従来、英会話学習等を行う際は、学習者に応じた会話
速度のヒヤリング等を行うため、単に音声信号の再生速
度を可変して音声の時間軸を変えるようにしている。こ
の場合、再生速度に応じて再生音声のピッチ周波数やホ
ルマント周波数も元の録音音声の周波数から変化し、音
声の明瞭度が劣化するという不都合がある。
(B) Conventional technology Conventionally, when conducting English conversation learning or the like, in order to conduct hearing or the like of a conversation speed according to a learner, the reproduction time of an audio signal is simply varied to change the time axis of the audio. . In this case, there is a disadvantage that the pitch frequency and the formant frequency of the reproduced sound also change from the frequency of the original recorded sound according to the reproduction speed, and the intelligibility of the sound is deteriorated.

そこで、昭和61年10月発行の「日本音響学会講演論文
集」の第149頁〜第150頁にはポインター移動量制御によ
る重複加算法(PICORA法)と呼ばれる、音声信号の時間
軸伸長圧縮方法により、前記の不都合を解消して音声の
時間軸の伸長圧縮を行うことが記載されている。
Therefore, the pp. 149-150 of the "Transactions of the Acoustical Society of Japan," published in October 1986, describes a method of time axis expansion and compression of audio signals called the overlap addition method (PICORA method) by controlling the amount of pointer movement. Describes that the above-mentioned inconvenience is eliminated and the time axis of the voice is expanded and compressed.

前記PICORA法は、伸縮比率を入力してやると一定の比
率で音声を伸長・圧縮するものである。
The PICORA method expands and compresses audio at a fixed ratio when an expansion ratio is input.

(ハ) 発明が解決しようとする課題 ところで、長時間に亘る音声の連続記録の際には発話
の速さが一定でなく、早いところと遅いところがあり、
その結果異なる冗長度で記録される場合が多いものであ
る。
(C) Problems to be Solved by the Invention By the way, during continuous recording of voice over a long period of time, the speed of speech is not constant, and there are places where the speech is fast and places where it is slow.
As a result, recording is often performed with different degrees of redundancy.

従って、このように記録された音声信号を時間軸伸長
・圧縮によって最適に再生するには、その都度伸縮比率
を変化させなければならない。
Therefore, in order to optimally reproduce the audio signal recorded in this manner by time-base expansion / compression, the expansion / contraction ratio must be changed each time.

(ニ) 発明が解決するための手段 本発明は上記の課題に鑑み、異なる冗長度で記録され
た音声信号を可変速度で再生する音声記憶部と、再生し
た音声波形の母音の長さにより伸長・圧縮比率を決定す
る伸縮比率決定部と、該伸縮比率決定部による伸長・圧
縮比率により音声信号の時間軸を伸長・圧縮する時間軸
制御部とを具備したことを特徴とする音声信号の時間軸
伸長圧縮装置を提供するものである。
(D) Means for Solving the Invention In view of the above problems, the present invention has an audio storage unit that reproduces audio signals recorded with different degrees of redundancy at a variable speed, and expands the vowel length of the reproduced audio waveform. An audio signal time, comprising: an expansion / contraction ratio determining unit that determines a compression ratio; and a time axis control unit that expands and compresses the time axis of the audio signal based on the expansion / compression ratio by the expansion / contraction ratio determining unit. An axial extension / compression device is provided.

又、前記伸縮比率決定部は、逐次複数個の母音の長さ
の平均長と、任意に設定した長さとの比をもって伸長・
圧縮比率を決定することを特徴とした音声信号の時間軸
伸長圧縮装置を提供するものである。
Further, the expansion / contraction ratio determination unit sequentially expands / contracts the data by using a ratio of an average length of a plurality of vowels to an arbitrarily set length.
An object of the present invention is to provide an audio signal time-base expansion / compression device characterized by determining a compression ratio.

(ホ) 作用 上記のように構成された本発明による音声信号の時間
軸伸長圧縮装置によれば、再生中における音声信号の発
話の速さが途中で変化する場合、それに対応して母音の
長さが変化するため、音声の発話の速さに適応した伸縮
比率を得ることができるものである。
(E) Function According to the audio signal time-axis expansion / compression device according to the present invention configured as described above, if the speed of speech of the audio signal during reproduction changes on the way, the length of the vowel is correspondingly changed. Therefore, an expansion / contraction ratio adapted to the speed of speech utterance can be obtained.

(ヘ) 実施例 以下、図面に示す本発明装置の実施例について説明す
る。
(F) Example Hereinafter, an example of the device of the present invention shown in the drawings will be described.

図は本発明装置をブロック図で説明するものであり、
(1)は異なる冗長度で記録された音声信号を可変速度
で再生する音声記憶部で、該音声記憶部(1)は再生速
度が可変されるテープレコーダやデジタルメモリーなど
である。(2)は伸縮比率決定部であり、音声信号の伸
長・圧縮比率を決定するものである。該伸縮比率決定部
(2)による伸長・圧縮比率は次のような方法によって
決定されるものである。
The figure illustrates the device of the invention in a block diagram,
(1) is an audio storage unit for reproducing an audio signal recorded with different redundancy at a variable speed, and the audio storage unit (1) is a tape recorder or a digital memory whose reproduction speed is variable. (2) is an expansion / contraction ratio determination unit that determines the expansion / compression ratio of the audio signal. The expansion / compression ratio by the expansion / contraction ratio determining unit (2) is determined by the following method.

第2図はそのフローチャートであり、再生される音声
信号を音声入力部(3)によって音声信号の母音が入力
されるまで音声を取り込み[ステップ1]、この取り込
まれた音声信号の母音の長さを母音長計算部(4)で計
算し、母音の先頭と終了を決め、母音の長さを決定する
[ステップ2]。そして母音が複数(N)個になったか
どうかを判定手段(5)によって判定する[ステップ
3]。母音の長さは同じ速さで発話しても発生する内容
によって異なるので、N個の母音の中で中心の値から閾
値以内の値についてのみ平均長計算手段(6)によって
母音の平均長を取り[ステップ4]、予め設定した比較
長設定手段(7)による比較長さ(L)と母音の平均長
との比を伸縮比率計算部(8)で演算し[ステップ
5]、この演算結果によって伸長・圧縮比率が決定され
るように構成されている。
FIG. 2 is a flowchart of the operation. The audio signal to be reproduced is fetched until the vowel of the audio signal is input by the audio input unit (3) [Step 1]. Is calculated by the vowel length calculation unit (4), the start and end of the vowel are determined, and the length of the vowel is determined [Step 2]. Then, it is determined by the determining means (5) whether or not the number of vowels is plural (N) [Step 3]. Since the length of a vowel differs depending on the content generated even when the vowel is uttered at the same speed, the average length of the vowel is calculated by the average length calculation means (6) only for values within a threshold value from the center value among the N vowels. [Step 4], the ratio of the comparison length (L) by the preset comparison length setting means (7) to the average length of the vowel is calculated by the expansion / contraction ratio calculation unit (8) [Step 5], and the calculation result is obtained. Thus, the expansion / compression ratio is determined.

第1図において、(9)は前記伸縮比率決定部(2)
によって決定された伸長・圧縮比率により音声信号の時
間軸を伸長・圧縮する時間軸制御部で、該時間軸制御部
(9)は伸縮比率と音声波形を与えると、その比率によ
って音声波形を伸ばしたり、縮めたりするものであり、
例えば、従来技術として記載した上記のPICORA法等を使
用することができる。(10)は伸長・圧縮処理されたデ
ジタルデータをアナログ波形に変換するD/A変換部であ
る。
In FIG. 1, (9) is the expansion / contraction ratio determination unit (2).
The time axis control unit (9) expands and compresses the time axis of the audio signal according to the expansion / compression ratio determined by the control unit. Or shrink,
For example, the PICORA method described above as a conventional technique can be used. Reference numeral (10) denotes a D / A conversion unit that converts digital data that has been expanded and compressed into an analog waveform.

本発明は上述のように構成されているので、音声信号
の中に含まれている母音の長さをもって信号の伸縮比率
を決定し、この比率に従って音声再生信号の時間軸を伸
長・圧縮するように動作するものである。
Since the present invention is configured as described above, the expansion and contraction ratio of the signal is determined based on the length of the vowel included in the audio signal, and the time axis of the audio reproduction signal is expanded and compressed according to this ratio. It works.

(ト) 発明の効果 従って、本発明によれば、元の音声の発話速度により
時間軸の伸長・圧縮比率を変化させることができるた
め、長時間記録されている講話などを聞く場合に、遅く
話されているところは圧縮率を高くして早く再生し、早
く話されているところは圧縮率を低くして遅く再生する
ことができ、その結果長時間の講話を短時間で分かり易
く再生することができるものである。
(G) Effect of the Invention Therefore, according to the present invention, the expansion / compression ratio of the time axis can be changed according to the speech speed of the original voice. Higher compression ratios can be played faster when spoken, and lower compression ratios can be played back slower when spoken earlier, so that long lectures can be played in a short time and easily. Is what you can do.

【図面の簡単な説明】[Brief description of the drawings]

第1図は本発明の時間軸伸長圧縮装置のブロック図、第
2図は伸縮比率決定部のフローチャートである。 (1)……音声記憶部、(2)……伸縮比率決定部、
(3)……音声入力部、(4)……母音長計算部、
(5)……判定手段、(6)……平均長計算手段、
(7)……比較長設定手段、(8)……伸縮比率計算
部、(9)……時間軸制御部、(10)……D/A変換部。
FIG. 1 is a block diagram of a time axis expansion / compression device of the present invention, and FIG. 2 is a flowchart of an expansion / contraction ratio determination unit. (1) ... voice storage unit, (2) ... expansion / contraction ratio determination unit,
(3) ... voice input unit, (4) ... vowel length calculation unit,
(5) ... determination means, (6) ... average length calculation means,
(7) ... comparison length setting means, (8) ... expansion / contraction ratio calculation unit, (9) ... time axis control unit, (10) ... D / A conversion unit.

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.6,DB名) G10L 3/02 G11B 20/02 ──────────────────────────────────────────────────続 き Continued on front page (58) Field surveyed (Int.Cl. 6 , DB name) G10L 3/02 G11B 20/02

Claims (2)

(57)【特許請求の範囲】(57) [Claims] 【請求項1】異なる冗長度で記録された音声信号を可変
速度で再生する音声記憶部と、再生した音声波形の母音
の長さにより伸長・圧縮比率を決定する伸縮比率決定部
と、該伸縮比率決定部による伸長・圧縮比率により音声
信号の時間軸を伸長・圧縮する時間軸制御部とを具備し
たことを特徴とする音声信号の時間軸伸長圧縮装置。
An audio storage unit for reproducing audio signals recorded with different degrees of redundancy at a variable speed, an expansion / contraction ratio determination unit for determining an expansion / compression ratio based on the length of a vowel of a reproduced audio waveform, A time axis control unit for expanding and compressing the time axis of an audio signal according to an expansion / compression ratio by a ratio determining unit.
【請求項2】前記伸縮比率決定部は、逐次複数個の母音
の長さの平均長と、任意に設定した長さとの比をもって
伸長・圧縮比率を決定することを特徴とした請求項
(1)記載の音声信号の時間軸伸長圧縮装置。
2. The expansion / contraction ratio determining unit according to claim 1, wherein the expansion / contraction ratio determining unit determines the expansion / compression ratio based on a ratio of an average length of a plurality of vowels to an arbitrarily set length. ).
JP2173376A 1990-06-29 1990-06-29 Audio signal time-base expansion / compression device Expired - Fee Related JP2962777B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2173376A JP2962777B2 (en) 1990-06-29 1990-06-29 Audio signal time-base expansion / compression device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2173376A JP2962777B2 (en) 1990-06-29 1990-06-29 Audio signal time-base expansion / compression device

Publications (2)

Publication Number Publication Date
JPH0460700A JPH0460700A (en) 1992-02-26
JP2962777B2 true JP2962777B2 (en) 1999-10-12

Family

ID=15959247

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2173376A Expired - Fee Related JP2962777B2 (en) 1990-06-29 1990-06-29 Audio signal time-base expansion / compression device

Country Status (1)

Country Link
JP (1) JP2962777B2 (en)

Also Published As

Publication number Publication date
JPH0460700A (en) 1992-02-26

Similar Documents

Publication Publication Date Title
JP2000511651A (en) Non-uniform time scaling of recorded audio signals
JP3308567B2 (en) Digital voice processing apparatus and digital voice processing method
US6085157A (en) Reproducing velocity converting apparatus with different speech velocity between voiced sound and unvoiced sound
JPS5982608A (en) System for controlling reproducing speed of sound
JP2001184100A (en) Speaking speed converting device
JP2962777B2 (en) Audio signal time-base expansion / compression device
JP3373933B2 (en) Speech speed converter
JP2000081897A (en) Method of recording speech information, speech information recording medium, and method and device of reproducing speech information
JPH09152889A (en) Speech speed transformer
JPH09138698A (en) Sound recording/reproducing device
JP3081469B2 (en) Speech speed converter
JPH0573089A (en) Speech reproducing method
JP2867744B2 (en) Audio playback device
JP3189587B2 (en) Audio time base converter
JPH08292790A (en) Video tape recorder
JP3189597B2 (en) Audio time base converter
JPH09146587A (en) Speech speed changer
WO1997009713A1 (en) A method of processing audio signal for fidelity varying-speed replaying
JPH0854895A (en) Reproducing device
JP2874607B2 (en) Audio time base converter
JP4529859B2 (en) Audio playback device
JPH08202391A (en) Speaking speed changing device
JPH08255000A (en) Voice signal reproducing device
JPH05303400A (en) Method and device for audio reproduction
JP3267193B2 (en) Voice reading device

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20070806

Year of fee payment: 8

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20080806

Year of fee payment: 9

LAPS Cancellation because of no payment of annual fees