TW442740B

TW442740B - Method for changing articulation speed

Info

Publication number: TW442740B
Application number: TW87121166A
Authority: TW
Inventors: Jeff Song; Kuang-Shin Lin; X B Liu
Original assignee: Inventec Corp
Priority date: 1998-12-18
Filing date: 1998-12-18
Publication date: 2001-06-23

Abstract

The present invention relates to a method for changing articulation speed, especially a method for handling the change in the playing speed of digital voice signals. It allows digital voice signals to maintain the original tone of each syllable while articulating at a non-standard speed. Based on the preset playing speed (e.g. slowing down by a half of the speed or quickening up by twice of the speed), each sound signal section in the voice signal is duplicated (or deleted) in equimultiple before using the voice processing unit to play in accordance with the original sample frequency. Thus, the sound played will match the preset playing speed while maintaining the original tone.

Description

442740 五、發明說明（1) 【發明的應用範圍】本發明係有關一種改變發音速度的方法，應用於數位化之語音資料的發音處理，用以在對數位化之語音資料進行發音速度的改變後，不會使其發音之音調失真的方法。【發明背景】請參閱「第1圖」，無論是Microsoft開發的 ActiveMovie ’ MCI，還是其它公司開發的語音編緝軟件，其在電腦中對語音的採集、存儲、播放的方式，是將各種音源產生設備（如：麥克風、卡式錄音機等）1 〇，所產生的語音信號，藉由一語音處理單元（如：語音卡）2 〇對語音信號進行採樣，並透過邏輯處理單元3 〇轉換成相-對應之數位化的語音信號，請參閱「第2圖」，數位化的語音k號4 0係由複數個音元信號段41、5 1、6 1所組成’而且每個音元信號段4 1更包含有複數個信號採點4 1 1，最後再將此數位化之語音信號4 〇存入一媒體5 0的語音文件中；在播放語音時，只要將語杜中的每個音兀信號段4 1内的信號採樣點4丄出到語音處理單元3"，再由語音處理信號採樣點4 i i放大輸出到聲音輸..出單元 = 聲音輸出單元6 0發出可聽到之聲音訊號^ 由而其中與發音有密切關係的數據是信號採樣 1’信號採樣點4 1 1是按照預先設定語音信號（係指由麥克風或卡式錄音機等設者= 行採樣，再將由這些信號接掸 Λ ，叹價屋生者）進琥铋樣點411所組成的音元信號442740 V. Description of the invention (1) [Scope of application of the invention] The present invention relates to a method for changing the pronunciation speed, which is applied to the pronunciation processing of digital voice data, and is used to change the pronunciation speed of digital voice data. Method that does not distort the pitch of its pronunciation. [Background of the Invention] Please refer to "Figure 1". Whether it is ActiveMovie 'MCI developed by Microsoft, or voice editing software developed by other companies, the way of collecting, storing, and playing voice in the computer is to convert various audio sources. A generating device (such as a microphone, a cassette recorder, etc.) 1 〇, the generated voice signal is sampled by a voice processing unit (such as a voice card) 2, and converted into a logical processing unit 3 〇 Phase-corresponding digitized voice signal, please refer to "Figure 2". The digitized voice k number 40 is composed of a plurality of vowel signal segments 41, 5 1, 6 1 'and each vowel signal Segment 4 1 further includes a plurality of signal acquisition points 4 1 1, and finally the digitized voice signal 4 0 is stored in a media 50 voice file. When playing a voice, as long as each The signal sampling point 4 in the sound signal segment 41 is output to the speech processing unit 3, and then the speech processing signal sampling point 4 ii is amplified and output to the sound output .. The output unit = the sound output unit 6 0 emits an audible sound Signal ^ Therefore, the data closely related to pronunciation is signal sampling 1 'signal sampling point 4 1 1 according to the preset voice signal (referred to by the microphone or cassette recorder, etc. = line sampling, and then these signals are connected to 掸 Λ , The sighing room) phonon signal composed of bismuth sample point 411

442740 五、發明說明（2) '"""" 段4 1經過處理後存入記錄媒體5 〇内的語音文件中。然後再，與採樣頻率相同的頻率通過語音處理單元3 〇將這些信號採樣點還原播放之。在目前的語音信號的格式中 22kHz、8bit的格式為單聲道收音機音質，44kHz、丨 ^式為立體聲CD音質；其中2驗（44kHz)就是指採樣頻率，8blt (16bit)就是指存放一個信號採樣點4工丄的位兀Ϊ/、而語音處理單元3 0就是以-既定的播 ' 根據月'j述的語音格式來播放聲音，且立體聲CD音為一…聲道收音機音質的二极ί ί ί變語音發音的方法，是以每個信號採樣點4 1 it:，$行信號採樣點411複製或删減以實加快或減慢。因此如果要將原語音的播放採二二都；個中的每個信號 :ί在號段41的波形週期就被拉長-倍，來的也立m f #果保持採樣頻率不變，則播放出 :十同時聲音就會變低、變粗。請如£1所_#為1古為原始之音几信號段4 1 1的波形圖，如圖所不係為含有一幅度為156的採樣作時 =!?信號段41，今若要以慢」倍：速度播放音元據前述之傳统變迷處理方式，就須i 曰兀“唬1中的每個信號採樣點將複製後的信號採樣點41“***音心…中並442740 V. Description of the invention (2) '" " " " Paragraph 4 1 After processing, it is stored in the voice file in the recording medium 50. Then, these signal sampling points are restored and played back by the speech processing unit 30 at the same frequency as the sampling frequency. In the format of the current voice signal, the format of 22kHz and 8bit is the sound quality of the mono radio, and the format of 44kHz and ^^ is the sound quality of the stereo CD; where the 2 test (44kHz) refers to the sampling frequency and 8blt (16bit) refers to the storage of a signal The sampling point is 4 bits, and the voice processing unit 30 is to play the sound according to the speech format described in "Preset Broadcasting", and the stereo CD sound is a two-channel radio quality. ί ί The method of changing the pronunciation of a voice is to copy or delete the signal sampling point 411 for each signal sampling point 411 to speed up or slow down. Therefore, if you want to play the original voice, use two or two; each signal: ί The waveform period at number 41 is stretched-times, and the next one is mf # If the sampling frequency is maintained, the playback Out: At the same time, the sound will become lower and thicker. Please refer to the waveform diagram of the signal section 4 1 1 for the original sound, such as £ 1. As shown in the figure, it is not a sample with an amplitude of 156. = !? Signal section 41. "Slow" times: according to the traditional obfuscation processing method described above, it is necessary to say that "every signal sampling point in Bluff 1 will insert the copied signal sampling point 41" into the heart ...

442740 五、發明說明（3) 置於原6號採樣％ 4 1 1 ^ 信號段4 1 a將^「^1的後面，那麼經過處理後的音元鄰且採樣頻率相同的所㊉’包含有多組兩兩個相如果按照預定的It:;採樣點411、4113 ’所以 Φ ® ^ ^ ^ ^诛樣頻率進行聲音的還原和播放，則原來十^ 一個振動週期的音元信號段4 1，就變成 — /1疋成一個振動週期的音元信號段41a ; 廷樣一來，語音的播妓— + Λ 们播放速度固然減慢了’但由於改變了原 t聲：！振動週期和頻率，所以語音就產生了變調的現象。这疋因為假如原本是以221^2採樣頻率錄製的音元信號段4 1，經過上述的處理後就轉換成了一個以44kHz採樣頻率iflL的音元信號段41 a，但是由於仍是按原來的 22kb/s速度播放，所以還原後的聲音的頻率比錄製時慢了一借，再加上發聲的聲調與聲波旳振動頻率有直接的關係’所以就會出現變調的現象。【發明欲解決之問題】目前的語音變速的技術，在對原語音文件的採樣信號進行處理的過程中，改變了還原後語音聲波的振動頻^广所以會出現變調的現象；因此目前的語音變速技術不論是頻率變低或變高，均會在變速後使聲音變得模糊不清w $ 成使用者在聽覺上的不悦。特別是在進行語言教學過程中，學習者一般都對口語和聽力學習感覺困難。其中—部份原因是對方說話的語速過快，初學者來不及反應。如^ 能夠將聲音的速度減慢將可以大大提高訓練的效果。【發明之概述】442740 V. Description of the invention (3) Placed on the original sample No. 6 4 1 1 ^ The signal segment 4 1 a will be after ^ "^ 1, then the processed phonons are adjacent to each other and have the same sampling frequency. If two groups of two phases are used to restore and play the sound according to the predetermined It :; sampling points 411, 4113 'so Φ ® ^ ^ ^ ^ 诛 sample frequency, the original ten ^ one vibration period of the vowel signal segment 4 1 , It becomes — / 1, a vowel signal segment 41a that oscillates into a vibration cycle; in the same way, the voice of the prostitute — + Λ is slowed down, but because the original sound is changed: the vibration cycle and Frequency, so the voice has a tone change phenomenon. This is because if the original phonon signal segment 41 was recorded at the 221 ^ 2 sampling frequency, after the above processing, it will be converted into a phonon with a sampling frequency of 44kHz iflL. The signal segment 41 a, but because it is still playing at the original 22kb / s speed, the frequency of the restored sound is a bit slower than when recording, plus the tone of the sound has a direct relationship with the frequency of the sound wave 旳 vibration ' There will be a phenomenon of tone change. [Invent The problem to be solved] The current technology of voice speed change, in the process of processing the sampled signal of the original voice file, changes the vibration frequency of the restored sound wave so that the phenomenon of tone change will occur; therefore, the current technology of voice speed change Regardless of whether the frequency becomes lower or higher, the sound will become blurred after changing the speed w $ becomes a user's hearing dissatisfaction. Especially in the process of language teaching, learners are generally speaking and listening Learning is difficult. Part of the reason is that the other party speaks too fast, and beginners have no time to respond. For example, ^ can slow down the speed of the voice will greatly improve the training effect. [Overview of the invention]

C:\Program Files\Patent\P-229TW.ptd 第 6 頁 442 7 4 Ο 五、發明說明（4) 本發明的主要目的在於提出速度的快速播放或者慢速播放時法’使得在調整語音的播放速度不變、聲音不失真。本發明處理語音變速的原理的每個信號採樣點4 1 χ作為複疋以原#音信號4 〇中的音元作遇期）41作為一個基參閱「第3圖」和「第5圖」， 4 1具有較標準之播放速度慢一圖」中曰元k號段4 1進行複製段4 1 a置於原音元信號段4 1 信號段42 (如「第5圖」所示 0以原來取樣頻率的播放速度進放，這樣一來就不會改變每個音原有頻率’而且還可在改變語音語音的語調（頻率）。有關本發明之詳細内容及技下：【圖式簡單說明】第1圖，為語音變速播放處理裝第2圖’為語音信號的波形圖。第3圖、為以原始之音元信號段第4圖’為以傳統方法經慢速播一種對語音 ’不會出現後，語音清信號進變調現淅、語 ’並不是以製或刪減的「第1 基本單號.段（即一個完整製或刪的音元，係對後的音 ’來進行複若要使輸出倍的效果時，並把複製的後面，構 )，再由語行音元信號元信號段4 播放速度後成一新音處理段4 21、4 ，仍維行任意象的方調保持圖」中元，而的振動減。請信號段「第3 元信號的音元單元3 的播 1 a的持原来術，茲就配合圖式說明如置的方塊圖的波形圖° 放處理後的波形圖C: \ Program Files \ Patent \ P-229TW.ptd Page 6 442 7 4 〇 V. Description of the invention (4) The main purpose of the present invention is to propose a method of fast playback or slow playback of the speed, so as to adjust the voice The playback speed does not change and the sound is not distorted. Each signal sampling point 4 1 χ of the present invention that handles the principle of speech shifting is used as the complex period, and the period of the original # tone signal 4 〇 is used as a reference period. 41 is used as a basis. See "Figure 3" and "Figure 5" , 4 1 has a picture that is slower than the standard playback speed. ”In the Chinese paragraph k section 4 1 copy section 4 1 a placed in the original sound signal section 4 1 signal section 42 (as shown in the" Figure 5 "0 to the original The playback speed of the sampling frequency is advanced, so that the original frequency of each tone will not be changed, and the tone (frequency) of the speech can also be changed. Details and techniques related to the present invention: [Schematic illustration ] Figure 1, for voice variable-speed playback processing, Figure 2 is the waveform diagram of the voice signal. Figure 3, the original vowel signal segment When it does not appear, the voiceless signal is changed into tone, and the language 'is not based on the "basic number 1. paragraph (ie, a complete or deleted syllable, which is the complete syllable). When you want to make the output double the effect, and put the copy on the back, construct) Then the speech line signal element signal segment 4 plays a new tone processing segment 4 21, 4 after playing speed, and still maintains the square tone retention map of any image ", while the vibration is reduced. Please signal segment" element 3 The original technique of broadcasting 1a of the phonon unit 3 of the signal will be explained with reference to the waveform diagram of the block diagram as shown in the figure.

C:\ProgramFiles\Patent\P-229TW.ptd 第 7 頁 442740 五、發明說明（5) 第5圖’為第3圖之音元信號段經慢一倍之速度播放處理後的波形圖。第6圖’為第2圖之語音信號經慢一倍之速度播放處理後的波形圖。第7圖，為第2圖之語音信號經慢二分之一倍之速度播放處理後的波形圖第8圖，為第2圖之語音信號經快一倍之速度播放處理後的波形圖。第9圖，為結構鏈表的示意圖。第1 0 — 1圖，為本發明處理語音變速播放之方法的部份流程圖。第1 0 — 2圖，為本發明處理語音變速播放之方法的流程圖。，。刀第1 0 — 3圊，為本發明處理語音變速播放之方法的流程圖。 β切【發明之詳細說明】請參閱「第2圖」，本發明所採用的方法是在進> $ 音信號4 0的變速播放時，不是複製或刪除語音作號=，中的每一個信號採樣點4 i X ，而是根據要將語^ := 0以加快或變慢方式播放的要求，對其内的音元信^二 (聲波的一個完整振動週期）4 1作複製或删除&動= 所以在對語音信號4 0作變速播放的處理之前，，找出語音信號4 0中的每個音元信號段4 1 ，以下g 、決定語音信號内之音元信號段的條件：匈C: \ ProgramFiles \ Patent \ P-229TW.ptd Page 7 442740 V. Description of the invention (5) Figure 5 'is a waveform diagram of the vowel signal segment of Figure 3 after being processed twice as slowly. Fig. 6 'is a waveform diagram of the voice signal of Fig. 2 after being processed twice as slowly. Fig. 7 is a waveform diagram of the voice signal of Fig. 2 after being played at half the slower speed. Fig. 8 is a waveform diagram of the voice signal of Fig. 2 after being played at a speed twice as fast. Figure 9 is a schematic diagram of a structured linked list. Figures 10-1 are partial flowcharts of the method for processing variable-speed speech playback of the present invention. Figs. 10 to 2 are flowcharts of a method for processing variable-speed speech playback according to the present invention. ,. Knife Nos. 10 to 3 are flowcharts of the method for processing variable-speed speech playback of the present invention. β cut [Detailed description of the invention] Please refer to "Fig. 2". The method adopted in the present invention is not to copy or delete each of the voice marks during the variable-speed playback of the> $ tone signal 40. The signal sampling point 4 i X, but according to the requirement to play the language ^: = 0 in a faster or slower way, copy or delete the phonetic letter ^ 2 (a complete vibration cycle of the sound wave) 4 1 & Dynamic = So before processing the variable speed playback of the speech signal 40, find each phonetic signal segment 4 1 in the speech signal 40, and the following g determines the condition of the speech signal segment in the speech signal : Hungary

;44274ο 五、發明說明（6) i .疼個音元信號段的起始點4 4 須疋中心點或者它和它的一止點4 5的必中心線4 6彳目a 、，，下一個信號採樣，點組成的連線與 r。琢4D相父，並且起拎號與它們下一個_號採M ^ "點4 5的採樣信或同為下降趨勢樣點組成的變化趨勢同為上升趨勢該以/“Hz起為始丄和終止點4 5之間在時間上的間良應終止點間的時間相隔為2-3毫秒。 P起-始點與右一1的二元信號段和鄰近的下一個音元信號段，應 ΐ = : Ρ兩個音元信號段的中心線4 6以上的 =-大：作：線4 6以下的最小值之間的差距小於中心線到取大變化範圍的十分之—。 4不滿足以上條件的不能作為一個音元信號段，而且對於不滿足條件的數據在語音變速處理時保持不變，既不複製也不刪減。请參閱「第1〇一丄圖」至「第1〇 — 3圖」，為本發明語音變速播放處理的流程圖，其變速的處理步騍依序為：步轉A 1 ..於數位化之語音信號中，以比較每兩個信號採樣點4 1 1的方式進行掃描，並將所有中心綠4 6上的信號採樣點4 1 1 ，或與其後之信號採樣點的連線與中心線4 6相交的採樣點’以及所有拐點（即指波峰、波谷的轉折點）的信息記錄到一個結構鏈表4 7内，其中每個鍵表4 7 1的結構如表一所示；.44274ο V. Description of the invention (6) i. The starting point 4 4 of the vowel signal segment must be the center point of it or its one-stop point 4 5 and the necessary center line 4 6 彳目 a ,,,, A signal sample, a line of points and r. 4D phase fathers, and the 拎 number and their next _ number to take M ^ " point 4 5 sampling letter or the same as the downward trend, the change trend of the sample point composition is the same as the upward trend should start with / "Hz" The time between the time between the termination point 45 and the termination point 5 should be 2-3 milliseconds. The P-start point is the binary signal segment from the right to the first 1 and the next next sound signal segment, Should ΐ =: Ρ above the center line of the two tone signal segments 4 6 =-large: operation: the difference between the minimum value below the line 4 6 is less than the center line to take a tenth of a large change range.-4 not Those that meet the above conditions cannot be used as a vowel signal segment, and the data that does not meet the conditions remains unchanged during speech shift processing, and is neither copied nor deleted. Please refer to the "No. 10 figure" to "No. 1" 〇-3 "is a flowchart of the voice variable speed playback processing of the present invention. The sequence of the variable speed processing steps is: Step A 1 .. in the digital voice signal to compare every two signal sampling points 4 1 1 to scan, and all the signal sampling points 4 1 1 on the center green 4 6 or The information about the connection point of the signal sampling point and the center line 4 6 'and the information of all inflection points (that is, the turning points of the crests and troughs) are recorded in a structural linked list 4 7 of which each key table 4 7 1 The structure is shown in Table 1.

442 74 Ο442 74 Ο

表一、鏈表的結構號採樣點與中心線之間的差值 if號採樣否為中心點耜始點的 ^£~^ψ 趕.'表結構的指$：~ 步驟A 2 . 在兩個相鄰的中遠的拐點；於結構鏈表4 6中濾除多餘的拐點記錄，心點間最多只保留一個距離中心線4 5最一個上升趨步驟A 3 .從結構鏈表4 6的頭向後尋找勢或下降趨勢的中心點；步驟A4.判斷是否存在一個上升趨勢或下降趨勢的 '^點若為是跳至步A6，若為否執行下一步驟；步驟A5·尋找下一個為上升趨勢或下降趨勢的中心點，並跳至步驟A 4 ; 步騾A 6 .判斷是有中心點的記錄，若為是執行下一步驟；若為否執行步驟A 8 ; 步驟A 7 .記錄中心點的記錄，並跳至步驟a 9 ; 步驟A 8 .記錄中心點的位置於記錄媒體中.；步驟A 9 判斷記錄媒體中是否有兩個具有相同特徵的中心點，若為是跳至夕驟A i丄，若為否，執行下一驟；Table 1. The difference between the sample number of the structure number of the linked list and the center line. If the sample of if number is ^ £ ~ ^ ψ of the starting point of the center point, hurry. 'The structure of the table means $: ~ Step A 2. Two adjacent COSCO inflection points; filter out the excess inflection point records in the structure link list 46, at most, only one distance from the center line 45 to the heart point is the most ascending step A 3. From the head of the structure link list 4 6 Step backward to find the center point of the trend or down trend; Step A4. Determine if there is a '^ point of an up trend or down trend. If yes, go to step A6, if no, go to the next step; Step A5. Find the next up The center point of the trend or downtrend, and skip to step A 4; Step 骡 A 6. Judge that there is a record of the center point, if yes, go to the next step; if no, go to step A 8; Step A 7. Record Center Step A 9; Step A 8. Record the position of the center point in the recording medium; Step A 9 Determine whether there are two center points with the same characteristics in the recording medium, if it is skipped to the evening Step A i 丄, if not, go to the next step;

C:\Program Files\Patent\P-229TW. ptd 第 10 頁 442 74 Ο 五、發明說明（8) 步驟A 1 0 一步驟，若為否步驟A 1 1 步驟A 1 2 時間上的間隔；步驟A 1 3 行下一步騍，若步驟A 1 4 音元信號段，並步驟A 1 5 號段；步驟A 1 6 大點的偏移值，線與最大值的出狀態，在變至步驟1 9，步驟A 1 大點的偏移值大值的偏移量 •判斷是否全部跳至步騍A 5 ; .计算兩個中心 •再根據採樣頻 •判斷間隔是否為否則跳至步驟 .將兩個中心點記錄到一個臨時 .重覆步驟8〜搜尋完畢，若為是執行下 :比較次一個音是否遠遠小於前移量，若為是，處理時將不對此為否’執行下一比較次一個音，是否與前一個音近似，若為是，跳偏速若 7 行下一步驟步驟A 基準，跳至步驟A 了比較辨認點之間的偏移量. 率計算出兩個中心點之間 h於2至3亳秒 A 5 ；右馬是執間的信號採樣點作為— 的記錄媒體中；為個 14尋找出次—個音元信兀信號段中’中心線與最 —個音元信號段中，中心則可以認定此為語音的淡段聲音做特殊處理，並跳步驟；元信號段中，中心線與最元信號段中，中心線與最至步驟19，若為否，執C: \ Program Files \ Patent \ P-229TW. Ptd Page 10 442 74 Ο V. Description of the invention (8) Step A 1 0 One step, if not Step A 1 1 Step A 1 2 Time interval; Step Step A 1 to the next line. If step A 1 4 is the vowel signal section and step A 1 5; step A 1 6 is the offset value of the large point, and the state of the line and the maximum value is changed to step 1. 9. Step A 1 Large point offset value Large value offset • Determine whether all jump to step 骒 A 5;. Calculate two centers • Then according to the sampling frequency • Determine if the interval is otherwise skip to step. A center point is recorded to a temporary. Repeat step 8 ~ After the search is completed, if it is executed, compare whether the next note is far less than the amount of forward movement. If so, do not perform this comparison when processing. The next tone, is it similar to the previous tone, if yes, skip speed if 7 lines Next step step A benchmark, skip to step A to compare the offset between the identified points. The rate calculates the two center points H between 2 and 3 leap seconds A 5; the right horse is the record of the signal sampling point as — In the media; find out the center line and the most phonetic signal segment of the sub-single signal segment for each of the 14; the center can identify this as the light voice of the voice to do special processing and skip the steps; In the meta signal segment, the center line and the most meta signal segment, the center line and the most up to step 19, if not, execute

1 8 ·以第一個音元信號段的第二個中心點步驟A 5 ; 1 9 ·判斷結構表中的所有記錄點是否都經過，若為是執行下步驟，若為否，則跳至步驟A1 8 · Step A 5 with the second center point of the first vowel signal segment; 1 9 · Determine whether all recorded points in the structure table have passed. If yes, go to the next step. If no, skip to Step A

442 7 4 0 五、發明說明（9)5 ；步驟A 2 0 -確定語音信號中音元信號段；步驟A 2 1 ·根據發音速度的設定將所有的音元信號段於一記錄媒體中進行複製；步驟A2 2 ·以語音處理單元2 0，將複製於記錄媒體中的音元信號段轉換成可聽之聲音訊號；步驟A 2 3，判斷是否已處理完所有複製後的音元信號段，若為是，執行步驟A2 5，若為否，執行下一步驟；步驟A 2 4 ·取出下一筆複製後的音元信號段，並跳至步驟A 2 2 ;以及步驟A 2 5 ·將語音處理單元2 ◦置於等待狀態。在上述步驟A2 1中，若所設定發音速度較標準的發音速度的慢一倍，則「第2圖」中的語音信號4 0經處理後將如「第6圖」所示，將每個音元信號段4 1 、5 1 、 6 1在記錄媒體中做兩次的複製，於是在原來的每個音元信號段4 1、5 1、6 1之後將分別產生音元信號段4 1 a、5 1 a、6 1 a ;但是若所設定發音速度較標準的發音速度慢二分之一倍，則會如「第7圖j所示，將語音信號中奇數的音元信號段4 1、.6 1 ，在記錄媒體中做兩次的複製，產生音元信號段41a、61a ，偶數的音元信號段5 1 ，則在記錄媒體中只做一次的複製；再者，請參閱「第8圖」，若是設定發音速度較標準的發音速度快一倍，則是每隔一個音元信號段，在記錄媒體中複製一個音442 7 4 0 V. Description of the invention (9) 5; Step A 2 0-Determine the vowel signal segments in the speech signal; Step A 2 1 · Perform all vowel signal segments in a recording medium according to the setting of the pronunciation speed Copy; Step A2 2 · Use the voice processing unit 20 to convert the vowel signal segments copied in the recording medium into audible sound signals; Step A 2 3 to determine whether all the copied vowel signal segments have been processed If yes, go to step A2 5; if no, go to next step; Step A 2 4 · Take out the next copied vowel signal segment and skip to step A 2 2; and step A 2 5 · will Voice processing unit 2 ◦ Put on standby. In the above step A21, if the set pronunciation speed is twice as slow as the standard pronunciation speed, the voice signal 40 in the "picture 2" will be processed as shown in "picture 6", and each The phonetic signal segments 4 1, 5 1, and 6 1 are duplicated twice in the recording medium, so that each of the original phoneme signal segments 4 1, 5 1, and 6 1 will be separately generated after each of the phoneme signal segments 4 1 a, 5 1 a, 6 1 a; However, if the set pronunciation speed is one-half times slower than the standard pronunciation speed, as shown in "Figure 7 j, the odd number of vowel signal segments in the speech signal will be 4 1, .6 1, making two copies in the recording medium to generate the vowel signal segments 41a, 61a, and the even number of vowel signal segments 5 1, then making the copy only once in the recording medium; moreover, see "Figure 8", if the pronunciation speed is set to be twice as fast as the standard pronunciation speed, then every other phoneme signal segment is copied in the recording medium

C:\ProgramFiles\Patent\P-229TW.ptd 第 12 頁 442 74 Ο 五、發明說明αο) 元信號段’也就是只對在該語音信號中為奇數壙位的音元信號段4 1、6 1進行複製，便可以實現語音的快速播放。以上所述僅為本發明之較佳實施例，並不限於以上述之硬體的裝置實施’舉凡任何熟習此項技藝者在本發明之領域内所做的任何修飾’具有同等之功效者，均下列之申請專利範圍内。 ‘ 【發明之效果】本發明之方法對各種格式的5吾音文件都可以做變速處理’以使得在調整語音的播放速度後，所產生的語音清淅、語調保持不變、聲音不失真。【圖示符號說明】 1 0音源產生設備 20語音處理單元 3 0邏輯處理單元 4 0語音信號 4 1音元信號段 4 1 a音元信號段 4 1 1信號採樣點 4 1 1 a信號採樣點 4 2音元信號段 4 4起始點 4 5終止點 4 6中心線C: \ ProgramFiles \ Patent \ P-229TW.ptd Page 12 442 74 Ο V. Description of the invention αο) Meta signal segment 'that is, only for the vowel signal segment which is an odd number of bits in the voice signal 4 1, 6 1 Make a copy to achieve fast voice playback. The above is only a preferred embodiment of the present invention, and is not limited to the implementation of 'any modification made by any person skilled in the art in the field of the present invention' with the hardware device described above, which has equivalent efficacy, Within the scope of the following patent applications. [Effects of the invention] The method of the present invention can perform variable-speed processing on various vocal files in various formats' so that after adjusting the playback speed of the voice, the resulting voice is clear, the tone remains unchanged, and the sound is not distorted. [Illustration of symbols] 1 0 sound source generating device 20 voice processing unit 3 0 logic processing unit 4 0 voice signal 4 1 phoneme signal segment 4 1 a phoneme signal segment 4 1 1 signal sampling point 4 1 1 a signal sampling point 4 2 phonetic signal segment 4 4 start point 4 5 end point 4 6 center line

C:\Program Files\Patent\P-229TW. ptd 第 13 頁 4 42 74 Ο 五、發明說明αu 4 7結構鏈表 4 7 1鏈表 5 0記錄媒體 5 1音元信號段 5 1 a音元信號段 6 0聲音輸出單元 6 1音元信號段 6 1 a音元信號段 C:\Program Files\Patent\P-229TW. ptd 第 14 頁C: \ Program Files \ Patent \ P-229TW. Ptd Page 13 4 42 74 〇 V. Description of the invention αu 4 7 Structure linked list 4 7 1 linked list 5 0 recording medium 5 1 phoneme signal segment 5 1 a phoneme Signal section 6 0 sound output unit 6 1 phoneme signal section 6 1 a phoneme signal section C: \ Program Files \ Patent \ P-229TW. Ptd page 14

Claims

44274 Ο 六、申請專利範圍 - 1 一種改變發音速度的方法，應用於數位化之語音信號的播放，以讓一語咅虚瑰罝元处 M . ^ ^ ^ ^ s處理早几能以預定之發音速度播放該S吾音彳S唬，其包括有：取得該語音信號中的一音元信號段；設定該語音信號的一播放速度；由一邏輯運算單元根據該播放速度，一記錄媒體十丨以及设製該音疋信號段於藉由該語音處理單元，將儲存於該記錄換成可聽之聲音訊號。 ' S元k號段 2.如申請專利範圍第1項所述改變雜立、ώ 法，其中該音元信號段係由複數個信號 * s速度的方 3 .如申請專利範圍第Μ所述^變=占立所組成。法，其中該邏輯運算單元係將該音元信號赞；曰速度的方記錄媒體中。』又複製兩次於該 4 ·如申請專利範圍第1項所述改變法，其中該邏輯運算單元係將在該語音信號s迷度的方的該音元信號段複製兩次於該記錄媒體中^ =為奇數順位的該音元信號段複製一次於該記錄媒體中。芘將偶數順位 5 .如申請專利範圍第1項所述改變發立法’其中該邏輯-運算單元係僅將在該語音信/速度的方的該音元信號段複製一次於該記錄媒體中:為奇數順44274 〇 6. Scope of patent application-1 A method to change the speed of pronunciation, applied to the playback of digitized voice signals, so that a single word can be processed in the original place M. ^ ^ ^ ^ s processing can be scheduled as soon as possible The sound speed plays the sound sound, including: obtaining a phonetic signal segment in the speech signal; setting a playback speed of the speech signal; a logical operation unit according to the playback speed, a recording medium ten丨 and set the audio signal segment to change the stored in the record into an audible sound signal by the voice processing unit. 'S yuan k segment 2. The method of changing hybrid or free-sale as described in item 1 of the scope of patent application, wherein the vowel signal segment is composed of a plurality of signals * s speed 3. As described in the scope of patent application M ^ 变 = 占立 composed. Method, wherein the logical operation unit praises the vowel signal; the method of speed is recorded in the recording medium. "It is duplicated twice in the 4. The change method as described in item 1 of the scope of patent application, wherein the logical operation unit duplicates the vowel signal segment on the square of the voice signal s to the recording medium twice. The middle ^ = is an odd-numbered sequence of the vowel signal segment copied once in the recording medium.芘 Change the even-numbered rank 5 as described in item 1 of the scope of patent application, where the logic-operation unit is to copy the vowel signal segment on the side of the voice message / speed only once in the recording medium: Odd-numbered

C:\Program Files\Patent\P-229TW. ptd 第15頁C: \ Program Files \ Patent \ P-229TW. Ptd Page 15