GB1485803A - Method and apparatus for the analysis and synthesis of speech - Google Patents
Method and apparatus for the analysis and synthesis of speechInfo
- Publication number
- GB1485803A GB1485803A GB41575/74A GB4157574A GB1485803A GB 1485803 A GB1485803 A GB 1485803A GB 41575/74 A GB41575/74 A GB 41575/74A GB 4157574 A GB4157574 A GB 4157574A GB 1485803 A GB1485803 A GB 1485803A
- Authority
- GB
- United Kingdom
- Prior art keywords
- model
- vocal tract
- speech
- fed
- output
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired
Links
- 230000015572 biosynthetic process Effects 0.000 title abstract 3
- 238000003786 synthesis reaction Methods 0.000 title abstract 3
- 238000000034 method Methods 0.000 title abstract 2
- 230000001755 vocal effect Effects 0.000 abstract 11
- 239000013598 vector Substances 0.000 abstract 9
- 239000011159 matrix material Substances 0.000 abstract 6
- 230000005540 biological transmission Effects 0.000 abstract 3
- 230000009977 dual effect Effects 0.000 abstract 3
- 230000005284 excitation Effects 0.000 abstract 3
- 238000006243 chemical reaction Methods 0.000 abstract 1
- 239000003795 chemical substances by application Substances 0.000 abstract 1
- 238000004891 communication Methods 0.000 abstract 1
- 238000001914 filtration Methods 0.000 abstract 1
- 238000005457 optimization Methods 0.000 abstract 1
- 238000005070 sampling Methods 0.000 abstract 1
- 230000007704 transition Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
1485803 Analysis synthesis communication system GRETAG AG 24 Sept 1974 [22 July 1974] 41575/74 Heading H4R In an analysis synthesis telephone system the analyser includes a synthesizer which is identical to the synthesizer used at the receiver of the system, and is adjusted in accordance with the transmitted parameters, the output of the synthesizer is compared with stored samples of the original speech, and the transmitted parameters are adjusted to minimize the comparator output. As described the speech input from 1 after filtering in filter 2, having a cut-off frequency in the region 3-5 kHz, is converted to digital form in the A. to D converter 3 operating at a sampling frequency of 6 to 10 kHz and a short sample of speech, of 10 to 30 msecs. length is stored in unit 5. Simultaneously the pitch detector 4 determines, in conventional manner, whether the speech is voiced or unvoiced and, if voiced, the pitch period. The pitch detector output is fed to coder 10 for transmission to the synthesizer S, and also to a generator 6 which generates pitch pulses at the appropriate frequency if the speech is voiced, or a pseudo-random pulse stream if the speech is unvoiced. The output from generator 6 is fed to a vocal tract model 7, which is a linear digital filter simulating the transfer characteristic of the vocal tract, and the output from the model is compared at 8 with the incoming speech to provide an error signal which is used to adjust the parameters of the vocal tract model to reduce the error signal from the comparator. When the error signal has been reduced to a predetermined level the parameters set in the model 7 are coded in the coder 10 and transmitted. At the receiver a similar pulse/noise generator 6<SP>1</SP> and vocal tract model 7<SP>1</SP> to that in the transmitter are used to synthesize a signal suitable for conversion back to analogue form and reproduction on speaker 13. Since similar filters, generators, and vocal tract models, are used for transmission and reception it is suggested that only one filter, one generator, and one vocal tract model, are used with switching to change their function between transmission and reception. Vocal tract model, Fig. 2a, which includes a number of storage locations U 1 to U 8 , which forms the state vector U n of the model on the nth cycle, 8 linear combinations are formed from these values in the feedback matrix 22, the nth sample of the excitation sequence multiplied by the input vectors components b 1 to b 8 in multipliers 23 is added to each of the linear combinations from the matrix 22 in adders 26 to provide the new state vector U n + 1 for the n + 1th cycle. The output sequence y n is calculated as a linear combination of the values in the store 21 which are multiplied by the output vector components C 1 to C 8 in multipliers 24 and added, in 27, to the input pulse x n multiplied by a scalar quantity d to obtain the output sequence y n . The components of matrix A and vectors b and c and scalar quantity d can be divided into two groups, those that are invariable, and normally have simple values such as 0; 1, or - 1, and those that are changed by the optimization process. Parameter computer, Fig. 3a (not shown), includes a first primary model (29) a unit (30), and 8 primary part models (31 to 38). The primary model (29), is identical to the vocal tract model and can therefore be used as such, it is fed with the excitation function x n and yields the synthetic speech signal y n and the partial derivatives which is equal to the corresponding component Ui of the state vector U, and the derivative which is equal to the corresponding term of the excitation sequence x n in respect of the transition coefficient d. The unit 30 (not shown) is a dual model with respect to the vocal tract model and from it can be derived the partial derivatives In addition the components of the state vector U 1 to U 8 , formed by a matrix which is the transpose of the matrix 22 of the vocal tract model, are fed to primary part models 31 to 38 and the state vectors of the primary part modes each provide The partial derivatives of y n are fed to computer stages, Fig. 4 (not shown), to derive the partial derivatives of the error signal, in accordance with the selected error dimension, which partial derivatives are fed back to the parameter computer, Fig. 3a, to increment the parameters to enable y n to more closely approach the incoming speech signal S n . In an alternative arrangement the parameter computer includes the primary model (39), equivalent to the vocal tract model, and a dual part model (40) (as in 29 and 30 in Fig. 3a) but the state vector components U 1 to U 8 of the primary model (39) are fed to primary dual part models (41 to 48) to obtain the partial derivatives with respect to the matrix coefficients.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CH1006674A CH581878A5 (en) | 1974-07-22 | 1974-07-22 |
Publications (1)
Publication Number | Publication Date |
---|---|
GB1485803A true GB1485803A (en) | 1977-09-14 |
Family
ID=4358956
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB41575/74A Expired GB1485803A (en) | 1974-07-22 | 1974-09-24 | Method and apparatus for the analysis and synthesis of speech |
Country Status (4)
Country | Link |
---|---|
US (1) | US3909533A (en) |
CA (1) | CA1039407A (en) |
CH (1) | CH581878A5 (en) |
GB (1) | GB1485803A (en) |
Families Citing this family (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5154714A (en) * | 1974-10-16 | 1976-05-14 | Nippon Telegraph & Telephone | Tajuonseidensohoshiki |
US4058676A (en) * | 1975-07-07 | 1977-11-15 | International Communication Sciences | Speech analysis and synthesis system |
DE2536585C3 (en) * | 1975-08-16 | 1981-04-02 | Philips Patentverwaltung Gmbh, 2000 Hamburg | Arrangement for statistical signal analysis |
US4051331A (en) * | 1976-03-29 | 1977-09-27 | Brigham Young University | Speech coding hearing aid system utilizing formant frequency transformation |
IT1083533B (en) * | 1977-06-20 | 1985-05-21 | Cselt Centro Studi Lab Telecom | PROCEDURE AND DEVICE FOR THE GENERATION OF A VOICE TYPE SIGNAL FOR THE PERFORMANCE OF OBJECTIVE MEASUREMENTS OF EQUIPMENT PERFORMANCE PART OF VOICE SIGNAL TRANSMISSION SYSTEMS |
US4406626A (en) * | 1979-07-31 | 1983-09-27 | Anderson Weston A | Electronic teaching aid |
JPS56500980A (en) * | 1979-07-31 | 1981-07-16 | ||
US4319083A (en) * | 1980-02-04 | 1982-03-09 | Texas Instruments Incorporated | Integrated speech synthesis circuit with internal and external excitation capabilities |
JPS58162470A (en) * | 1982-03-24 | 1983-09-27 | 三菱電機株式会社 | Register for calling of elevator |
US4520499A (en) * | 1982-06-25 | 1985-05-28 | Milton Bradley Company | Combination speech synthesis and recognition apparatus |
US5127055A (en) * | 1988-12-30 | 1992-06-30 | Kurzweil Applied Intelligence, Inc. | Speech recognition apparatus & method having dynamic reference pattern adaptation |
US4972474A (en) * | 1989-05-01 | 1990-11-20 | Cylink Corporation | Integer encryptor |
SG47586A1 (en) * | 1990-12-21 | 1998-04-17 | British Telecomm | Speech coding |
JP2810252B2 (en) * | 1991-05-22 | 1998-10-15 | シャープ株式会社 | Audio playback device |
FI96247C (en) * | 1993-02-12 | 1996-05-27 | Nokia Telecommunications Oy | Procedure for converting speech |
US5471527A (en) | 1993-12-02 | 1995-11-28 | Dsc Communications Corporation | Voice enhancement system and method |
US5797120A (en) * | 1996-09-04 | 1998-08-18 | Advanced Micro Devices, Inc. | System and method for generating re-configurable band limited noise using modulation |
US20040210440A1 (en) * | 2002-11-01 | 2004-10-21 | Khosrow Lashkari | Efficient implementation for joint optimization of excitation and model parameters with a general excitation function |
US20130229530A1 (en) * | 2012-03-02 | 2013-09-05 | Apple Inc. | Spectral calibration of imaging devices |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3631520A (en) * | 1968-08-19 | 1971-12-28 | Bell Telephone Labor Inc | Predictive coding of speech signals |
US3624302A (en) * | 1969-10-29 | 1971-11-30 | Bell Telephone Labor Inc | Speech analysis and synthesis by the use of the linear prediction of a speech wave |
-
1974
- 1974-07-22 CH CH1006674A patent/CH581878A5/xx not_active IP Right Cessation
- 1974-09-24 GB GB41575/74A patent/GB1485803A/en not_active Expired
- 1974-10-08 US US513160A patent/US3909533A/en not_active Expired - Lifetime
- 1974-10-09 CA CA211,115A patent/CA1039407A/en not_active Expired
Also Published As
Publication number | Publication date |
---|---|
CH581878A5 (en) | 1976-11-15 |
CA1039407A (en) | 1978-09-26 |
US3909533A (en) | 1975-09-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
GB1485803A (en) | Method and apparatus for the analysis and synthesis of speech | |
US4220819A (en) | Residual excited predictive speech coding system | |
CA1046642A (en) | Phase vocoder speech synthesis system | |
US5007095A (en) | System for synthesizing speech having fluctuation | |
CA2160749C (en) | Speech coding apparatus, speech decoding apparatus, speech coding and decoding method and a phase amplitude characteristic extracting apparatus for carrying out the method | |
CA2010830C (en) | Dynamic codebook for efficient speech coding based on algebraic codes | |
US4301329A (en) | Speech analysis and synthesis apparatus | |
US6006174A (en) | Multiple impulse excitation speech encoder and decoder | |
US4389540A (en) | Adaptive linear prediction filters | |
US4742550A (en) | 4800 BPS interoperable relp system | |
KR920022683A (en) | Sampling frequency converter | |
US5048088A (en) | Linear predictive speech analysis-synthesis apparatus | |
CA2154881A1 (en) | A system and method for compression and decompression of audio signals | |
US4700393A (en) | Speech synthesizer with variable speed of speech | |
US5369730A (en) | Speech synthesizer | |
US4845753A (en) | Pitch detecting device | |
US4601052A (en) | Voice analysis composing method | |
Rabiner et al. | A hardware realization of a digital formant speech synthesizer | |
JPS57197600A (en) | Phonemic piece editting type voice synthesization system | |
DE3478065D1 (en) | Method and apparatus for coding digital signals | |
JPS59195283A (en) | Electronic musical instrument | |
SU1316030A1 (en) | Method and apparatus for analyzing and synthesizing speech | |
KR100264389B1 (en) | Computer music cycle with key change function | |
Jewett et al. | Pitch Variations in Vocoded Voice | |
SU1767698A1 (en) | Sonic signal frequency coder |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PS | Patent sealed [section 19, patents act 1949] | ||
746 | Register noted 'licences of right' (sect. 46/1977) | ||
PCNP | Patent ceased through non-payment of renewal fee |