BR9612624A - Speech synthesizer having acoustic element database - Google Patents
Speech synthesizer having acoustic element databaseInfo
- Publication number
- BR9612624A BR9612624A BR9612624-8A BR9612624A BR9612624A BR 9612624 A BR9612624 A BR 9612624A BR 9612624 A BR9612624 A BR 9612624A BR 9612624 A BR9612624 A BR 9612624A
- Authority
- BR
- Brazil
- Prior art keywords
- database
- phonetic
- sequences
- trajectories
- acoustic element
- Prior art date
Links
- 238000000034 method Methods 0.000 abstract 1
- 238000001308 synthesis method Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
<B>SINTETIZADOR DE FALA TENDO BASE DE DADOS DE ELEMENTO ACúSTICO<D> Um método de síntese de fala emprega uma base de dados de elemento acústico que é estabelecida a partir de seq³ências fonéticas ocorridas em um intervalo de um sinal de fala. ao estabelecer a base de dados, trajetórias são determinadas (220) para cada uma das seq³ências fonéticas contendo um segmento fonético que corresponde a um fonema particular (210). Uma região de tolerância é então identificada baseada em uma concentração de trajetórias que correspondem às seq³ências de fonemas diferentes (230). Os elementos acústicos para a base de dados (260) são formados por porções das seq³ências fonéticas ao identificar pontos de corte (250) nas seq³ências fonéticas que correspondem aos pontos de tempo ao longo das trajetórias respectivas próximas à região de tolerância (240). Desta maneira, é possível concatenar os elementos acústicos tendo um fonema de junção comum, de modo que descontinuidades perceptíveis nos fonemas de junção sejam minimizadas. Métodos computacionalmente simples e rápidos para determinar a região de tolerância são também expostos.<B> SPEAK SYNTHESIZER HAVING ACOUSTIC ELEMENT DATABASE <D> A speech synthesis method employs an acoustic element database that is established from phonetic sequences occurring in an interval of a speech signal. when establishing the database, trajectories are determined (220) for each of the phonetic sequences containing a phonetic segment that corresponds to a particular phoneme (210). A tolerance region is then identified based on a concentration of trajectories that correspond to the different phoneme sequences (230). The acoustic elements for the database (260) are formed by portions of the phonetic sequences when identifying cutoff points (250) in the phonetic sequences that correspond to the time points along the respective trajectories close to the tolerance region (240). In this way, it is possible to concatenate the acoustic elements having a common junction phoneme, so that discernible discontinuities in the junction phonemes are minimized. Computationally simple and fast methods for determining the tolerance region are also exposed.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/515,887 US5751907A (en) | 1995-08-16 | 1995-08-16 | Speech synthesizer having an acoustic element database |
PCT/US1996/012628 WO1997007500A1 (en) | 1995-08-16 | 1996-08-02 | Speech synthesizer having an acoustic element database |
Publications (1)
Publication Number | Publication Date |
---|---|
BR9612624A true BR9612624A (en) | 2000-05-23 |
Family
ID=24053185
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BR9612624-8A BR9612624A (en) | 1995-08-16 | 1996-08-02 | Speech synthesizer having acoustic element database |
Country Status (10)
Country | Link |
---|---|
US (1) | US5751907A (en) |
EP (1) | EP0845139B1 (en) |
JP (1) | JP3340748B2 (en) |
AU (1) | AU6645096A (en) |
BR (1) | BR9612624A (en) |
CA (1) | CA2222582C (en) |
DE (1) | DE69627865T2 (en) |
MX (1) | MX9801086A (en) |
TW (1) | TW305990B (en) |
WO (1) | WO1997007500A1 (en) |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7251314B2 (en) * | 1994-10-18 | 2007-07-31 | Lucent Technologies | Voice message transfer between a sender and a receiver |
JP3349905B2 (en) * | 1996-12-10 | 2002-11-25 | 松下電器産業株式会社 | Voice synthesis method and apparatus |
JP2000075878A (en) * | 1998-08-31 | 2000-03-14 | Canon Inc | Device and method for voice synthesis and storage medium |
US6202049B1 (en) | 1999-03-09 | 2001-03-13 | Matsushita Electric Industrial Co., Ltd. | Identification of unit overlap regions for concatenative speech synthesis system |
US6178402B1 (en) * | 1999-04-29 | 2001-01-23 | Motorola, Inc. | Method, apparatus and system for generating acoustic parameters in a text-to-speech system using a neural network |
US7369994B1 (en) | 1999-04-30 | 2008-05-06 | At&T Corp. | Methods and apparatus for rapid acoustic unit selection from a large speech corpus |
US6618699B1 (en) | 1999-08-30 | 2003-09-09 | Lucent Technologies Inc. | Formant tracking based on phoneme information |
US7149690B2 (en) | 1999-09-09 | 2006-12-12 | Lucent Technologies Inc. | Method and apparatus for interactive language instruction |
US6725190B1 (en) * | 1999-11-02 | 2004-04-20 | International Business Machines Corporation | Method and system for speech reconstruction from speech recognition features, pitch and voicing with resampled basis functions providing reconstruction of the spectral envelope |
US7392185B2 (en) * | 1999-11-12 | 2008-06-24 | Phoenix Solutions, Inc. | Speech based learning/training system using semantic decoding |
US7050977B1 (en) | 1999-11-12 | 2006-05-23 | Phoenix Solutions, Inc. | Speech-enabled server for internet website and method |
US9076448B2 (en) * | 1999-11-12 | 2015-07-07 | Nuance Communications, Inc. | Distributed real time speech recognition system |
US7725307B2 (en) * | 1999-11-12 | 2010-05-25 | Phoenix Solutions, Inc. | Query engine for processing voice based queries including semantic decoding |
US7400712B2 (en) * | 2001-01-18 | 2008-07-15 | Lucent Technologies Inc. | Network provided information using text-to-speech and speech recognition and text or speech activated network control sequences for complimentary feature access |
US6625576B2 (en) | 2001-01-29 | 2003-09-23 | Lucent Technologies Inc. | Method and apparatus for performing text-to-speech conversion in a client/server environment |
US7010488B2 (en) * | 2002-05-09 | 2006-03-07 | Oregon Health & Science University | System and method for compressing concatenative acoustic inventories for speech synthesis |
US20040030555A1 (en) * | 2002-08-12 | 2004-02-12 | Oregon Health & Science University | System and method for concatenating acoustic contours for speech synthesis |
US7542903B2 (en) * | 2004-02-18 | 2009-06-02 | Fuji Xerox Co., Ltd. | Systems and methods for determining predictive models of discourse functions |
US20050187772A1 (en) * | 2004-02-25 | 2005-08-25 | Fuji Xerox Co., Ltd. | Systems and methods for synthesizing speech using discourse function level prosodic features |
JP4878538B2 (en) * | 2006-10-24 | 2012-02-15 | 株式会社日立製作所 | Speech synthesizer |
US8103506B1 (en) * | 2007-09-20 | 2012-01-24 | United Services Automobile Association | Free text matching system and method |
JP2011180416A (en) * | 2010-03-02 | 2011-09-15 | Denso Corp | Voice synthesis device, voice synthesis method and car navigation system |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3704345A (en) * | 1971-03-19 | 1972-11-28 | Bell Telephone Labor Inc | Conversion of printed text into synthetic speech |
BG24190A1 (en) * | 1976-09-08 | 1978-01-10 | Antonov | Method of synthesis of speech and device for effecting same |
US4692941A (en) * | 1984-04-10 | 1987-09-08 | First Byte | Real-time text-to-speech conversion system |
US4831654A (en) * | 1985-09-09 | 1989-05-16 | Wang Laboratories, Inc. | Apparatus for making and editing dictionary entries in a text to speech conversion system |
WO1987002816A1 (en) * | 1985-10-30 | 1987-05-07 | Central Institute For The Deaf | Speech processing apparatus and methods |
US4820059A (en) * | 1985-10-30 | 1989-04-11 | Central Institute For The Deaf | Speech processing apparatus and methods |
US4829580A (en) * | 1986-03-26 | 1989-05-09 | Telephone And Telegraph Company, At&T Bell Laboratories | Text analysis system with letter sequence recognition and speech stress assignment arrangement |
GB2207027B (en) * | 1987-07-15 | 1992-01-08 | Matsushita Electric Works Ltd | Voice encoding and composing system |
US4979216A (en) * | 1989-02-17 | 1990-12-18 | Malsheen Bathsheba J | Text to speech synthesis system and method using context dependent vowel allophones |
JPH031200A (en) * | 1989-05-29 | 1991-01-07 | Nec Corp | Regulation type voice synthesizing device |
US5235669A (en) * | 1990-06-29 | 1993-08-10 | At&T Laboratories | Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec |
US5283833A (en) * | 1991-09-19 | 1994-02-01 | At&T Bell Laboratories | Method and apparatus for speech processing using morphology and rhyming |
JPH05181491A (en) * | 1991-12-30 | 1993-07-23 | Sony Corp | Speech synthesizing device |
US5490234A (en) * | 1993-01-21 | 1996-02-06 | Apple Computer, Inc. | Waveform blending technique for text-to-speech system |
-
1995
- 1995-08-16 US US08/515,887 patent/US5751907A/en not_active Expired - Lifetime
-
1996
- 1996-08-02 EP EP96926228A patent/EP0845139B1/en not_active Expired - Lifetime
- 1996-08-02 WO PCT/US1996/012628 patent/WO1997007500A1/en active IP Right Grant
- 1996-08-02 CA CA002222582A patent/CA2222582C/en not_active Expired - Fee Related
- 1996-08-02 BR BR9612624-8A patent/BR9612624A/en not_active Application Discontinuation
- 1996-08-02 DE DE69627865T patent/DE69627865T2/en not_active Expired - Lifetime
- 1996-08-02 AU AU66450/96A patent/AU6645096A/en not_active Abandoned
- 1996-08-02 JP JP50931697A patent/JP3340748B2/en not_active Expired - Fee Related
- 1996-08-02 MX MX9801086A patent/MX9801086A/en not_active IP Right Cessation
- 1996-08-13 TW TW085109787A patent/TW305990B/zh not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
EP0845139B1 (en) | 2003-05-02 |
US5751907A (en) | 1998-05-12 |
DE69627865D1 (en) | 2003-06-05 |
DE69627865T2 (en) | 2004-02-19 |
MX9801086A (en) | 1998-04-30 |
EP0845139A1 (en) | 1998-06-03 |
CA2222582A1 (en) | 1997-02-27 |
CA2222582C (en) | 2001-09-11 |
EP0845139A4 (en) | 1999-10-20 |
WO1997007500A1 (en) | 1997-02-27 |
JP3340748B2 (en) | 2002-11-05 |
TW305990B (en) | 1997-05-21 |
AU6645096A (en) | 1997-03-12 |
JP2000509157A (en) | 2000-07-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR9612624A (en) | Speech synthesizer having acoustic element database | |
Ney et al. | Improvements in beam search for 10000-word continuous speech recognition | |
CA2313526A1 (en) | Apparatus and methods for detecting emotions | |
EP0387602A3 (en) | Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system | |
JPS57158900A (en) | Text voice synthesizer | |
FI955025A (en) | Method and apparatus for detecting and developing transient situations in audible signals | |
Wightman et al. | The aligner: Text-to-speech alignment using Markov models | |
Gordon | Induction of rate-dependent processing by coarse-grained aspects of speech | |
SE9402284D0 (en) | Method and apparatus for adapting a speech recognition equipment for dialectal variations in a language | |
Larreur et al. | Linguistic and prosodic processing for a text-to-speech synthesis system. | |
Post | French tonal structures | |
EP1010170A4 (en) | Method and system for automatic text-independent grading of pronunciation for language instruction | |
JP2940835B2 (en) | Pitch frequency difference feature extraction method | |
Xu et al. | Intonation perception of low-pass filtered speech in Mandarin and Cantonese | |
Yoshioka et al. | Laryngeal adjustments in Japanese voiceless sound production | |
Campbell | Durational cues to prominence and grouping | |
Gustafson | Transcribing names with foreign origin in the ONOMASTICA project | |
Tatham et al. | Syllable reconstruction in concatenated waveform speech synthesis | |
Dilley et al. | Ambiguity in prominence perception in spoken utterances of American English | |
Tseng | A Linguistic Analysis of Repair Signals in Co-operative Spoken Dialogues | |
Carlson et al. | Segmental intelligibility of synthetic and natural speech in real and nonsense words. | |
JPH09244681A (en) | Method and device for speech segmentation | |
IT1179093B (en) | PROCEDURE AND DEVICE FOR RECOGNITION WITHOUT PREVENTIVE TRAINING OF WORDS RELATED TO SMALL VOCABULARS | |
Sirigos et al. | A comparison of several speech parameters for speaker independent speech recognition and speaker recognition. | |
Dent | Voice onset time of spontaneously spoken Spanish voiceless stops |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FA10 | Dismissal: dismissal - article 33 of industrial property law | ||
B15K | Others concerning applications: alteration of classification |
Ipc: G10L 13/02 (2013.01) |