BR9612624A - Speech synthesizer having acoustic element database - Google Patents

Speech synthesizer having acoustic element database

Info

Publication number
BR9612624A
BR9612624A BR9612624-8A BR9612624A BR9612624A BR 9612624 A BR9612624 A BR 9612624A BR 9612624 A BR9612624 A BR 9612624A BR 9612624 A BR9612624 A BR 9612624A
Authority
BR
Brazil
Prior art keywords
database
phonetic
sequences
trajectories
acoustic element
Prior art date
Application number
BR9612624-8A
Other languages
Portuguese (pt)
Inventor
Bernd Moebius
Joseph Philip Olive
Michael Abraham Tanenblatt
Jan Pieter Van Santen
Original Assignee
Lucent Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lucent Technologies Inc filed Critical Lucent Technologies Inc
Publication of BR9612624A publication Critical patent/BR9612624A/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

<B>SINTETIZADOR DE FALA TENDO BASE DE DADOS DE ELEMENTO ACúSTICO<D> Um método de síntese de fala emprega uma base de dados de elemento acústico que é estabelecida a partir de seq³ências fonéticas ocorridas em um intervalo de um sinal de fala. ao estabelecer a base de dados, trajetórias são determinadas (220) para cada uma das seq³ências fonéticas contendo um segmento fonético que corresponde a um fonema particular (210). Uma região de tolerância é então identificada baseada em uma concentração de trajetórias que correspondem às seq³ências de fonemas diferentes (230). Os elementos acústicos para a base de dados (260) são formados por porções das seq³ências fonéticas ao identificar pontos de corte (250) nas seq³ências fonéticas que correspondem aos pontos de tempo ao longo das trajetórias respectivas próximas à região de tolerância (240). Desta maneira, é possível concatenar os elementos acústicos tendo um fonema de junção comum, de modo que descontinuidades perceptíveis nos fonemas de junção sejam minimizadas. Métodos computacionalmente simples e rápidos para determinar a região de tolerância são também expostos.<B> SPEAK SYNTHESIZER HAVING ACOUSTIC ELEMENT DATABASE <D> A speech synthesis method employs an acoustic element database that is established from phonetic sequences occurring in an interval of a speech signal. when establishing the database, trajectories are determined (220) for each of the phonetic sequences containing a phonetic segment that corresponds to a particular phoneme (210). A tolerance region is then identified based on a concentration of trajectories that correspond to the different phoneme sequences (230). The acoustic elements for the database (260) are formed by portions of the phonetic sequences when identifying cutoff points (250) in the phonetic sequences that correspond to the time points along the respective trajectories close to the tolerance region (240). In this way, it is possible to concatenate the acoustic elements having a common junction phoneme, so that discernible discontinuities in the junction phonemes are minimized. Computationally simple and fast methods for determining the tolerance region are also exposed.

BR9612624-8A 1995-08-16 1996-08-02 Speech synthesizer having acoustic element database BR9612624A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US08/515,887 US5751907A (en) 1995-08-16 1995-08-16 Speech synthesizer having an acoustic element database
PCT/US1996/012628 WO1997007500A1 (en) 1995-08-16 1996-08-02 Speech synthesizer having an acoustic element database

Publications (1)

Publication Number Publication Date
BR9612624A true BR9612624A (en) 2000-05-23

Family

ID=24053185

Family Applications (1)

Application Number Title Priority Date Filing Date
BR9612624-8A BR9612624A (en) 1995-08-16 1996-08-02 Speech synthesizer having acoustic element database

Country Status (10)

Country Link
US (1) US5751907A (en)
EP (1) EP0845139B1 (en)
JP (1) JP3340748B2 (en)
AU (1) AU6645096A (en)
BR (1) BR9612624A (en)
CA (1) CA2222582C (en)
DE (1) DE69627865T2 (en)
MX (1) MX9801086A (en)
TW (1) TW305990B (en)
WO (1) WO1997007500A1 (en)

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7251314B2 (en) * 1994-10-18 2007-07-31 Lucent Technologies Voice message transfer between a sender and a receiver
JP3349905B2 (en) * 1996-12-10 2002-11-25 松下電器産業株式会社 Voice synthesis method and apparatus
JP2000075878A (en) * 1998-08-31 2000-03-14 Canon Inc Device and method for voice synthesis and storage medium
US6202049B1 (en) 1999-03-09 2001-03-13 Matsushita Electric Industrial Co., Ltd. Identification of unit overlap regions for concatenative speech synthesis system
US6178402B1 (en) * 1999-04-29 2001-01-23 Motorola, Inc. Method, apparatus and system for generating acoustic parameters in a text-to-speech system using a neural network
US7369994B1 (en) 1999-04-30 2008-05-06 At&T Corp. Methods and apparatus for rapid acoustic unit selection from a large speech corpus
US6618699B1 (en) 1999-08-30 2003-09-09 Lucent Technologies Inc. Formant tracking based on phoneme information
US7149690B2 (en) 1999-09-09 2006-12-12 Lucent Technologies Inc. Method and apparatus for interactive language instruction
US6725190B1 (en) * 1999-11-02 2004-04-20 International Business Machines Corporation Method and system for speech reconstruction from speech recognition features, pitch and voicing with resampled basis functions providing reconstruction of the spectral envelope
US7392185B2 (en) * 1999-11-12 2008-06-24 Phoenix Solutions, Inc. Speech based learning/training system using semantic decoding
US7050977B1 (en) 1999-11-12 2006-05-23 Phoenix Solutions, Inc. Speech-enabled server for internet website and method
US9076448B2 (en) * 1999-11-12 2015-07-07 Nuance Communications, Inc. Distributed real time speech recognition system
US7725307B2 (en) * 1999-11-12 2010-05-25 Phoenix Solutions, Inc. Query engine for processing voice based queries including semantic decoding
US7400712B2 (en) * 2001-01-18 2008-07-15 Lucent Technologies Inc. Network provided information using text-to-speech and speech recognition and text or speech activated network control sequences for complimentary feature access
US6625576B2 (en) 2001-01-29 2003-09-23 Lucent Technologies Inc. Method and apparatus for performing text-to-speech conversion in a client/server environment
US7010488B2 (en) * 2002-05-09 2006-03-07 Oregon Health & Science University System and method for compressing concatenative acoustic inventories for speech synthesis
US20040030555A1 (en) * 2002-08-12 2004-02-12 Oregon Health & Science University System and method for concatenating acoustic contours for speech synthesis
US7542903B2 (en) * 2004-02-18 2009-06-02 Fuji Xerox Co., Ltd. Systems and methods for determining predictive models of discourse functions
US20050187772A1 (en) * 2004-02-25 2005-08-25 Fuji Xerox Co., Ltd. Systems and methods for synthesizing speech using discourse function level prosodic features
JP4878538B2 (en) * 2006-10-24 2012-02-15 株式会社日立製作所 Speech synthesizer
US8103506B1 (en) * 2007-09-20 2012-01-24 United Services Automobile Association Free text matching system and method
JP2011180416A (en) * 2010-03-02 2011-09-15 Denso Corp Voice synthesis device, voice synthesis method and car navigation system

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3704345A (en) * 1971-03-19 1972-11-28 Bell Telephone Labor Inc Conversion of printed text into synthetic speech
BG24190A1 (en) * 1976-09-08 1978-01-10 Antonov Method of synthesis of speech and device for effecting same
US4692941A (en) * 1984-04-10 1987-09-08 First Byte Real-time text-to-speech conversion system
US4831654A (en) * 1985-09-09 1989-05-16 Wang Laboratories, Inc. Apparatus for making and editing dictionary entries in a text to speech conversion system
WO1987002816A1 (en) * 1985-10-30 1987-05-07 Central Institute For The Deaf Speech processing apparatus and methods
US4820059A (en) * 1985-10-30 1989-04-11 Central Institute For The Deaf Speech processing apparatus and methods
US4829580A (en) * 1986-03-26 1989-05-09 Telephone And Telegraph Company, At&T Bell Laboratories Text analysis system with letter sequence recognition and speech stress assignment arrangement
GB2207027B (en) * 1987-07-15 1992-01-08 Matsushita Electric Works Ltd Voice encoding and composing system
US4979216A (en) * 1989-02-17 1990-12-18 Malsheen Bathsheba J Text to speech synthesis system and method using context dependent vowel allophones
JPH031200A (en) * 1989-05-29 1991-01-07 Nec Corp Regulation type voice synthesizing device
US5235669A (en) * 1990-06-29 1993-08-10 At&T Laboratories Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec
US5283833A (en) * 1991-09-19 1994-02-01 At&T Bell Laboratories Method and apparatus for speech processing using morphology and rhyming
JPH05181491A (en) * 1991-12-30 1993-07-23 Sony Corp Speech synthesizing device
US5490234A (en) * 1993-01-21 1996-02-06 Apple Computer, Inc. Waveform blending technique for text-to-speech system

Also Published As

Publication number Publication date
EP0845139B1 (en) 2003-05-02
US5751907A (en) 1998-05-12
DE69627865D1 (en) 2003-06-05
DE69627865T2 (en) 2004-02-19
MX9801086A (en) 1998-04-30
EP0845139A1 (en) 1998-06-03
CA2222582A1 (en) 1997-02-27
CA2222582C (en) 2001-09-11
EP0845139A4 (en) 1999-10-20
WO1997007500A1 (en) 1997-02-27
JP3340748B2 (en) 2002-11-05
TW305990B (en) 1997-05-21
AU6645096A (en) 1997-03-12
JP2000509157A (en) 2000-07-18

Similar Documents

Publication Publication Date Title
BR9612624A (en) Speech synthesizer having acoustic element database
Ney et al. Improvements in beam search for 10000-word continuous speech recognition
CA2313526A1 (en) Apparatus and methods for detecting emotions
EP0387602A3 (en) Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system
JPS57158900A (en) Text voice synthesizer
FI955025A (en) Method and apparatus for detecting and developing transient situations in audible signals
Wightman et al. The aligner: Text-to-speech alignment using Markov models
Gordon Induction of rate-dependent processing by coarse-grained aspects of speech
SE9402284D0 (en) Method and apparatus for adapting a speech recognition equipment for dialectal variations in a language
Larreur et al. Linguistic and prosodic processing for a text-to-speech synthesis system.
Post French tonal structures
EP1010170A4 (en) Method and system for automatic text-independent grading of pronunciation for language instruction
JP2940835B2 (en) Pitch frequency difference feature extraction method
Xu et al. Intonation perception of low-pass filtered speech in Mandarin and Cantonese
Yoshioka et al. Laryngeal adjustments in Japanese voiceless sound production
Campbell Durational cues to prominence and grouping
Gustafson Transcribing names with foreign origin in the ONOMASTICA project
Tatham et al. Syllable reconstruction in concatenated waveform speech synthesis
Dilley et al. Ambiguity in prominence perception in spoken utterances of American English
Tseng A Linguistic Analysis of Repair Signals in Co-operative Spoken Dialogues
Carlson et al. Segmental intelligibility of synthetic and natural speech in real and nonsense words.
JPH09244681A (en) Method and device for speech segmentation
IT1179093B (en) PROCEDURE AND DEVICE FOR RECOGNITION WITHOUT PREVENTIVE TRAINING OF WORDS RELATED TO SMALL VOCABULARS
Sirigos et al. A comparison of several speech parameters for speaker independent speech recognition and speaker recognition.
Dent Voice onset time of spontaneously spoken Spanish voiceless stops

Legal Events

Date Code Title Description
FA10 Dismissal: dismissal - article 33 of industrial property law
B15K Others concerning applications: alteration of classification

Ipc: G10L 13/02 (2013.01)