ATE183010T1 - Auf mikrosegmenten basierendes sprachsyntheseverfahren - Google Patents

Auf mikrosegmenten basierendes sprachsyntheseverfahren

Info

Publication number
ATE183010T1
ATE183010T1 AT97917259T AT97917259T ATE183010T1 AT E183010 T1 ATE183010 T1 AT E183010T1 AT 97917259 T AT97917259 T AT 97917259T AT 97917259 T AT97917259 T AT 97917259T AT E183010 T1 ATE183010 T1 AT E183010T1
Authority
AT
Austria
Prior art keywords
vowel
segments
speech
output
phoneme
Prior art date
Application number
AT97917259T
Other languages
English (en)
Inventor
William Barry
Ralf Benzmueller
Andreas Luening
Original Assignee
Data Software Gmbh G
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Data Software Gmbh G filed Critical Data Software Gmbh G
Application granted granted Critical
Publication of ATE183010T1 publication Critical patent/ATE183010T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules
    • G10L13/07Concatenation rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
AT97917259T 1996-03-14 1997-03-08 Auf mikrosegmenten basierendes sprachsyntheseverfahren ATE183010T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
DE19610019A DE19610019C2 (de) 1996-03-14 1996-03-14 Digitales Sprachsyntheseverfahren

Publications (1)

Publication Number Publication Date
ATE183010T1 true ATE183010T1 (de) 1999-08-15

Family

ID=7788258

Family Applications (1)

Application Number Title Priority Date Filing Date
AT97917259T ATE183010T1 (de) 1996-03-14 1997-03-08 Auf mikrosegmenten basierendes sprachsyntheseverfahren

Country Status (5)

Country Link
US (1) US6308156B1 (de)
EP (1) EP0886853B1 (de)
AT (1) ATE183010T1 (de)
DE (2) DE19610019C2 (de)
WO (1) WO1997034291A1 (de)

Families Citing this family (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19841683A1 (de) * 1998-09-11 2000-05-11 Hans Kull Vorrichtung und Verfahren zur digitalen Sprachbearbeitung
US6928404B1 (en) * 1999-03-17 2005-08-09 International Business Machines Corporation System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies
US7369994B1 (en) * 1999-04-30 2008-05-06 At&T Corp. Methods and apparatus for rapid acoustic unit selection from a large speech corpus
DE19939947C2 (de) * 1999-08-23 2002-01-24 Data Software Ag G Digitales Sprachsyntheseverfahren mit Intonationsnachbildung
US20030191625A1 (en) * 1999-11-05 2003-10-09 Gorin Allen Louis Method and system for creating a named entity language model
US7085720B1 (en) * 1999-11-05 2006-08-01 At & T Corp. Method for task classification using morphemes
US8392188B1 (en) 1999-11-05 2013-03-05 At&T Intellectual Property Ii, L.P. Method and system for building a phonotactic model for domain independent speech recognition
US7286984B1 (en) 1999-11-05 2007-10-23 At&T Corp. Method and system for automatically detecting morphemes in a task classification system using lattices
US7213027B1 (en) 2000-03-21 2007-05-01 Aol Llc System and method for the transformation and canonicalization of semantically structured data
JP2002221980A (ja) * 2001-01-25 2002-08-09 Oki Electric Ind Co Ltd テキスト音声変換装置
US20040030555A1 (en) * 2002-08-12 2004-02-12 Oregon Health & Science University System and method for concatenating acoustic contours for speech synthesis
US8768701B2 (en) * 2003-01-24 2014-07-01 Nuance Communications, Inc. Prosodic mimic method and apparatus
US7308407B2 (en) * 2003-03-03 2007-12-11 International Business Machines Corporation Method and system for generating natural sounding concatenative synthetic speech
JP2005031259A (ja) * 2003-07-09 2005-02-03 Canon Inc 自然言語処理方法
US20050125236A1 (en) * 2003-12-08 2005-06-09 International Business Machines Corporation Automatic capture of intonation cues in audio segments for speech applications
JP4265501B2 (ja) * 2004-07-15 2009-05-20 ヤマハ株式会社 音声合成装置およびプログラム
DE102005002474A1 (de) 2005-01-19 2006-07-27 Obstfelder, Sigrid Handy und Verfahren zur Spracheingabe in ein solches sowie Spracheingabebaustein und Verfahren zur Spracheingabe in einen solchen
US8924212B1 (en) * 2005-08-26 2014-12-30 At&T Intellectual Property Ii, L.P. System and method for robust access and entry to large structured data using voice form-filling
JP2008225254A (ja) * 2007-03-14 2008-09-25 Canon Inc 音声合成装置及び方法並びにプログラム
JP5119700B2 (ja) * 2007-03-20 2013-01-16 富士通株式会社 韻律修正装置、韻律修正方法、および、韻律修正プログラム
US7953600B2 (en) * 2007-04-24 2011-05-31 Novaspeech Llc System and method for hybrid speech synthesis
WO2008142836A1 (ja) * 2007-05-14 2008-11-27 Panasonic Corporation 声質変換装置および声質変換方法
CN101312038B (zh) * 2007-05-25 2012-01-04 纽昂斯通讯公司 用于合成语音的方法
US8321222B2 (en) * 2007-08-14 2012-11-27 Nuance Communications, Inc. Synthesis by generation and concatenation of multi-form segments
JP6047922B2 (ja) 2011-06-01 2016-12-21 ヤマハ株式会社 音声合成装置および音声合成方法
JP5914996B2 (ja) * 2011-06-07 2016-05-11 ヤマハ株式会社 音声合成装置およびプログラム
US9368104B2 (en) 2012-04-30 2016-06-14 Src, Inc. System and method for synthesizing human speech using multiple speakers and context
PL401371A1 (pl) * 2012-10-26 2014-04-28 Ivona Software Spółka Z Ograniczoną Odpowiedzialnością Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowę
PL401372A1 (pl) * 2012-10-26 2014-04-28 Ivona Software Spółka Z Ograniczoną Odpowiedzialnością Hybrydowa kompresja danych głosowych w systemach zamiany tekstu na mowę
JP2015014665A (ja) * 2013-07-04 2015-01-22 セイコーエプソン株式会社 音声認識装置及び方法、並びに、半導体集積回路装置
DE102013219828B4 (de) * 2013-09-30 2019-05-02 Continental Automotive Gmbh Verfahren zum Phonetisieren von textenthaltenden Datensätzen mit mehreren Datensatzteilen und sprachgesteuerte Benutzerschnittstelle
RU2692051C1 (ru) 2017-12-29 2019-06-19 Общество С Ограниченной Ответственностью "Яндекс" Способ и система для синтеза речи из текста
FR3087566B1 (fr) * 2018-10-18 2021-07-30 A I O Dispositif de suivi des mouvements et/ou des efforts d’une personne, methode d’apprentissage dudit dispositif et procede d’analyse des mouvements et/ou des efforts d’une personne
US11302300B2 (en) * 2019-11-19 2022-04-12 Applications Technology (Apptek), Llc Method and apparatus for forced duration in neural speech synthesis

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
BG24190A1 (en) * 1976-09-08 1978-01-10 Antonov Method of synthesis of speech and device for effecting same
JPS5919358B2 (ja) * 1978-12-11 1984-05-04 株式会社日立製作所 音声内容伝送方式
JPH0642158B2 (ja) * 1983-11-01 1994-06-01 日本電気株式会社 音声合成装置
US4692941A (en) * 1984-04-10 1987-09-08 First Byte Real-time text-to-speech conversion system
DE69028072T2 (de) * 1989-11-06 1997-01-09 Canon Kk Verfahren und Einrichtung zur Sprachsynthese
KR940002854B1 (ko) * 1991-11-06 1994-04-04 한국전기통신공사 음성 합성시스팀의 음성단편 코딩 및 그의 피치조절 방법과 그의 유성음 합성장치
JP3083640B2 (ja) * 1992-05-28 2000-09-04 株式会社東芝 音声合成方法および装置
US5878396A (en) * 1993-01-21 1999-03-02 Apple Computer, Inc. Method and apparatus for synthetic speech in facial animation
EP0681729B1 (de) 1993-01-30 1999-09-08 Korea Telecommunications Authority System zur sprachsynthese und spracherkennung
JP3085631B2 (ja) * 1994-10-19 2000-09-11 日本アイ・ビー・エム株式会社 音声合成方法及びシステム
US5864812A (en) * 1994-12-06 1999-01-26 Matsushita Electric Industrial Co., Ltd. Speech synthesizing method and apparatus for combining natural speech segments and synthesized speech segments

Also Published As

Publication number Publication date
EP0886853B1 (de) 1999-08-04
WO1997034291A1 (de) 1997-09-18
DE59700315D1 (de) 1999-09-09
US6308156B1 (en) 2001-10-23
DE19610019A1 (de) 1997-09-18
EP0886853A1 (de) 1998-12-30
DE19610019C2 (de) 1999-10-28

Similar Documents

Publication Publication Date Title
ATE183010T1 (de) Auf mikrosegmenten basierendes sprachsyntheseverfahren
US8566099B2 (en) Tabulating triphone sequences by 5-phoneme contexts for speech synthesis
CA2351842C (en) Synthesis-based pre-selection of suitable units for concatenative speech
Cosi et al. Festival speaks italian!
EP1629464A4 (de) Spracherkennungssystem und verfahren auf phonetischer basis
DE68928097D1 (de) Spracherkennungssystem
MX9505299A (es) Sistemas, metodos y articulos de fabricacion para realizar la hipotesizacion de n-cadenas optimas de alta resolucion.
Traber SVOX: the impementation of a text-to-speech system for german
MXPA02005387A (es) Proceso y dispositivo para reconocimiento de voz que utiliza modelos de lenguaje desarticulados.
CN1032391C (zh) 基于波形编辑的汉语文字-语音转换方法及***
KR100373329B1 (ko) 음운환경과 묵음구간 길이를 이용한 텍스트/음성변환 장치 및그 방법
Waseem et al. Speech synthesis system for Indian accent using festvox
JPH0887297A (ja) 音声合成システム
JPS595916B2 (ja) 音声分折合成装置
Pitrelli et al. Expressive speech synthesis using American English ToBI: questions and contrastive emphasis
Baudoin et al. Corpus based very low bit rate speech coding
RU2298234C2 (ru) Способ компиляционного фонемного синтеза русской речи и устройство для его реализации
Maghbouleh A logistic regression model for detecting prominences
WO2000026901A3 (en) Performing spoken recorded actions
Law et al. Cantonese text-to-speech synthesis using sub-syllable units.
JPH11231899A (ja) 音声・動画像合成装置及び音声・動画像データベース
Zhang et al. Speech recognition based on syllable recovery.
Narupiyakul et al. A stochastic knowledge-based Thai text-to-speech system
Li et al. Corpus design and annotation for speech synthesis and recognition
Lyudovyk et al. Unit Selection Speech Synthesis Using Phonetic-Prosodic Description of Speech Databases

Legal Events

Date Code Title Description
REN Ceased due to non-payment of the annual fee