DE69629763D1 - Verfahren und Vorrichtung zur Ermittlung von Triphone Hidden Markov Modellen (HMM) - Google Patents

Verfahren und Vorrichtung zur Ermittlung von Triphone Hidden Markov Modellen (HMM)

Info

Publication number: DE69629763D1
Authority: DE; Germany
Prior art keywords: triphone; hmm; determining; hidden markov; markov models
Prior art date: 1995-06-19
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Expired - Fee Related

Application number

DE69629763T

Other languages

English (en)

Other versions

DE69629763T2 (de

Inventor

Yasuhiro Komori

Yasunori Ohora

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

Canon Inc

Original Assignee

Canon Inc

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

1995-06-19

Filing date

1996-06-18

Publication date

2003-10-09

1996-06-18 Application filed by Canon Inc filed Critical Canon Inc

2003-10-09 Application granted granted Critical

2003-10-09 Publication of DE69629763D1 publication Critical patent/DE69629763D1/de

2004-07-15 Publication of DE69629763T2 publication Critical patent/DE69629763T2/de

2016-06-19 Anticipated expiration legal-status Critical

Status Expired - Fee Related legal-status Critical Current

Links

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/022—Demisyllables, biphones or triphones being the recognition units
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0631—Creating reference templates; Clustering

Landscapes

Engineering & Computer Science (AREA)
Physics & Mathematics (AREA)
Computational Linguistics (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Probability & Statistics with Applications (AREA)
Artificial Intelligence (AREA)
Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

DE69629763T 1995-06-19 1996-06-18 Verfahren und Vorrichtung zur Ermittlung von Triphone Hidden Markov Modellen (HMM) Expired - Fee Related DE69629763T2 (de)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
JP15148995		1995-06-19
JP15148995A JP3453456B2 (ja)	1995-06-19	1995-06-19	状態共有モデルの設計方法及び装置ならびにその状態共有モデルを用いた音声認識方法および装置

Publications (2)

Publication Number	Publication Date
DE69629763D1 true DE69629763D1 (de)	2003-10-09
DE69629763T2 DE69629763T2 (de)	2004-07-15

Family

ID=15519621

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
DE69629763T Expired - Fee Related DE69629763T2 (de)	1995-06-19	1996-06-18	Verfahren und Vorrichtung zur Ermittlung von Triphone Hidden Markov Modellen (HMM)

Country Status (4)

Country	Link
US (1)	US5812975A (de)
EP (1)	EP0750293B1 (de)
JP (1)	JP3453456B2 (de)
DE (1)	DE69629763T2 (de)

Families Citing this family (46)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
JPH10161692A (ja) *	1996-12-03	1998-06-19	Canon Inc	音声認識装置及び音声認識方法
JPH10187195A (ja) *	1996-12-26	1998-07-14	Canon Inc	音声合成方法および装置
JP3962445B2 (ja)	1997-03-13	2007-08-22	キヤノン株式会社	音声処理方法及び装置
US6807537B1 (en) *	1997-12-04	2004-10-19	Microsoft Corporation	Mixtures of Bayesian networks
US6317712B1 (en) *	1998-02-03	2001-11-13	Texas Instruments Incorporated	Method of phonetic modeling using acoustic decision tree
US6073096A (en) *	1998-02-04	2000-06-06	International Business Machines Corporation	Speaker adaptation system and method based on class-specific pre-clustering training speakers
US6263309B1 (en)	1998-04-30	2001-07-17	Matsushita Electric Industrial Co., Ltd.	Maximum likelihood method for finding an adapted speaker model in eigenvoice space
US6343267B1 (en)	1998-04-30	2002-01-29	Matsushita Electric Industrial Co., Ltd.	Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques
US6405159B2 (en)	1998-06-03	2002-06-11	Sbc Technology Resources, Inc.	Method for categorizing, describing and modeling types of system users
JP2000047696A (ja)	1998-07-29	2000-02-18	Canon Inc	情報処理方法及び装置、その記憶媒体
WO2000031723A1 (en) *	1998-11-25	2000-06-02	Sony Electronics, Inc.	Method and apparatus for very large vocabulary isolated word recognition in a parameter sharing speech recognition system
US7086007B1 (en)	1999-05-27	2006-08-01	Sbc Technology Resources, Inc.	Method for integrating user models to interface design
JP3969908B2 (ja)	1999-09-14	2007-09-05	キヤノン株式会社	音声入力端末器、音声認識装置、音声通信システム及び音声通信方法
KR100434538B1 (ko) *	1999-11-17	2004-06-05	삼성전자주식회사	음성의 천이 구간 검출 장치, 그 방법 및 천이 구간의음성 합성 방법
US6571208B1 (en)	1999-11-29	2003-05-27	Matsushita Electric Industrial Co., Ltd.	Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training
US6526379B1 (en) *	1999-11-29	2003-02-25	Matsushita Electric Industrial Co., Ltd.	Discriminative clustering methods for automatic speech recognition
US6778643B1 (en)	2000-03-21	2004-08-17	Sbc Technology Resources, Inc.	Interface and method of designing an interface
US20040006473A1 (en)	2002-07-02	2004-01-08	Sbc Technology Resources, Inc.	Method and system for automated categorization of statements
JP3728172B2 (ja)	2000-03-31	2005-12-21	キヤノン株式会社	音声合成方法および装置
US7039588B2 (en) *	2000-03-31	2006-05-02	Canon Kabushiki Kaisha	Synthesis unit selection apparatus and method, and storage medium
JP4632384B2 (ja) *	2000-03-31	2011-02-16	キヤノン株式会社	音声情報処理装置及びその方法と記憶媒体
JP2001282278A (ja) *	2000-03-31	2001-10-12	Canon Inc	音声情報処理装置及びその方法と記憶媒体
JP3728177B2 (ja) *	2000-05-24	2005-12-21	キヤノン株式会社	音声処理システム、装置、方法及び記憶媒体
US6910000B1 (en) *	2000-06-02	2005-06-21	Mitsubishi Electric Research Labs, Inc.	Generalized belief propagation for probabilistic systems
US7024350B2 (en) *	2000-07-20	2006-04-04	Microsoft Corporation	Compact easily parseable binary format for a context-free grammer
WO2002027535A1 (en) *	2000-09-28	2002-04-04	Intel Corporation	Method and system for expanding a word graph to a phone graph based on a cross-word acoustical model to improve continuous speech recognition
US6980954B1 (en)	2000-09-30	2005-12-27	Intel Corporation	Search method based on single triphone tree for large vocabulary continuous speech recognizer
US7006969B2 (en)	2000-11-02	2006-02-28	At&T Corp.	System and method of pattern recognition in very high-dimensional space
US7369993B1 (en)	2000-11-02	2008-05-06	At&T Corp.	System and method of pattern recognition in very high-dimensional space
US6801656B1 (en)	2000-11-06	2004-10-05	Koninklijke Philips Electronics N.V.	Method and apparatus for determining a number of states for a hidden Markov model in a signal processing system
US7065201B2 (en)	2001-07-31	2006-06-20	Sbc Technology Resources, Inc.	Telephone call processing in an interactive voice response call management system
US7305070B2 (en)	2002-01-30	2007-12-04	At&T Labs, Inc.	Sequential presentation of long instructions in an interactive voice response system
US6914975B2 (en)	2002-02-21	2005-07-05	Sbc Properties, L.P.	Interactive dialog-based training method
US7266497B2 (en) *	2002-03-29	2007-09-04	At&T Corp.	Automatic segmentation in speech synthesis
US7027586B2 (en)	2003-12-18	2006-04-11	Sbc Knowledge Ventures, L.P.	Intelligently routing customer communications
JP4587160B2 (ja) *	2004-03-26	2010-11-24	キヤノン株式会社	信号処理装置および方法
JP4541781B2 (ja) *	2004-06-29	2010-09-08	キヤノン株式会社	音声認識装置および方法
US7643686B2 (en) *	2004-11-17	2010-01-05	Eastman Kodak Company	Multi-tiered image clustering by event
US7634406B2 (en) *	2004-12-10	2009-12-15	Microsoft Corporation	System and method for identifying semantic intent from acoustic information
US7805301B2 (en) *	2005-07-01	2010-09-28	Microsoft Corporation	Covariance estimation for pattern recognition
US20070213988A1 (en) *	2006-03-10	2007-09-13	International Business Machines Corporation	Using speech processing technologies for verification sequence instances
US20070260459A1 (en) *	2006-05-04	2007-11-08	Texas Instruments, Incorporated	System and method for generating heterogeneously tied gaussian mixture models for automatic speech recognition acoustic models
US8234116B2 (en) *	2006-08-22	2012-07-31	Microsoft Corporation	Calculating cost measures between HMM acoustic models
US20080059190A1 (en) *	2006-08-22	2008-03-06	Microsoft Corporation	Speech unit selection using HMM acoustic models
US8244534B2 (en) *	2007-08-20	2012-08-14	Microsoft Corporation	HMM-based bilingual (Mandarin-English) TTS techniques
US8060360B2 (en) *	2007-10-30	2011-11-15	Microsoft Corporation	Word-dependent transition models in HMM based word alignment for statistical machine translation

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US4156868A (en) *	1977-05-05	1979-05-29	Bell Telephone Laboratories, Incorporated	Syntactic word recognizer
US5165007A (en) *	1985-02-01	1992-11-17	International Business Machines Corporation	Feneme-based Markov models for words
JPH06105394B2 (ja) *	1986-03-19	1994-12-21	株式会社東芝	音声認識方式
US4918731A (en) *	1987-07-17	1990-04-17	Ricoh Company, Ltd.	Speech recognition method and apparatus
US4817156A (en) *	1987-08-10	1989-03-28	International Business Machines Corporation	Rapidly training a speech recognizer to a subsequent speaker given training data of a reference speaker
JPH01102599A (ja) *	1987-10-12	1989-04-20	Internatl Business Mach Corp <Ibm>	音声認識方法
JPH0296800A (ja) *	1988-10-03	1990-04-09	Nec Corp	連続音声認識装置
JPH02239292A (ja) *	1989-03-13	1990-09-21	Canon Inc	音声合成装置
US5073939A (en) *	1989-06-08	1991-12-17	Itt Corporation	Dynamic time warping (DTW) apparatus for use in speech recognition systems
EP0427485B1 (de) *	1989-11-06	1996-08-14	Canon Kabushiki Kaisha	Verfahren und Einrichtung zur Sprachsynthese
JP2964507B2 (ja) *	1989-12-12	1999-10-18	松下電器産業株式会社	Ｈｍｍ装置
US5444817A (en) *	1991-10-02	1995-08-22	Matsushita Electric Industrial Co., Ltd.	Speech recognizing apparatus using the predicted duration of syllables
JPH05257492A (ja) *	1992-03-13	1993-10-08	Toshiba Corp	音声認識方式
JP2795058B2 (ja) *	1992-06-03	1998-09-10	松下電器産業株式会社	時系列信号処理装置
US5535305A (en) *	1992-12-31	1996-07-09	Apple Computer, Inc.	Sub-partitioned vector quantization of probability density functions
US5515475A (en) *	1993-06-24	1996-05-07	Northern Telecom Limited	Speech recognition method using a two-pass search
US5621859A (en) *	1994-01-19	1997-04-15	Bbn Corporation	Single tree method for grammar directed, very large vocabulary speech recognizer
US5615286A (en) *	1995-05-05	1997-03-25	Bell Communications Research, Inc.	Method for determining a most likely sequence of states

1995
- 1995-06-19 JP JP15148995A patent/JP3453456B2/ja not_active Expired - Fee Related
1996
- 1996-06-18 EP EP96304526A patent/EP0750293B1/de not_active Expired - Lifetime
- 1996-06-18 US US08/665,503 patent/US5812975A/en not_active Expired - Fee Related
- 1996-06-18 DE DE69629763T patent/DE69629763T2/de not_active Expired - Fee Related

Also Published As

Publication number	Publication date
EP0750293B1 (de)	2003-09-03
JP3453456B2 (ja)	2003-10-06
JPH096386A (ja)	1997-01-10
DE69629763T2 (de)	2004-07-15
EP0750293A2 (de)	1996-12-27
US5812975A (en)	1998-09-22
EP0750293A3 (de)	1997-10-08

Legal Events

Date	Code	Title	Description
2004-09-30	8364	No opposition during term of opposition
2007-04-19	8339	Ceased/non-payment of the annual fee

Publication	Publication Date	Title
DE69629763D1 (de)	2003-10-09	Verfahren und Vorrichtung zur Ermittlung von Triphone Hidden Markov Modellen (HMM)
DE69632901D1 (de)	2004-08-19	Vorrichtung und Verfahren zur Sprachsynthese
DE69635500D1 (de)	2006-01-05	Verfahren und Vorrichtung zur Erkennung eines nahen Sprachsignals
DE69432943T2 (de)	2003-12-24	Verfahren und Vorrichtung zur Sprachdetektion
DE69625950T2 (de)	2003-12-24	Verfahren und Vorrichtung zur Spracherkennung und Übersetzungssystem
DE69831991D1 (de)	2005-12-01	Verfahren und Vorrichtung zur Sprachdetektion
DE69828141D1 (de)	2005-01-20	Verfahren und Vorrichtung zur Spracherkennung
DE69518705T2 (de)	2001-01-04	Verfahren und Vorrichtung zur Spracherkennung
DE69524829T2 (de)	2002-06-20	Verfahren und Vorrichtung zur Spracherkennung
DE69806557D1 (de)	2002-08-22	Verfahren und Vorrichtung zur Spracherkennung
DE69717899T2 (de)	2003-08-21	Verfahren und Vorrichtung zur Spracherkennung
DE69726235D1 (de)	2003-12-24	Verfahren und Vorrichtung zur Spracherkennung
DE69623248D1 (de)	2002-10-02	Verfahren und Vorrichtung zur Entfernungsmessung
DE59707384D1 (de)	2002-07-11	Verfahren und Vorrichtung zur Spracherkennung
DE69830017D1 (de)	2005-06-09	Verfahren und Vorrichtung zur Spracherkennung
DE69519820T2 (de)	2001-07-19	Verfahren und Vorrichtung zur Sprachsynthese
DE69607913D1 (de)	2000-05-31	Verfahren und vorrichtung zur spracherkennung auf der basis neuer wortmodelle
DE69623879T2 (de)	2003-05-08	Vorrichtung und Verfahren zur Volumenermittlung
DE69523998D1 (de)	2002-01-03	Verfahren und Vorrichtung zur Sprachsynthese
DE69710525D1 (de)	2002-03-28	Verfahren und Vorrichtung zur Sprachsynthese
DE69612958D1 (de)	2001-06-28	Verfahren und vorrichtung zur resynthetisierung eines sprachsignals
DE69937854D1 (de)	2008-02-14	Verfahren und Vorrichtung zur Spracherkennung unter Verwendung von phonetischen Transkriptionen
DE69527703D1 (de)	2002-09-12	Verfahren und Vorrichtung zur Signalauswahl und Fehlererkennung
DE69517829T2 (de)	2001-03-08	Vorrichtung und Verfahren zur Spracherkennung
DE69519818D1 (de)	2001-02-15	Verfahren und Vorrichtung zur Sprachsynthese