DE69519840T2 - Einrichtung und Verfahren zur Spracherkennung - Google Patents
Einrichtung und Verfahren zur SpracherkennungInfo
- Publication number
- DE69519840T2 DE69519840T2 DE69519840T DE69519840T DE69519840T2 DE 69519840 T2 DE69519840 T2 DE 69519840T2 DE 69519840 T DE69519840 T DE 69519840T DE 69519840 T DE69519840 T DE 69519840T DE 69519840 T2 DE69519840 T2 DE 69519840T2
- Authority
- DE
- Germany
- Prior art keywords
- voice recognition
- recognition device
- voice
- recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Probability & Statistics with Applications (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP24400594A JP3581401B2 (ja) | 1994-10-07 | 1994-10-07 | 音声認識方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
DE69519840D1 DE69519840D1 (de) | 2001-02-15 |
DE69519840T2 true DE69519840T2 (de) | 2001-06-28 |
Family
ID=17112303
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE69519840T Expired - Lifetime DE69519840T2 (de) | 1994-10-07 | 1995-09-29 | Einrichtung und Verfahren zur Spracherkennung |
Country Status (4)
Country | Link |
---|---|
US (1) | US5787396A (de) |
EP (1) | EP0706171B1 (de) |
JP (1) | JP3581401B2 (de) |
DE (1) | DE69519840T2 (de) |
Families Citing this family (41)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5937384A (en) * | 1996-05-01 | 1999-08-10 | Microsoft Corporation | Method and system for speech recognition using continuous density hidden Markov models |
US5963903A (en) * | 1996-06-28 | 1999-10-05 | Microsoft Corporation | Method and system for dynamically adjusted training for speech recognition |
JPH10161692A (ja) * | 1996-12-03 | 1998-06-19 | Canon Inc | 音声認識装置及び音声認識方法 |
JPH10187195A (ja) * | 1996-12-26 | 1998-07-14 | Canon Inc | 音声合成方法および装置 |
US5857173A (en) * | 1997-01-30 | 1999-01-05 | Motorola, Inc. | Pronunciation measurement device and method |
JP3962445B2 (ja) | 1997-03-13 | 2007-08-22 | キヤノン株式会社 | 音声処理方法及び装置 |
JPH10254486A (ja) | 1997-03-13 | 1998-09-25 | Canon Inc | 音声認識装置および方法 |
US6073098A (en) * | 1997-11-21 | 2000-06-06 | At&T Corporation | Method and apparatus for generating deterministic approximate weighted finite-state automata |
JP3884856B2 (ja) | 1998-03-09 | 2007-02-21 | キヤノン株式会社 | 音声合成用データ作成装置、音声合成装置及びそれらの方法、コンピュータ可読メモリ |
JP3902860B2 (ja) * | 1998-03-09 | 2007-04-11 | キヤノン株式会社 | 音声合成制御装置及びその制御方法、コンピュータ可読メモリ |
EP0953971A1 (de) * | 1998-05-01 | 1999-11-03 | Entropic Cambridge Research Laboratory Ltd. | System und Verfahren zur Spracherkennung |
US6061653A (en) * | 1998-07-14 | 2000-05-09 | Alcatel Usa Sourcing, L.P. | Speech recognition system using shared speech models for multiple recognition processes |
JP2000047696A (ja) | 1998-07-29 | 2000-02-18 | Canon Inc | 情報処理方法及び装置、その記憶媒体 |
US6725195B2 (en) * | 1998-08-25 | 2004-04-20 | Sri International | Method and apparatus for probabilistic recognition using small number of state clusters |
US6246982B1 (en) * | 1999-01-26 | 2001-06-12 | International Business Machines Corporation | Method for measuring distance between collections of distributions |
JP3969908B2 (ja) | 1999-09-14 | 2007-09-05 | キヤノン株式会社 | 音声入力端末器、音声認識装置、音声通信システム及び音声通信方法 |
US7039588B2 (en) * | 2000-03-31 | 2006-05-02 | Canon Kabushiki Kaisha | Synthesis unit selection apparatus and method, and storage medium |
JP3814459B2 (ja) * | 2000-03-31 | 2006-08-30 | キヤノン株式会社 | 音声認識方法及び装置と記憶媒体 |
JP2001282278A (ja) * | 2000-03-31 | 2001-10-12 | Canon Inc | 音声情報処理装置及びその方法と記憶媒体 |
JP3728172B2 (ja) | 2000-03-31 | 2005-12-21 | キヤノン株式会社 | 音声合成方法および装置 |
JP4632384B2 (ja) * | 2000-03-31 | 2011-02-16 | キヤノン株式会社 | 音声情報処理装置及びその方法と記憶媒体 |
US6629073B1 (en) * | 2000-04-27 | 2003-09-30 | Microsoft Corporation | Speech recognition method and apparatus utilizing multi-unit models |
US6662158B1 (en) | 2000-04-27 | 2003-12-09 | Microsoft Corporation | Temporal pattern recognition method and apparatus utilizing segment and frame-based models |
JP3728177B2 (ja) | 2000-05-24 | 2005-12-21 | キヤノン株式会社 | 音声処理システム、装置、方法及び記憶媒体 |
KR100464428B1 (ko) * | 2002-08-12 | 2005-01-03 | 삼성전자주식회사 | 음성 인식 장치 |
JP4280505B2 (ja) * | 2003-01-20 | 2009-06-17 | キヤノン株式会社 | 情報処理装置及び情報処理方法 |
JP4587160B2 (ja) * | 2004-03-26 | 2010-11-24 | キヤノン株式会社 | 信号処理装置および方法 |
EP1741092B1 (de) * | 2004-04-20 | 2008-06-11 | France Télécom | Spracherkennung durch kontextuelle modellierung der spracheinheiten |
JP4541781B2 (ja) * | 2004-06-29 | 2010-09-08 | キヤノン株式会社 | 音声認識装置および方法 |
JP4298672B2 (ja) * | 2005-04-11 | 2009-07-22 | キヤノン株式会社 | 混合分布hmmの状態の出力確率計算方法および装置 |
US7970613B2 (en) | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation |
KR100943236B1 (ko) * | 2006-02-14 | 2010-02-18 | 에스케이에너지 주식회사 | 용융파단 특성이 우수한 폴리올레핀 미세다공막 및 그제조방법 |
US7877256B2 (en) * | 2006-02-17 | 2011-01-25 | Microsoft Corporation | Time synchronous decoding for long-span hidden trajectory model |
US8010358B2 (en) * | 2006-02-21 | 2011-08-30 | Sony Computer Entertainment Inc. | Voice recognition with parallel gender and age normalization |
US7778831B2 (en) * | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
JP4853298B2 (ja) * | 2007-01-17 | 2012-01-11 | 日本電気株式会社 | 信号処理装置、信号処理方法および信号処理プログラム |
US8543393B2 (en) * | 2008-05-20 | 2013-09-24 | Calabrio, Inc. | Systems and methods of improving automated speech recognition accuracy using statistical analysis of search terms |
US8442833B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones |
US8442829B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory |
US8788256B2 (en) * | 2009-02-17 | 2014-07-22 | Sony Computer Entertainment Inc. | Multiple language voice recognition |
US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0833739B2 (ja) * | 1990-09-13 | 1996-03-29 | 三菱電機株式会社 | パターン表現モデル学習装置 |
US5199077A (en) * | 1991-09-19 | 1993-03-30 | Xerox Corporation | Wordspotting for voice editing and indexing |
JPH05257492A (ja) * | 1992-03-13 | 1993-10-08 | Toshiba Corp | 音声認識方式 |
-
1994
- 1994-10-07 JP JP24400594A patent/JP3581401B2/ja not_active Expired - Fee Related
-
1995
- 1995-09-18 US US08/529,436 patent/US5787396A/en not_active Expired - Lifetime
- 1995-09-29 EP EP95306890A patent/EP0706171B1/de not_active Expired - Lifetime
- 1995-09-29 DE DE69519840T patent/DE69519840T2/de not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
EP0706171B1 (de) | 2001-01-10 |
US5787396A (en) | 1998-07-28 |
JPH08110791A (ja) | 1996-04-30 |
EP0706171A1 (de) | 1996-04-10 |
DE69519840D1 (de) | 2001-02-15 |
JP3581401B2 (ja) | 2004-10-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE69519840D1 (de) | Einrichtung und Verfahren zur Spracherkennung | |
DE69420400D1 (de) | Verfahren und gerät zur sprechererkennung | |
DE69518705T2 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
DE69524829D1 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
DE69717899T2 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
DE69730930D1 (de) | Verfahren und Gerät zur Zeichenerkennung | |
DE69828141D1 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
DE69806557T2 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
DE69707876D1 (de) | Verfahren und vorrichtung fuer dynamisch eingestelltes training zur spracherkennung | |
DE69726235D1 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
DE69324428T2 (de) | Verfahren zur Sprachformung und Gerät zur Spracherkennung | |
DE69732156D1 (de) | Verfahren und Gerät zur Zeichenerkennung | |
DE59707384D1 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
DE69324629T2 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
DE69531710D1 (de) | Verfahren und Vorrichtung zur Verminderung von Rauschen bei Sprachsignalen | |
DE69031284D1 (de) | Verfahren und Einrichtung zur Spracherkennung | |
DE69421324T2 (de) | Verfahren und Vorrichtung zur Sprachkommunikation | |
DE69524677D1 (de) | Gerät und Verfahren zur Bilderkennung | |
DE69727895D1 (de) | Verfahren und Vorrichtung zur Sprachkodierung | |
DE69428475D1 (de) | Verfahren und Gerät zur automatischen Spracherkennung | |
DE69830017D1 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
DE69517829D1 (de) | Vorrichtung und Verfahren zur Spracherkennung | |
DE69715281D1 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
DE69620304T2 (de) | Vorrichtung und Verfahren zur Spracherkennung | |
DE69030548D1 (de) | Verfahren und Einrichtung zur Spracherkennung |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition |