DE69519840T2 - Apparatus and method for speech recognition - Google Patents

Apparatus and method for speech recognition

Info

Publication number
DE69519840T2
Authority
DE
Germany
Prior art keywords
voice recognition
recognition device
voice
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69519840T
Other languages
English (en)
Other versions
DE69519840D1 (de)
Inventor
Yasuhiro Komori
Yasunori Ohora
Masayuki Yamada
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc
Application granted
Publication of DE69519840D1
Publication of DE69519840T2
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/14 Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142 Hidden Markov Models [HMMs]
    • G10L15/144 Training of HMMs
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187 Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
DE69519840T 1994-10-07 1995-09-29 Apparatus and method for speech recognition Expired - Lifetime DE69519840T2 (de)

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
JP24400594A, JP3581401B2 (ja) | 1994-10-07 | 1994-10-07 | Speech recognition method

Publications (2)

Publication Number | Publication Date
DE69519840D1 (de) | 2001-02-15
DE69519840T2 (de) | 2001-06-28

Family

ID=17112303

Family Applications (1)

Application Number | Priority Date | Filing Date | Title
DE69519840T, DE69519840T2 (de), Expired - Lifetime | 1994-10-07 | 1995-09-29 | Apparatus and method for speech recognition

Country Status (4)

Country | Link
US (1) | US5787396A (de)
EP (1) | EP0706171B1 (de)
JP (1) | JP3581401B2 (de)
DE (1) | DE69519840T2 (de)

Families Citing this family (41)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US5937384A (en) * | 1996-05-01 | 1999-08-10 | Microsoft Corporation | Method and system for speech recognition using continuous density hidden Markov models
US5963903A (en) * | 1996-06-28 | 1999-10-05 | Microsoft Corporation | Method and system for dynamically adjusted training for speech recognition
JPH10161692A (ja) * | 1996-12-03 | 1998-06-19 | Canon Inc | Speech recognition apparatus and speech recognition method
JPH10187195A (ja) * | 1996-12-26 | 1998-07-14 | Canon Inc | Speech synthesis method and apparatus
US5857173A (en) * | 1997-01-30 | 1999-01-05 | Motorola, Inc. | Pronunciation measurement device and method
JP3962445B2 (ja) | 1997-03-13 | 2007-08-22 | キヤノン株式会社 | Speech processing method and apparatus
JPH10254486A (ja) | 1997-03-13 | 1998-09-25 | Canon Inc | Speech recognition apparatus and method
US6073098A (en) * | 1997-11-21 | 2000-06-06 | At&T Corporation | Method and apparatus for generating deterministic approximate weighted finite-state automata
JP3884856B2 (ja) | 1998-03-09 | 2007-02-21 | キヤノン株式会社 | Speech synthesis data creation apparatus, speech synthesis apparatus, methods therefor, and computer-readable memory
JP3902860B2 (ja) * | 1998-03-09 | 2007-04-11 | キヤノン株式会社 | Speech synthesis control apparatus, control method therefor, and computer-readable memory
EP0953971A1 (de) * | 1998-05-01 | 1999-11-03 | Entropic Cambridge Research Laboratory Ltd. | System and method for speech recognition
US6061653A (en) * | 1998-07-14 | 2000-05-09 | Alcatel Usa Sourcing, L.P. | Speech recognition system using shared speech models for multiple recognition processes
JP2000047696A (ja) | 1998-07-29 | 2000-02-18 | Canon Inc | Information processing method and apparatus, and storage medium therefor
US6725195B2 (en) * | 1998-08-25 | 2004-04-20 | Sri International | Method and apparatus for probabilistic recognition using small number of state clusters
US6246982B1 (en) * | 1999-01-26 | 2001-06-12 | International Business Machines Corporation | Method for measuring distance between collections of distributions
JP3969908B2 (ja) | 1999-09-14 | 2007-09-05 | キヤノン株式会社 | Speech input terminal, speech recognition apparatus, speech communication system, and speech communication method
US7039588B2 (en) * | 2000-03-31 | 2006-05-02 | Canon Kabushiki Kaisha | Synthesis unit selection apparatus and method, and storage medium
JP3814459B2 (ja) * | 2000-03-31 | 2006-08-30 | キヤノン株式会社 | Speech recognition method and apparatus, and storage medium
JP2001282278A (ja) * | 2000-03-31 | 2001-10-12 | Canon Inc | Speech information processing apparatus and method, and storage medium
JP3728172B2 (ja) | 2000-03-31 | 2005-12-21 | キヤノン株式会社 | Speech synthesis method and apparatus
JP4632384B2 (ja) * | 2000-03-31 | 2011-02-16 | キヤノン株式会社 | Speech information processing apparatus and method, and storage medium
US6629073B1 (en) * | 2000-04-27 | 2003-09-30 | Microsoft Corporation | Speech recognition method and apparatus utilizing multi-unit models
US6662158B1 (en) | 2000-04-27 | 2003-12-09 | Microsoft Corporation | Temporal pattern recognition method and apparatus utilizing segment and frame-based models
JP3728177B2 (ja) | 2000-05-24 | 2005-12-21 | キヤノン株式会社 | Speech processing system, apparatus, method, and storage medium
KR100464428B1 (ko) * | 2002-08-12 | 2005-01-03 | 삼성전자주식회사 | Speech recognition apparatus
JP4280505B2 (ja) * | 2003-01-20 | 2009-06-17 | キヤノン株式会社 | Information processing apparatus and information processing method
JP4587160B2 (ja) * | 2004-03-26 | 2010-11-24 | キヤノン株式会社 | Signal processing apparatus and method
EP1741092B1 (de) * | 2004-04-20 | 2008-06-11 | France Télécom | Speech recognition by contextual modeling of speech units
JP4541781B2 (ja) * | 2004-06-29 | 2010-09-08 | キヤノン株式会社 | Speech recognition apparatus and method
JP4298672B2 (ja) * | 2005-04-11 | 2009-07-22 | キヤノン株式会社 | Method and apparatus for calculating the output probability of a state of a mixture-distribution HMM
US7970613B2 (en) | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation
KR100943236B1 (ko) * | 2006-02-14 | 2010-02-18 | 에스케이에너지 주식회사 | Polyolefin microporous film with excellent melt fracture properties and method for manufacturing the same
US7877256B2 (en) * | 2006-02-17 | 2011-01-25 | Microsoft Corporation | Time synchronous decoding for long-span hidden trajectory model
US8010358B2 (en) * | 2006-02-21 | 2011-08-30 | Sony Computer Entertainment Inc. | Voice recognition with parallel gender and age normalization
US7778831B2 (en) * | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
JP4853298B2 (ja) * | 2007-01-17 | 2012-01-11 | 日本電気株式会社 | Signal processing apparatus, signal processing method, and signal processing program
US8543393B2 (en) * | 2008-05-20 | 2013-09-24 | Calabrio, Inc. | Systems and methods of improving automated speech recognition accuracy using statistical analysis of search terms
US8442833B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones
US8442829B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory
US8788256B2 (en) * | 2009-02-17 | 2014-07-22 | Sony Computer Entertainment Inc. | Multiple language voice recognition
US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
JPH0833739B2 (ja) * | 1990-09-13 | 1996-03-29 | 三菱電機株式会社 | Pattern representation model learning apparatus
US5199077A (en) * | 1991-09-19 | 1993-03-30 | Xerox Corporation | Wordspotting for voice editing and indexing
JPH05257492A (ja) * | 1992-03-13 | 1993-10-08 | Toshiba Corp | Speech recognition system

Also Published As

Publication number | Publication date
EP0706171B1 (de) | 2001-01-10
US5787396A (en) | 1998-07-28
JPH08110791A (ja) | 1996-04-30
EP0706171A1 (de) | 1996-04-10
DE69519840D1 (de) | 2001-02-15
JP3581401B2 (ja) | 2004-10-27

Similar Documents

Publication | Title
DE69519840D1 (de) | Apparatus and method for speech recognition
DE69420400D1 (de) | Method and device for speaker recognition
DE69518705T2 (de) | Method and apparatus for speech recognition
DE69524829D1 (de) | Method and apparatus for speech recognition
DE69717899T2 (de) | Method and apparatus for speech recognition
DE69730930D1 (de) | Method and device for character recognition
DE69828141D1 (de) | Method and apparatus for speech recognition
DE69806557T2 (de) | Method and apparatus for speech recognition
DE69707876D1 (de) | Method and apparatus for dynamically adjusted training for speech recognition
DE69726235D1 (de) | Method and apparatus for speech recognition
DE69324428T2 (de) | Method for speech shaping and device for speech recognition
DE69732156D1 (de) | Method and device for character recognition
DE59707384D1 (de) | Method and apparatus for speech recognition
DE69324629T2 (de) | Method and apparatus for speech recognition
DE69531710D1 (de) | Method and apparatus for reducing noise in speech signals
DE69031284D1 (de) | Method and apparatus for speech recognition
DE69421324T2 (de) | Method and apparatus for voice communication
DE69524677D1 (de) | Device and method for image recognition
DE69727895D1 (de) | Method and apparatus for speech coding
DE69428475D1 (de) | Method and device for automatic speech recognition
DE69830017D1 (de) | Method and apparatus for speech recognition
DE69517829D1 (de) | Apparatus and method for speech recognition
DE69715281D1 (de) | Method and apparatus for speech recognition
DE69620304T2 (de) | Apparatus and method for speech recognition
DE69030548D1 (de) | Method and apparatus for speech recognition

Legal Events

Date Code Title Description
8364 No opposition during term of opposition