DE69925479D1 - Dynamisch konfigurierbares akustisches modell für spracherkennungssysteme - Google Patents

Dynamisch konfigurierbares akustisches modell für spracherkennungssysteme

Info

Publication number
DE69925479D1
DE69925479D1 DE69925479T DE69925479T DE69925479D1 DE 69925479 D1 DE69925479 D1 DE 69925479D1 DE 69925479 T DE69925479 T DE 69925479T DE 69925479 T DE69925479 T DE 69925479T DE 69925479 D1 DE69925479 D1 DE 69925479D1
Authority
DE
Germany
Prior art keywords
acoustic model
recognition systems
language recognition
dynamic configurable
configurable acoustic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69925479T
Other languages
English (en)
Other versions
DE69925479T2 (de
Inventor
Mei-Yuh Hwang
D Huang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of DE69925479D1 publication Critical patent/DE69925479D1/de
Application granted granted Critical
Publication of DE69925479T2 publication Critical patent/DE69925479T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • G10L15/144Training of HMMs
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0631Creating reference templates; Clustering

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
DE69925479T 1998-04-15 1999-03-29 Dynamisch konfigurierbares akustisches modell für spracherkennungssysteme Expired - Lifetime DE69925479T2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US09/060,654 US6141641A (en) 1998-04-15 1998-04-15 Dynamically configurable acoustic model for speech recognition system
US60654 1998-04-15
PCT/US1999/006837 WO1999053478A1 (en) 1998-04-15 1999-03-29 Dynamically configurable acoustic model for speech recognition systems

Publications (2)

Publication Number Publication Date
DE69925479D1 true DE69925479D1 (de) 2005-06-30
DE69925479T2 DE69925479T2 (de) 2006-02-02

Family

ID=22030937

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69925479T Expired - Lifetime DE69925479T2 (de) 1998-04-15 1999-03-29 Dynamisch konfigurierbares akustisches modell für spracherkennungssysteme

Country Status (6)

Country Link
US (1) US6141641A (de)
EP (1) EP1070314B1 (de)
JP (2) JP4450991B2 (de)
CN (1) CN1139911C (de)
DE (1) DE69925479T2 (de)
WO (1) WO1999053478A1 (de)

Families Citing this family (52)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6807537B1 (en) * 1997-12-04 2004-10-19 Microsoft Corporation Mixtures of Bayesian networks
US6418431B1 (en) * 1998-03-30 2002-07-09 Microsoft Corporation Information retrieval and speech recognition based on language models
US6141641A (en) * 1998-04-15 2000-10-31 Microsoft Corporation Dynamically configurable acoustic model for speech recognition system
DE59904741D1 (de) * 1998-05-11 2003-04-30 Siemens Ag Anordnung und verfahren zur erkennung eines vorgegebenen wortschatzes in gesprochener sprache durch einen rechner
US6684186B2 (en) * 1999-01-26 2004-01-27 International Business Machines Corporation Speaker recognition using a hierarchical speaker model tree
US6904402B1 (en) * 1999-11-05 2005-06-07 Microsoft Corporation System and iterative method for lexicon, segmentation and language model joint optimization
US6442519B1 (en) * 1999-11-10 2002-08-27 International Business Machines Corp. Speaker model adaptation via network of similar users
US6792405B2 (en) * 1999-12-10 2004-09-14 At&T Corp. Bitstream-based feature extraction method for a front-end speech recognizer
US7110947B2 (en) 1999-12-10 2006-09-19 At&T Corp. Frame erasure concealment technique for a bitstream-based feature extractor
US6865528B1 (en) * 2000-06-01 2005-03-08 Microsoft Corporation Use of a unified language model
US7031908B1 (en) 2000-06-01 2006-04-18 Microsoft Corporation Creating a language model for a language processing system
US7020587B1 (en) * 2000-06-30 2006-03-28 Microsoft Corporation Method and apparatus for generating and managing a language model data structure
JP4336865B2 (ja) * 2001-03-13 2009-09-30 日本電気株式会社 音声認識装置
US8229753B2 (en) * 2001-10-21 2012-07-24 Microsoft Corporation Web server controls for web enabled recognition and/or audible prompting
US7711570B2 (en) * 2001-10-21 2010-05-04 Microsoft Corporation Application abstraction with dialog purpose
CN1323532C (zh) * 2001-11-15 2007-06-27 松下电器产业株式会社 错误隐蔽装置和方法
DE10220524B4 (de) 2002-05-08 2006-08-10 Sap Ag Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache
DE10220520A1 (de) * 2002-05-08 2003-11-20 Sap Ag Verfahren zur Erkennung von Sprachinformation
EP1361740A1 (de) * 2002-05-08 2003-11-12 Sap Ag Verfahren und System zur Verarbeitung von Sprachinformationen eines Dialogs
EP1363271A1 (de) 2002-05-08 2003-11-19 Sap Ag Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs
US7940844B2 (en) 2002-06-18 2011-05-10 Qualcomm Incorporated Video encoding and decoding techniques
US7533023B2 (en) * 2003-02-12 2009-05-12 Panasonic Corporation Intermediary speech processor in network environments transforming customized speech parameters
US7529671B2 (en) 2003-03-04 2009-05-05 Microsoft Corporation Block synchronous decoding
US7571097B2 (en) * 2003-03-13 2009-08-04 Microsoft Corporation Method for training of subspace coded gaussian models
US7200559B2 (en) 2003-05-29 2007-04-03 Microsoft Corporation Semantic object synchronous understanding implemented with speech application language tags
US8301436B2 (en) * 2003-05-29 2012-10-30 Microsoft Corporation Semantic object synchronous understanding for highly interactive interface
US8160883B2 (en) 2004-01-10 2012-04-17 Microsoft Corporation Focus tracking in dialogs
US7231019B2 (en) * 2004-02-12 2007-06-12 Microsoft Corporation Automatic identification of telephone callers based on voice characteristics
KR100590561B1 (ko) * 2004-10-12 2006-06-19 삼성전자주식회사 신호의 피치를 평가하는 방법 및 장치
US20060136210A1 (en) * 2004-12-16 2006-06-22 Sony Corporation System and method for tying variance vectors for speech recognition
US20070088552A1 (en) * 2005-10-17 2007-04-19 Nokia Corporation Method and a device for speech recognition
US7970613B2 (en) * 2005-11-12 2011-06-28 Sony Computer Entertainment Inc. Method and system for Gaussian probability data bit reduction and computation
US7680664B2 (en) * 2006-08-16 2010-03-16 Microsoft Corporation Parsimonious modeling by non-uniform kernel allocation
US8234116B2 (en) * 2006-08-22 2012-07-31 Microsoft Corporation Calculating cost measures between HMM acoustic models
US8165877B2 (en) * 2007-08-03 2012-04-24 Microsoft Corporation Confidence measure generation for speech related searching
US8160878B2 (en) * 2008-09-16 2012-04-17 Microsoft Corporation Piecewise-based variable-parameter Hidden Markov Models and the training thereof
US8145488B2 (en) * 2008-09-16 2012-03-27 Microsoft Corporation Parameter clustering and sharing for variable-parameter hidden markov models
US9002713B2 (en) * 2009-06-09 2015-04-07 At&T Intellectual Property I, L.P. System and method for speech personalization by need
US8484023B2 (en) * 2010-09-24 2013-07-09 Nuance Communications, Inc. Sparse representation features for speech recognition
KR20120045582A (ko) * 2010-10-29 2012-05-09 한국전자통신연구원 음향 모델 생성 장치 및 방법
US8959014B2 (en) 2011-06-30 2015-02-17 Google Inc. Training acoustic models using distributed computing techniques
US9153235B2 (en) 2012-04-09 2015-10-06 Sony Computer Entertainment Inc. Text dependent speaker recognition with long-term feature based on functional data analysis
US9514739B2 (en) * 2012-06-06 2016-12-06 Cypress Semiconductor Corporation Phoneme score accelerator
US9224384B2 (en) * 2012-06-06 2015-12-29 Cypress Semiconductor Corporation Histogram based pre-pruning scheme for active HMMS
JP5659203B2 (ja) * 2012-09-06 2015-01-28 株式会社東芝 モデル学習装置、モデル作成方法及びモデル作成プログラム
US9336771B2 (en) * 2012-11-01 2016-05-10 Google Inc. Speech recognition using non-parametric models
US20140372118A1 (en) * 2013-06-17 2014-12-18 Speech Morphing Systems, Inc. Method and apparatus for exemplary chip architecture
CN104766608A (zh) * 2014-01-07 2015-07-08 深圳市中兴微电子技术有限公司 一种语音控制方法及装置
US9858922B2 (en) 2014-06-23 2018-01-02 Google Inc. Caching speech recognition scores
US9953646B2 (en) 2014-09-02 2018-04-24 Belleau Technologies Method and system for dynamic speech recognition and tracking of prewritten script
US9299347B1 (en) 2014-10-22 2016-03-29 Google Inc. Speech recognition using associative mapping
CN112509567B (zh) * 2020-12-25 2024-05-10 阿波罗智联(北京)科技有限公司 语音数据处理的方法、装置、设备、存储介质及程序产品

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0190489B1 (de) * 1984-12-27 1991-10-30 Texas Instruments Incorporated Verfahren und Einrichtung zur sprecherunabhängigen Spracherkennung
US4797929A (en) * 1986-01-03 1989-01-10 Motorola, Inc. Word recognition in a speech recognition system using data reduced word templates
US5033087A (en) * 1989-03-14 1991-07-16 International Business Machines Corp. Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system
JP2662120B2 (ja) * 1991-10-01 1997-10-08 インターナショナル・ビジネス・マシーンズ・コーポレイション 音声認識装置および音声認識用処理ユニット
EP0590173A1 (de) * 1992-09-28 1994-04-06 International Business Machines Corporation Computersystem zur Spracherkennung
JPH0769711B2 (ja) * 1993-03-09 1995-07-31 株式会社エイ・ティ・アール自動翻訳電話研究所 音声認識方法
US5794197A (en) * 1994-01-21 1998-08-11 Micrsoft Corporation Senone tree representation and evaluation
US5794198A (en) * 1994-10-28 1998-08-11 Nippon Telegraph And Telephone Corporation Pattern recognition method
JPH08248986A (ja) * 1995-03-13 1996-09-27 Nippon Telegr & Teleph Corp <Ntt> パターン認識方法
US5710866A (en) * 1995-05-26 1998-01-20 Microsoft Corporation System and method for speech recognition using dynamically adjusted confidence measure
JP2852210B2 (ja) * 1995-09-19 1999-01-27 株式会社エイ・ティ・アール音声翻訳通信研究所 不特定話者モデル作成装置及び音声認識装置
DE69517705T2 (de) * 1995-11-04 2000-11-23 International Business Machines Corp., Armonk Verfahren und vorrichtung zur anpassung der grösse eines sprachmodells in einem spracherkennungssystem
JPH09134193A (ja) * 1995-11-08 1997-05-20 Nippon Telegr & Teleph Corp <Ntt> 音声認識装置
US5806030A (en) * 1996-05-06 1998-09-08 Matsushita Electric Ind Co Ltd Low complexity, high accuracy clustering method for speech recognizer
US5822730A (en) * 1996-08-22 1998-10-13 Dragon Systems, Inc. Lexical tree pre-filtering in speech recognition
US5963902A (en) * 1997-07-30 1999-10-05 Nynex Science & Technology, Inc. Methods and apparatus for decreasing the size of generated models trained for automatic pattern recognition
US5950158A (en) * 1997-07-30 1999-09-07 Nynex Science And Technology, Inc. Methods and apparatus for decreasing the size of pattern recognition models by pruning low-scoring models from generated sets of models
US6141641A (en) * 1998-04-15 2000-10-31 Microsoft Corporation Dynamically configurable acoustic model for speech recognition system

Also Published As

Publication number Publication date
JP4913204B2 (ja) 2012-04-11
JP2002511609A (ja) 2002-04-16
CN1139911C (zh) 2004-02-25
EP1070314A1 (de) 2001-01-24
JP2010049291A (ja) 2010-03-04
DE69925479T2 (de) 2006-02-02
US6141641A (en) 2000-10-31
CN1301379A (zh) 2001-06-27
EP1070314B1 (de) 2005-05-25
JP4450991B2 (ja) 2010-04-14
WO1999053478A1 (en) 1999-10-21

Similar Documents

Publication Publication Date Title
DE69925479D1 (de) Dynamisch konfigurierbares akustisches modell für spracherkennungssysteme
DE69839134D1 (de) Aktive akustische Vorrichtung
DE60134335D1 (de) Sondierungssystem für schallquellen
DE69817087D1 (de) Vorrichtung zur aktiven Geräuschdämpfung
DE69825975D1 (de) Sklero-keratektomie-implantat für die deskemetische membran
DE69526666D1 (de) Globales schallmikrofonsystem
NO20004921D0 (no) Akustisk innretning
DE60014833D1 (de) Sprachverarbeitung
DE69603165D1 (de) Tragbares intelligentes system für die luftprobennahme
DE60115738D1 (de) Sprachmodelle für die Spracherkennung
DE69932507D1 (de) Akustisches gerät
DE69933257D1 (de) Lithographische Vorrichtung
DE19983596T1 (de) Lautsprechersystem
DE59906108D1 (de) Akustisches diagnosesystem und -verfahren
DE69819672D1 (de) Akustisches Standortmesssystem
DE69816354D1 (de) Trennvorrichtung für nicht-stationäre Quellen
DE69922078D1 (de) Servo-Verstärkereinheit
DE69823471D1 (de) Schwingungsverzehrende Vorrichtung
DE10195945T1 (de) Richtverarbeitung für ein System mit mehreren Mikrophonen
IT1315163B1 (it) Diffusore acustico e metodo per la sua realizzazione
DE69613261D1 (de) Stanzeinrichtung für Lochbearbeitungen
DE69927393D1 (de) Stanzstempeleinheit
DE60120558D1 (de) Mikrofonhalterung für ein aktives Lärmunterdrückungssystem
FR2732574B3 (fr) Boite aux lettres
DE60103320D1 (de) Befestigungsvorrichtung für ein aktives Geräuschdämpfungssystem

Legal Events

Date Code Title Description
8364 No opposition during term of opposition