DE60000134T2 - Unüberwachte Anpassung eines Spracherkenners unter Verwendung zuverlässiger Informationen aus den besten N Rechenhypothesen - Google Patents
Unüberwachte Anpassung eines Spracherkenners unter Verwendung zuverlässiger Informationen aus den besten N RechenhypothesenInfo
- Publication number
- DE60000134T2 DE60000134T2 DE60000134T DE60000134T DE60000134T2 DE 60000134 T2 DE60000134 T2 DE 60000134T2 DE 60000134 T DE60000134 T DE 60000134T DE 60000134 T DE60000134 T DE 60000134T DE 60000134 T2 DE60000134 T2 DE 60000134T2
- Authority
- DE
- Germany
- Prior art keywords
- hypotheses
- computational
- speech recognizer
- reliable information
- unsupervised adaptation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 230000006978 adaptation Effects 0.000 title 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Image Analysis (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/237,170 US6205426B1 (en) | 1999-01-25 | 1999-01-25 | Unsupervised speech model adaptation using reliable information among N-best strings |
Publications (2)
Publication Number | Publication Date |
---|---|
DE60000134D1 DE60000134D1 (de) | 2002-05-29 |
DE60000134T2 true DE60000134T2 (de) | 2002-12-12 |
Family
ID=22892612
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE60000134T Expired - Fee Related DE60000134T2 (de) | 1999-01-25 | 2000-01-18 | Unüberwachte Anpassung eines Spracherkenners unter Verwendung zuverlässiger Informationen aus den besten N Rechenhypothesen |
Country Status (5)
Country | Link |
---|---|
US (1) | US6205426B1 (de) |
EP (1) | EP1022723B1 (de) |
JP (1) | JP3685972B2 (de) |
DE (1) | DE60000134T2 (de) |
ES (1) | ES2174797T3 (de) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE102004029873B3 (de) * | 2004-06-16 | 2005-12-29 | Deutsche Telekom Ag | Verfahren und Vorrichtung zur intelligenten Eingabekorrektur für automatische Sprachdialogsysteme |
Families Citing this family (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6587824B1 (en) * | 2000-05-04 | 2003-07-01 | Visteon Global Technologies, Inc. | Selective speaker adaptation for an in-vehicle speech recognition system |
AU5205700A (en) * | 2000-06-15 | 2002-01-08 | Intel Corporation | Speaker adaptation using weighted feedback |
JP2002073072A (ja) * | 2000-08-31 | 2002-03-12 | Sony Corp | モデル適応装置およびモデル適応方法、記録媒体、並びにパターン認識装置 |
FR2814625B1 (fr) * | 2000-09-25 | 2003-01-03 | Prosodie | Systeme de telephonie avec sous-titrage et/ou traduction |
DE60029456T2 (de) * | 2000-12-11 | 2007-07-12 | Sony Deutschland Gmbh | Verfahren zur Online-Anpassung von Aussprachewörterbüchern |
US6970818B2 (en) * | 2001-12-07 | 2005-11-29 | Sony Corporation | Methodology for implementing a vocabulary set for use in a speech recognition system |
US7006972B2 (en) * | 2002-03-20 | 2006-02-28 | Microsoft Corporation | Generating a task-adapted acoustic model from one or more different corpora |
US7031918B2 (en) * | 2002-03-20 | 2006-04-18 | Microsoft Corporation | Generating a task-adapted acoustic model from one or more supervised and/or unsupervised corpora |
US7676366B2 (en) * | 2003-01-13 | 2010-03-09 | Art Advanced Recognition Technologies Inc. | Adaptation of symbols |
DE60316912T2 (de) * | 2003-04-29 | 2008-07-31 | Sony Deutschland Gmbh | Verfahren zur Spracherkennung |
US7835910B1 (en) * | 2003-05-29 | 2010-11-16 | At&T Intellectual Property Ii, L.P. | Exploiting unlabeled utterances for spoken language understanding |
US20060058999A1 (en) * | 2004-09-10 | 2006-03-16 | Simon Barker | Voice model adaptation |
GB2418764B (en) * | 2004-09-30 | 2008-04-09 | Fluency Voice Technology Ltd | Improving pattern recognition accuracy with distortions |
DE102004048348B4 (de) * | 2004-10-01 | 2006-07-13 | Daimlerchrysler Ag | Verfahren zur Adaption und/oder Erzeugung statistischer Sprachmodelle |
US7827032B2 (en) * | 2005-02-04 | 2010-11-02 | Vocollect, Inc. | Methods and systems for adapting a model for a speech recognition system |
US8200495B2 (en) | 2005-02-04 | 2012-06-12 | Vocollect, Inc. | Methods and systems for considering information about an expected response when performing speech recognition |
US7865362B2 (en) * | 2005-02-04 | 2011-01-04 | Vocollect, Inc. | Method and system for considering information about an expected response when performing speech recognition |
US7949533B2 (en) * | 2005-02-04 | 2011-05-24 | Vococollect, Inc. | Methods and systems for assessing and improving the performance of a speech recognition system |
US7895039B2 (en) * | 2005-02-04 | 2011-02-22 | Vocollect, Inc. | Methods and systems for optimizing model adaptation for a speech recognition system |
US8762148B2 (en) * | 2006-02-27 | 2014-06-24 | Nec Corporation | Reference pattern adaptation apparatus, reference pattern adaptation method and reference pattern adaptation program |
US8781837B2 (en) * | 2006-03-23 | 2014-07-15 | Nec Corporation | Speech recognition system and method for plural applications |
JP5041934B2 (ja) * | 2006-09-13 | 2012-10-03 | 本田技研工業株式会社 | ロボット |
US9798653B1 (en) * | 2010-05-05 | 2017-10-24 | Nuance Communications, Inc. | Methods, apparatus and data structure for cross-language speech adaptation |
KR20120046627A (ko) * | 2010-11-02 | 2012-05-10 | 삼성전자주식회사 | 화자 적응 방법 및 장치 |
US8914290B2 (en) | 2011-05-20 | 2014-12-16 | Vocollect, Inc. | Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment |
US9978395B2 (en) | 2013-03-15 | 2018-05-22 | Vocollect, Inc. | Method and system for mitigating delay in receiving audio stream during production of sound from audio stream |
EP2797078B1 (de) * | 2013-04-26 | 2016-10-12 | Agnitio S.L. | Schätzung der Zuverlässigkeit bei der Sprechererkennung |
US10714121B2 (en) | 2016-07-27 | 2020-07-14 | Vocollect, Inc. | Distinguishing user speech from background speech in speech-dense environments |
US10832679B2 (en) | 2018-11-20 | 2020-11-10 | International Business Machines Corporation | Method and system for correcting speech-to-text auto-transcription using local context of talk |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5864810A (en) * | 1995-01-20 | 1999-01-26 | Sri International | Method and apparatus for speech recognition adapted to an individual speaker |
US5737489A (en) * | 1995-09-15 | 1998-04-07 | Lucent Technologies Inc. | Discriminative utterance verification for connected digits recognition |
US5835890A (en) * | 1996-08-02 | 1998-11-10 | Nippon Telegraph And Telephone Corporation | Method for speaker adaptation of speech models recognition scheme using the method and recording medium having the speech recognition method recorded thereon |
US5930753A (en) * | 1997-03-20 | 1999-07-27 | At&T Corp | Combining frequency warping and spectral shaping in HMM based speech recognition |
US5970239A (en) * | 1997-08-11 | 1999-10-19 | International Business Machines Corporation | Apparatus and method for performing model estimation utilizing a discriminant measure |
US6076053A (en) * | 1998-05-21 | 2000-06-13 | Lucent Technologies Inc. | Methods and apparatus for discriminative training and adaptation of pronunciation networks |
-
1999
- 1999-01-25 US US09/237,170 patent/US6205426B1/en not_active Expired - Lifetime
-
2000
- 2000-01-18 EP EP00300315A patent/EP1022723B1/de not_active Expired - Lifetime
- 2000-01-18 ES ES00300315T patent/ES2174797T3/es not_active Expired - Lifetime
- 2000-01-18 DE DE60000134T patent/DE60000134T2/de not_active Expired - Fee Related
- 2000-01-24 JP JP2000014485A patent/JP3685972B2/ja not_active Expired - Fee Related
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE102004029873B3 (de) * | 2004-06-16 | 2005-12-29 | Deutsche Telekom Ag | Verfahren und Vorrichtung zur intelligenten Eingabekorrektur für automatische Sprachdialogsysteme |
Also Published As
Publication number | Publication date |
---|---|
EP1022723A3 (de) | 2001-05-30 |
US6205426B1 (en) | 2001-03-20 |
DE60000134D1 (de) | 2002-05-29 |
JP3685972B2 (ja) | 2005-08-24 |
EP1022723A2 (de) | 2000-07-26 |
JP2000214883A (ja) | 2000-08-04 |
EP1022723B1 (de) | 2002-04-24 |
ES2174797T3 (es) | 2002-11-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60000134D1 (de) | Unüberwachte Anpassung eines Spracherkenners unter Verwendung zuverlässiger Informationen aus den besten N Rechenhypothesen | |
DE10191732T1 (de) | Selektive Sprecheradaption für ein fahrzeuggebundenes Spracherkennungssystem | |
DE69917112D1 (de) | Erweiterung des Wortschatzes eines Client-Server-Spracherkennungssystems | |
DE69827586D1 (de) | Technik zur Adaptation von Hidden Markov Modellen für die Spracherkennung | |
DE60003971D1 (de) | Verteilte Architektur zum Trainieren eines Spracherkennungssystems | |
DE60000138D1 (de) | Erzeugung von mehreren Aussprachen eines Eigennames für die Spracherkennung | |
DE59801560D1 (de) | Verfahren zur Spracherkennung mit Sprachmodellanpassung | |
DE69919842D1 (de) | Sprachmodell basierend auf der spracherkennungshistorie | |
DE60123747D1 (de) | Spracherkennungsbasiertes Untertitelungssystem | |
DE69414752D1 (de) | Sprecherunabhängiges Erkennungssystem für isolierte Wörter unter Verwendung eines neuronalen Netzes | |
BR0206836A (pt) | Sistema de reconhecimento de voz distribuìdo usando modificação de vetor de caracterìstica acústica | |
DE69613910T2 (de) | Adaptives, auf der Grundlage eines Kodebuchs arbeitendes Sprachkompressionssystem | |
DE69626954D1 (de) | Signalkonditioniertes training mit minimaler fehlerrate für kontinuierliche spracherkennung | |
WO1999016052A3 (en) | Speech recognition system for recognizing continuous and isolated speech | |
DE69814589D1 (de) | Spracherkennung unter verwendung mehrerer spracherkenner | |
DE69615667T2 (de) | Spracherkennung | |
FI990077A (fi) | Menetelmä puheen tunnistuksessa ja puheella ohjattava langaton viestin | |
DE60229095D1 (de) | Ausprachen in mehreren Sprachen zur Spracherkennung | |
DK1374223T3 (da) | Stemmegenkendelsessystem, der gör brug af implicit talertilpasning | |
CA2299051A1 (en) | Hierarchical subband linear predictive cepstral features for hmm-based speech recognition | |
DE69933623D1 (de) | Spracherkennung | |
DE60042259D1 (de) | Spracherkennung einer in einer Sprachmitteilung eingefüten Telefonnummer in einem Sprachnachrichtensystem | |
DE69902233D1 (de) | Sprachkodierung unter verwendung einer weichen adaptation | |
DE59607861D1 (de) | Spracherkennungssystem | |
EP1251489A3 (de) | Training von Parametern eines Spracherkennungssystems zur Erkennung von Aussprachevarianten |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition | ||
8339 | Ceased/non-payment of the annual fee |