GB2391680A - Adaptive learning of language models for speech recognition - Google Patents
Adaptive learning of language models for speech recognition Download PDFInfo
- Publication number
- GB2391680A GB2391680A GB0326758A GB0326758A GB2391680A GB 2391680 A GB2391680 A GB 2391680A GB 0326758 A GB0326758 A GB 0326758A GB 0326758 A GB0326758 A GB 0326758A GB 2391680 A GB2391680 A GB 2391680A
- Authority
- GB
- United Kingdom
- Prior art keywords
- speech
- adaptive learning
- user
- utterances
- models
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000003044 adaptive effect Effects 0.000 title abstract 4
- 230000004044 response Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
Abstract
A pattern matching system compares an input signal with a set of models in order to map the signal to one of a set of classes. A spoken language interface comprises an automatic speech recognition system (ASR) for recognising speech inputs from a user, a speech generation system for providing speech to be delivered to the user, and a store of predicted utterances and compares a speech input from a user with those utterances to recognise the speech input. An adaptive learning unit includes an adaptive algorithm for automatically adapting the models of stored utterances in response to use of the Interface by one or more users. The adaptive learning may be on the basis of a recognition hypotheses selected and weighted according to their quality, determined by a hybrid confidence measure, their quantity, and their age.
Description
GB 2391680 A continuation (74) Agent and/or Address for Service: D Young &
Co 21 New Fetter Lane, LONDON, EC4A IDA, United Kingdom
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB0110810A GB2375211A (en) | 2001-05-02 | 2001-05-02 | Adaptive learning in speech recognition |
PCT/GB2002/002048 WO2002089112A1 (en) | 2001-05-02 | 2002-05-02 | Adaptive learning of language models for speech recognition |
Publications (3)
Publication Number | Publication Date |
---|---|
GB0326758D0 GB0326758D0 (en) | 2003-12-17 |
GB2391680A true GB2391680A (en) | 2004-02-11 |
GB2391680B GB2391680B (en) | 2005-07-20 |
Family
ID=9913924
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB0110810A Withdrawn GB2375211A (en) | 2001-05-02 | 2001-05-02 | Adaptive learning in speech recognition |
GB0326758A Expired - Fee Related GB2391680B (en) | 2001-05-02 | 2002-05-02 | Adaptive learning of language models for speech recognition |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB0110810A Withdrawn GB2375211A (en) | 2001-05-02 | 2001-05-02 | Adaptive learning in speech recognition |
Country Status (2)
Country | Link |
---|---|
GB (2) | GB2375211A (en) |
WO (1) | WO2002089112A1 (en) |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE10341305A1 (en) * | 2003-09-05 | 2005-03-31 | Daimlerchrysler Ag | Intelligent user adaptation in dialog systems |
US7899671B2 (en) | 2004-02-05 | 2011-03-01 | Avaya, Inc. | Recognition results postprocessor for use in voice recognition systems |
US7925506B2 (en) | 2004-10-05 | 2011-04-12 | Inago Corporation | Speech recognition accuracy via concept to keyword mapping |
EP2317507B1 (en) * | 2004-10-05 | 2015-07-08 | Inago Corporation | Corpus compilation for language model generation |
US7827032B2 (en) | 2005-02-04 | 2010-11-02 | Vocollect, Inc. | Methods and systems for adapting a model for a speech recognition system |
US7783488B2 (en) | 2005-12-19 | 2010-08-24 | Nuance Communications, Inc. | Remote tracing and debugging of automatic speech recognition servers by speech reconstruction from cepstra and pitch information |
EP2005417A2 (en) * | 2006-04-03 | 2008-12-24 | Vocollect, Inc. | Methods and systems for optimizing model adaptation for a speech recognition system |
US7756708B2 (en) * | 2006-04-03 | 2010-07-13 | Google Inc. | Automatic language model update |
ES2311351B1 (en) * | 2006-05-31 | 2009-12-17 | France Telecom España, S.A. | METHOD FOR DYNAMICALLY ADAPTING THE ACOUSTIC MODELS OF ACKNOWLEDGMENT OF SPEAKING TO THE USER. |
US9508346B2 (en) * | 2013-08-28 | 2016-11-29 | Verint Systems Ltd. | System and method of automated language model adaptation |
CN104681023A (en) * | 2015-02-15 | 2015-06-03 | 联想(北京)有限公司 | Information processing method and electronic equipment |
US20180068323A1 (en) * | 2016-09-03 | 2018-03-08 | Neustar, Inc. | Automated method for learning the responsiveness of potential consumers to different stimuli in a marketplace |
WO2020041945A1 (en) | 2018-08-27 | 2020-03-05 | Beijing Didi Infinity Technology And Development Co., Ltd. | Artificial intelligent systems and methods for displaying destination on mobile device |
CN114158283A (en) * | 2020-07-08 | 2022-03-08 | 谷歌有限责任公司 | Recognition and utilization of misrecognition in automatic speech recognition |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6173266B1 (en) * | 1997-05-06 | 2001-01-09 | Speechworks International, Inc. | System and method for developing interactive speech applications |
WO2001026093A1 (en) * | 1999-10-05 | 2001-04-12 | One Voice Technologies, Inc. | Interactive user interface using speech recognition and natural language processing |
WO2001050453A2 (en) * | 2000-01-04 | 2001-07-12 | Heyanita, Inc. | Interactive voice response system |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0241170B1 (en) * | 1986-03-28 | 1992-05-27 | AT&T Corp. | Adaptive speech feature signal generation arrangement |
US6026359A (en) * | 1996-09-20 | 2000-02-15 | Nippon Telegraph And Telephone Corporation | Scheme for model adaptation in pattern recognition based on Taylor expansion |
DE19708183A1 (en) * | 1997-02-28 | 1998-09-03 | Philips Patentverwaltung | Method for speech recognition with language model adaptation |
EP1426923B1 (en) * | 1998-12-17 | 2006-03-29 | Sony Deutschland GmbH | Semi-supervised speaker adaptation |
JP2001100781A (en) * | 1999-09-30 | 2001-04-13 | Sony Corp | Method and device for voice processing and recording medium |
-
2001
- 2001-05-02 GB GB0110810A patent/GB2375211A/en not_active Withdrawn
-
2002
- 2002-05-02 WO PCT/GB2002/002048 patent/WO2002089112A1/en not_active Application Discontinuation
- 2002-05-02 GB GB0326758A patent/GB2391680B/en not_active Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6173266B1 (en) * | 1997-05-06 | 2001-01-09 | Speechworks International, Inc. | System and method for developing interactive speech applications |
WO2001026093A1 (en) * | 1999-10-05 | 2001-04-12 | One Voice Technologies, Inc. | Interactive user interface using speech recognition and natural language processing |
WO2001050453A2 (en) * | 2000-01-04 | 2001-07-12 | Heyanita, Inc. | Interactive voice response system |
Non-Patent Citations (1)
Title |
---|
IEEE Transactions on speech and audio processing, Jan 2000, Vol 8, no 1, pages 3 to 10, G Riccardi et al, "Stochastic language adaption over time and state in natural spoken dialog systems" * |
Also Published As
Publication number | Publication date |
---|---|
GB0326758D0 (en) | 2003-12-17 |
GB2375211A (en) | 2002-11-06 |
WO2002089112A1 (en) | 2002-11-07 |
GB2391680B (en) | 2005-07-20 |
GB0110810D0 (en) | 2001-06-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8065144B1 (en) | Multilingual speech recognition | |
US7228275B1 (en) | Speech recognition system having multiple speech recognizers | |
US6208964B1 (en) | Method and apparatus for providing unsupervised adaptation of transcriptions | |
US7826945B2 (en) | Automobile speech-recognition interface | |
GB2391680A (en) | Adaptive learning of language models for speech recognition | |
CA2387079C (en) | Natural language interface control system | |
EP1450349B1 (en) | Vehicle-mounted control apparatus and program that causes computer to execute method of providing guidance on the operation of the vehicle-mounted control apparatus | |
KR100826875B1 (en) | On-line speaker recognition method and apparatus for thereof | |
US20020087306A1 (en) | Computer-implemented noise normalization method and system | |
US20080103781A1 (en) | Automatically adapting user guidance in automated speech recognition | |
EP0398574A3 (en) | Speech recognition employing key word modeling and non-key word modeling | |
US7912727B2 (en) | Apparatus and method for integrated phrase-based and free-form speech-to-speech translation | |
EP0984428A3 (en) | Method and system for automatically determining phonetic transciptions associated with spelled words | |
EP0953967A3 (en) | An automated hotel attendant using speech recognition | |
JP2001503154A (en) | Hidden Markov Speech Model Fitting Method in Speech Recognition System | |
JPS59225635A (en) | Ultranarrow band communication system | |
EP1525577B1 (en) | Method for automatic speech recognition | |
DE69623364T2 (en) | Device for recognizing continuously spoken language | |
ATE391984T1 (en) | AUTOMATIC RETRAINING OF A VOICE RECOGNITION SYSTEM | |
EP0949606A3 (en) | Method and system for speech recognition based on phonetic transcriptions | |
AU2002233238A1 (en) | Mobile terminal controllable by spoken utterances | |
US20030163309A1 (en) | Speech dialogue system | |
JP4796686B2 (en) | How to train an automatic speech recognizer | |
Juang et al. | Deployable automatic speech recognition systems: Advances and challenges | |
GB2394590A (en) | System and method for speech verification using a robust confidence measure |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PCNP | Patent ceased through non-payment of renewal fee |
Effective date: 20200502 |
|
732E | Amendments to the register in respect of changes of name or changes affecting rights (sect. 32/1977) |
Free format text: REGISTERED BETWEEN 20210826 AND 20210901 |