GB2391680A - Adaptive learning of language models for speech recognition - Google Patents

Adaptive learning of language models for speech recognition Download PDF

Info

Publication number
GB2391680A
GB2391680A GB0326758A GB0326758A GB2391680A GB 2391680 A GB2391680 A GB 2391680A GB 0326758 A GB0326758 A GB 0326758A GB 0326758 A GB0326758 A GB 0326758A GB 2391680 A GB2391680 A GB 2391680A
Authority
GB
United Kingdom
Prior art keywords
speech
adaptive learning
user
utterances
models
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB0326758A
Other versions
GB0326758D0 (en
GB2391680B (en
Inventor
Kerry Robinson
David Horowitz
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vox Generation Ltd
Original Assignee
Vox Generation Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vox Generation Ltd filed Critical Vox Generation Ltd
Publication of GB0326758D0 publication Critical patent/GB0326758D0/en
Publication of GB2391680A publication Critical patent/GB2391680A/en
Application granted granted Critical
Publication of GB2391680B publication Critical patent/GB2391680B/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)

Abstract

A pattern matching system compares an input signal with a set of models in order to map the signal to one of a set of classes. A spoken language interface comprises an automatic speech recognition system (ASR) for recognising speech inputs from a user, a speech generation system for providing speech to be delivered to the user, and a store of predicted utterances and compares a speech input from a user with those utterances to recognise the speech input. An adaptive learning unit includes an adaptive algorithm for automatically adapting the models of stored utterances in response to use of the Interface by one or more users. The adaptive learning may be on the basis of a recognition hypotheses selected and weighted according to their quality, determined by a hybrid confidence measure, their quantity, and their age.

Description

GB 2391680 A continuation (74) Agent and/or Address for Service: D Young &
Co 21 New Fetter Lane, LONDON, EC4A IDA, United Kingdom
GB0326758A 2001-05-02 2002-05-02 Adaptive learning of language models for speech recognition Expired - Fee Related GB2391680B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB0110810A GB2375211A (en) 2001-05-02 2001-05-02 Adaptive learning in speech recognition
PCT/GB2002/002048 WO2002089112A1 (en) 2001-05-02 2002-05-02 Adaptive learning of language models for speech recognition

Publications (3)

Publication Number Publication Date
GB0326758D0 GB0326758D0 (en) 2003-12-17
GB2391680A true GB2391680A (en) 2004-02-11
GB2391680B GB2391680B (en) 2005-07-20

Family

ID=9913924

Family Applications (2)

Application Number Title Priority Date Filing Date
GB0110810A Withdrawn GB2375211A (en) 2001-05-02 2001-05-02 Adaptive learning in speech recognition
GB0326758A Expired - Fee Related GB2391680B (en) 2001-05-02 2002-05-02 Adaptive learning of language models for speech recognition

Family Applications Before (1)

Application Number Title Priority Date Filing Date
GB0110810A Withdrawn GB2375211A (en) 2001-05-02 2001-05-02 Adaptive learning in speech recognition

Country Status (2)

Country Link
GB (2) GB2375211A (en)
WO (1) WO2002089112A1 (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE10341305A1 (en) * 2003-09-05 2005-03-31 Daimlerchrysler Ag Intelligent user adaptation in dialog systems
US7899671B2 (en) 2004-02-05 2011-03-01 Avaya, Inc. Recognition results postprocessor for use in voice recognition systems
US7925506B2 (en) 2004-10-05 2011-04-12 Inago Corporation Speech recognition accuracy via concept to keyword mapping
EP2317507B1 (en) * 2004-10-05 2015-07-08 Inago Corporation Corpus compilation for language model generation
US7827032B2 (en) 2005-02-04 2010-11-02 Vocollect, Inc. Methods and systems for adapting a model for a speech recognition system
US7783488B2 (en) 2005-12-19 2010-08-24 Nuance Communications, Inc. Remote tracing and debugging of automatic speech recognition servers by speech reconstruction from cepstra and pitch information
EP2005417A2 (en) * 2006-04-03 2008-12-24 Vocollect, Inc. Methods and systems for optimizing model adaptation for a speech recognition system
US7756708B2 (en) * 2006-04-03 2010-07-13 Google Inc. Automatic language model update
ES2311351B1 (en) * 2006-05-31 2009-12-17 France Telecom España, S.A. METHOD FOR DYNAMICALLY ADAPTING THE ACOUSTIC MODELS OF ACKNOWLEDGMENT OF SPEAKING TO THE USER.
US9508346B2 (en) * 2013-08-28 2016-11-29 Verint Systems Ltd. System and method of automated language model adaptation
CN104681023A (en) * 2015-02-15 2015-06-03 联想(北京)有限公司 Information processing method and electronic equipment
US20180068323A1 (en) * 2016-09-03 2018-03-08 Neustar, Inc. Automated method for learning the responsiveness of potential consumers to different stimuli in a marketplace
WO2020041945A1 (en) 2018-08-27 2020-03-05 Beijing Didi Infinity Technology And Development Co., Ltd. Artificial intelligent systems and methods for displaying destination on mobile device
CN114158283A (en) * 2020-07-08 2022-03-08 谷歌有限责任公司 Recognition and utilization of misrecognition in automatic speech recognition

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6173266B1 (en) * 1997-05-06 2001-01-09 Speechworks International, Inc. System and method for developing interactive speech applications
WO2001026093A1 (en) * 1999-10-05 2001-04-12 One Voice Technologies, Inc. Interactive user interface using speech recognition and natural language processing
WO2001050453A2 (en) * 2000-01-04 2001-07-12 Heyanita, Inc. Interactive voice response system

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0241170B1 (en) * 1986-03-28 1992-05-27 AT&T Corp. Adaptive speech feature signal generation arrangement
US6026359A (en) * 1996-09-20 2000-02-15 Nippon Telegraph And Telephone Corporation Scheme for model adaptation in pattern recognition based on Taylor expansion
DE19708183A1 (en) * 1997-02-28 1998-09-03 Philips Patentverwaltung Method for speech recognition with language model adaptation
EP1426923B1 (en) * 1998-12-17 2006-03-29 Sony Deutschland GmbH Semi-supervised speaker adaptation
JP2001100781A (en) * 1999-09-30 2001-04-13 Sony Corp Method and device for voice processing and recording medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6173266B1 (en) * 1997-05-06 2001-01-09 Speechworks International, Inc. System and method for developing interactive speech applications
WO2001026093A1 (en) * 1999-10-05 2001-04-12 One Voice Technologies, Inc. Interactive user interface using speech recognition and natural language processing
WO2001050453A2 (en) * 2000-01-04 2001-07-12 Heyanita, Inc. Interactive voice response system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
IEEE Transactions on speech and audio processing, Jan 2000, Vol 8, no 1, pages 3 to 10, G Riccardi et al, "Stochastic language adaption over time and state in natural spoken dialog systems" *

Also Published As

Publication number Publication date
GB0326758D0 (en) 2003-12-17
GB2375211A (en) 2002-11-06
WO2002089112A1 (en) 2002-11-07
GB2391680B (en) 2005-07-20
GB0110810D0 (en) 2001-06-27

Similar Documents

Publication Publication Date Title
US8065144B1 (en) Multilingual speech recognition
US7228275B1 (en) Speech recognition system having multiple speech recognizers
US6208964B1 (en) Method and apparatus for providing unsupervised adaptation of transcriptions
US7826945B2 (en) Automobile speech-recognition interface
GB2391680A (en) Adaptive learning of language models for speech recognition
CA2387079C (en) Natural language interface control system
EP1450349B1 (en) Vehicle-mounted control apparatus and program that causes computer to execute method of providing guidance on the operation of the vehicle-mounted control apparatus
KR100826875B1 (en) On-line speaker recognition method and apparatus for thereof
US20020087306A1 (en) Computer-implemented noise normalization method and system
US20080103781A1 (en) Automatically adapting user guidance in automated speech recognition
EP0398574A3 (en) Speech recognition employing key word modeling and non-key word modeling
US7912727B2 (en) Apparatus and method for integrated phrase-based and free-form speech-to-speech translation
EP0984428A3 (en) Method and system for automatically determining phonetic transciptions associated with spelled words
EP0953967A3 (en) An automated hotel attendant using speech recognition
JP2001503154A (en) Hidden Markov Speech Model Fitting Method in Speech Recognition System
JPS59225635A (en) Ultranarrow band communication system
EP1525577B1 (en) Method for automatic speech recognition
DE69623364T2 (en) Device for recognizing continuously spoken language
ATE391984T1 (en) AUTOMATIC RETRAINING OF A VOICE RECOGNITION SYSTEM
EP0949606A3 (en) Method and system for speech recognition based on phonetic transcriptions
AU2002233238A1 (en) Mobile terminal controllable by spoken utterances
US20030163309A1 (en) Speech dialogue system
JP4796686B2 (en) How to train an automatic speech recognizer
Juang et al. Deployable automatic speech recognition systems: Advances and challenges
GB2394590A (en) System and method for speech verification using a robust confidence measure

Legal Events

Date Code Title Description
PCNP Patent ceased through non-payment of renewal fee

Effective date: 20200502

732E Amendments to the register in respect of changes of name or changes affecting rights (sect. 32/1977)

Free format text: REGISTERED BETWEEN 20210826 AND 20210901