HK1095013A1 - Learning in automatic speech recognition - Google Patents

Learning in automatic speech recognition

Info

Publication number
HK1095013A1
HK1095013A1 HK07101337.3A HK07101337A HK1095013A1 HK 1095013 A1 HK1095013 A1 HK 1095013A1 HK 07101337 A HK07101337 A HK 07101337A HK 1095013 A1 HK1095013 A1 HK 1095013A1
Authority
HK
Hong Kong
Prior art keywords
learning
speech recognition
automatic speech
automatic
recognition
Prior art date
Application number
HK07101337.3A
Other languages
English (en)
Inventor
Dilek Z Hakkani-Tur
Mazin G Rahim
Gokhan Tur
Giuseppe Riccardi
Original Assignee
At & T Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by At & T Corp filed Critical At & T Corp
Publication of HK1095013A1 publication Critical patent/HK1095013A1/xx

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0638Interactive procedures

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Machine Translation (AREA)
  • Electrically Operated Instructional Devices (AREA)
HK07101337.3A 2005-02-23 2007-02-05 Learning in automatic speech recognition HK1095013A1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US11/063,910 US8818808B2 (en) 2005-02-23 2005-02-23 Unsupervised and active learning in automatic speech recognition for call classification

Publications (1)

Publication Number Publication Date
HK1095013A1 true HK1095013A1 (en) 2007-04-20

Family

ID=36263105

Family Applications (1)

Application Number Title Priority Date Filing Date
HK07101337.3A HK1095013A1 (en) 2005-02-23 2007-02-05 Learning in automatic speech recognition

Country Status (5)

Country Link
US (3) US8818808B2 (fr)
EP (1) EP1696421B1 (fr)
CA (1) CA2537503A1 (fr)
DE (1) DE602006002287D1 (fr)
HK (1) HK1095013A1 (fr)

Families Citing this family (44)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8818808B2 (en) * 2005-02-23 2014-08-26 At&T Intellectual Property Ii, L.P. Unsupervised and active learning in automatic speech recognition for call classification
US7599861B2 (en) 2006-03-02 2009-10-06 Convergys Customer Management Group, Inc. System and method for closed loop decisionmaking in an automated care system
US7752152B2 (en) * 2006-03-17 2010-07-06 Microsoft Corporation Using predictive user models for language modeling on a personal device with user behavior models based on statistical modeling
US8032375B2 (en) * 2006-03-17 2011-10-04 Microsoft Corporation Using generic predictive models for slot values in language modeling
US7689420B2 (en) 2006-04-06 2010-03-30 Microsoft Corporation Personalizing a context-free grammar using a dictation language model
US8379830B1 (en) 2006-05-22 2013-02-19 Convergys Customer Management Delaware Llc System and method for automated customer service with contingent live interaction
US7809663B1 (en) 2006-05-22 2010-10-05 Convergys Cmg Utah, Inc. System and method for supporting the utilization of machine language
US9299345B1 (en) * 2006-06-20 2016-03-29 At&T Intellectual Property Ii, L.P. Bootstrapping language models for spoken dialog systems using the world wide web
US8521510B2 (en) * 2006-08-31 2013-08-27 At&T Intellectual Property Ii, L.P. Method and system for providing an automated web transcription service
US9244901B1 (en) * 2007-02-12 2016-01-26 West Corporation Automatic speech tagging system and method thereof
US8086549B2 (en) * 2007-11-09 2011-12-27 Microsoft Corporation Multi-label active learning
US8781833B2 (en) * 2008-07-17 2014-07-15 Nuance Communications, Inc. Speech recognition semantic classification training
US9218807B2 (en) * 2010-01-08 2015-12-22 Nuance Communications, Inc. Calibration of a speech recognition engine using validated text
US8645136B2 (en) * 2010-07-20 2014-02-04 Intellisist, Inc. System and method for efficiently reducing transcription error using hybrid voice transcription
US8606575B1 (en) * 2011-09-06 2013-12-10 West Corporation Method and apparatus of providing semi-automated classifier adaptation for natural language processing
US9129606B2 (en) * 2011-09-23 2015-09-08 Microsoft Technology Licensing, Llc User query history expansion for improving language model adaptation
US8374865B1 (en) * 2012-04-26 2013-02-12 Google Inc. Sampling training data for an automatic speech recognition system based on a benchmark classification distribution
US9292492B2 (en) * 2013-02-04 2016-03-22 Microsoft Technology Licensing, Llc Scaling statistical language understanding systems across domains and intents
US10235358B2 (en) 2013-02-21 2019-03-19 Microsoft Technology Licensing, Llc Exploiting structured content for unsupervised natural language semantic parsing
US20140365218A1 (en) * 2013-06-07 2014-12-11 Microsoft Corporation Language model adaptation using result selection
US20150006148A1 (en) * 2013-06-27 2015-01-01 Microsoft Corporation Automatically Creating Training Data For Language Identifiers
US20150088511A1 (en) * 2013-09-24 2015-03-26 Verizon Patent And Licensing Inc. Named-entity based speech recognition
US10073840B2 (en) 2013-12-20 2018-09-11 Microsoft Technology Licensing, Llc Unsupervised relation detection model training
US9870356B2 (en) 2014-02-13 2018-01-16 Microsoft Technology Licensing, Llc Techniques for inferring the unknown intents of linguistic items
US20150254233A1 (en) * 2014-03-06 2015-09-10 Nice-Systems Ltd Text-based unsupervised learning of language models
JP6362893B2 (ja) * 2014-03-20 2018-07-25 株式会社東芝 モデル更新装置及びモデル更新方法
JP5932869B2 (ja) * 2014-03-27 2016-06-08 インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Machines Corporation N−gram言語モデルの教師無し学習方法、学習装置、および学習プログラム
US10108608B2 (en) * 2014-06-12 2018-10-23 Microsoft Technology Licensing, Llc Dialog state tracking using web-style ranking and multiple language understanding engines
US9905224B2 (en) 2015-06-11 2018-02-27 Nice Ltd. System and method for automatic language model generation
US11062228B2 (en) 2015-07-06 2021-07-13 Microsoft Technoiogy Licensing, LLC Transfer learning techniques for disparate label sets
US9858923B2 (en) * 2015-09-24 2018-01-02 Intel Corporation Dynamic adaptation of language models and semantic tracking for automatic speech recognition
CN106683677B (zh) 2015-11-06 2021-11-12 阿里巴巴集团控股有限公司 语音识别方法及装置
US10847152B2 (en) 2017-03-28 2020-11-24 Samsung Electronics Co., Ltd. Method for operating speech recognition service, electronic device and system supporting the same
US10474967B2 (en) 2017-05-23 2019-11-12 International Business Machines Corporation Conversation utterance labeling
US10885900B2 (en) 2017-08-11 2021-01-05 Microsoft Technology Licensing, Llc Domain adaptation in speech recognition via teacher-student learning
US10269376B1 (en) * 2018-06-28 2019-04-23 Invoca, Inc. Desired signal spotting in noisy, flawed environments
KR102281590B1 (ko) 2019-07-31 2021-07-29 엘지전자 주식회사 음성인식 성능 향상을 위한 비 지도 가중치 적용 학습 시스템 및 방법, 그리고 기록 매체
US11158322B2 (en) 2019-09-06 2021-10-26 Verbit Software Ltd. Human resolution of repeated phrases in a hybrid transcription system
US11373045B2 (en) * 2019-09-24 2022-06-28 ContactEngine Limited Determining context and intent in omnichannel communications using machine learning based artificial intelligence (AI) techniques
US10832656B1 (en) 2020-02-25 2020-11-10 Fawzi Shaya Computing device and method for populating digital forms from un-parsed data
US11741307B2 (en) * 2020-07-06 2023-08-29 Samsung Electronics Co., Ltd. System and method for learning new concepts from input utterances
US11630958B2 (en) * 2021-06-02 2023-04-18 Microsoft Technology Licensing, Llc Determining topic labels for communication transcripts based on a trained generative summarization model
US20230101424A1 (en) * 2021-09-30 2023-03-30 Uniphore Technologies Inc. Method and apparatus for active learning based call categorization
CN113993034B (zh) * 2021-11-18 2023-04-07 厦门理工学院 一种用于话筒的指向性音响传播方法及***

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6122613A (en) 1997-01-30 2000-09-19 Dragon Systems, Inc. Speech recognition using multiple recognizers (selectively) applied to the same input sample
US6453307B1 (en) 1998-03-03 2002-09-17 At&T Corp. Method and apparatus for multi-class, multi-label information categorization
EP1088299A2 (fr) 1999-03-26 2001-04-04 Scansoft, Inc. Reconnaissance vocale client-serveur
DE10138408A1 (de) 2001-08-04 2003-02-20 Philips Corp Intellectual Pty Verfahren zur Unterstützung des Korrekturlesens eines spracherkannten Textes mit an die Erkennungszuverlässigkeit angepasstem Wiedergabegeschwindigkeitsverlauf
US7257530B2 (en) * 2002-02-27 2007-08-14 Hongfeng Yin Method and system of knowledge based search engine using text mining
US20040138885A1 (en) 2003-01-09 2004-07-15 Xiaofan Lin Commercial automatic speech recognition engine combinations
US20060025995A1 (en) * 2004-07-29 2006-02-02 Erhart George W Method and apparatus for natural language call routing using confidence scores
US8818808B2 (en) * 2005-02-23 2014-08-26 At&T Intellectual Property Ii, L.P. Unsupervised and active learning in automatic speech recognition for call classification

Also Published As

Publication number Publication date
EP1696421A3 (fr) 2007-02-28
US8818808B2 (en) 2014-08-26
DE602006002287D1 (de) 2008-10-02
US20160027434A1 (en) 2016-01-28
US20060190253A1 (en) 2006-08-24
EP1696421A2 (fr) 2006-08-30
US9666182B2 (en) 2017-05-30
US20150046159A1 (en) 2015-02-12
US9159318B2 (en) 2015-10-13
CA2537503A1 (fr) 2006-08-23
EP1696421B1 (fr) 2008-08-20

Similar Documents

Publication Publication Date Title
HK1095013A1 (en) Learning in automatic speech recognition
EP2092514A4 (fr) Selection de contenu par reconnaissance de la parole
EP1771840A4 (fr) Pointeur terminal vocal
GB2457855B (en) Speech recognition system and speech recognition system program
GB0506528D0 (en) System and method for automatic speech recognition
GB2409750B (en) Speech recognition system and technique
EP1818909A4 (fr) Système de reconnaissance vocale
GB0719453D0 (en) Automatic speech recognition method and apparatus
EP2171710A4 (fr) Décomposition en carreaux de données de reconnaissance automatique de la parole (asr)
HK1135225A1 (en) Voice recognition device
EP1691344A4 (fr) Dispositif de reconnaissance vocale
GB2435758B (en) A Speech recognition circuit and method
SG119357A1 (en) Mixed-lingual text to speech
IL196017A0 (en) Two tiered text recognition
GB0513820D0 (en) Distributed voice recognition system and method
EP1880254A4 (fr) Analyse biométrique multimodale
EP1949260A4 (fr) Elagage d'index vocal
EP2260264A4 (fr) Sélection de grammaire par reconnaissance vocale basée sur le contexte
GB0616070D0 (en) Speech Recognition Feedback
EP1732063A4 (fr) Dispositif de reconnaissance vocale et procédé de reconnaissance vocale
DE602005000896D1 (de) Sprachsegmentierung
GB2409560B (en) Interactive speech recognition model
GB2467067B (en) VSP pattern recognition in absolute time
EP1889464A4 (fr) Systeme de commande a reconnaissance vocale
EP1939585A4 (fr) Dispositif de reconnaissance d'objet

Legal Events

Date Code Title Description
PC Patent ceased (i.e. patent has lapsed due to the failure to pay the renewal fee)

Effective date: 20220219