HK1095013A1 - Learning in automatic speech recognition
- Publication number
- HK1095013A1
- Authority
- HK
- Hong Kong
- Prior art keywords
- learning
- speech recognition
- automatic speech
- automatic
- recognition
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0638—Interactive procedures
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Machine Translation (AREA)
- Electrically Operated Instructional Devices (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/063,910 US8818808B2 (en) | 2005-02-23 | 2005-02-23 | Unsupervised and active learning in automatic speech recognition for call classification |
Publications (1)
Publication Number | Publication Date |
---|---|
HK1095013A1 (en) | 2007-04-20 |
Family
ID=36263105
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
HK07101337.3A HK1095013A1 (en) | 2005-02-23 | 2007-02-05 | Learning in automatic speech recognition |
Country Status (5)
Country | Link |
---|---|
US (3) | US8818808B2 (fr) |
EP (1) | EP1696421B1 (fr) |
CA (1) | CA2537503A1 (fr) |
DE (1) | DE602006002287D1 (fr) |
HK (1) | HK1095013A1 (fr) |
Families Citing this family (44)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8818808B2 (en) * | 2005-02-23 | 2014-08-26 | At&T Intellectual Property Ii, L.P. | Unsupervised and active learning in automatic speech recognition for call classification |
US7599861B2 (en) | 2006-03-02 | 2009-10-06 | Convergys Customer Management Group, Inc. | System and method for closed loop decisionmaking in an automated care system |
US7752152B2 (en) * | 2006-03-17 | 2010-07-06 | Microsoft Corporation | Using predictive user models for language modeling on a personal device with user behavior models based on statistical modeling |
US8032375B2 (en) * | 2006-03-17 | 2011-10-04 | Microsoft Corporation | Using generic predictive models for slot values in language modeling |
US7689420B2 (en) | 2006-04-06 | 2010-03-30 | Microsoft Corporation | Personalizing a context-free grammar using a dictation language model |
US8379830B1 (en) | 2006-05-22 | 2013-02-19 | Convergys Customer Management Delaware Llc | System and method for automated customer service with contingent live interaction |
US7809663B1 (en) | 2006-05-22 | 2010-10-05 | Convergys Cmg Utah, Inc. | System and method for supporting the utilization of machine language |
US9299345B1 (en) * | 2006-06-20 | 2016-03-29 | At&T Intellectual Property Ii, L.P. | Bootstrapping language models for spoken dialog systems using the world wide web |
US8521510B2 (en) * | 2006-08-31 | 2013-08-27 | At&T Intellectual Property Ii, L.P. | Method and system for providing an automated web transcription service |
US9244901B1 (en) * | 2007-02-12 | 2016-01-26 | West Corporation | Automatic speech tagging system and method thereof |
US8086549B2 (en) * | 2007-11-09 | 2011-12-27 | Microsoft Corporation | Multi-label active learning |
US8781833B2 (en) * | 2008-07-17 | 2014-07-15 | Nuance Communications, Inc. | Speech recognition semantic classification training |
US9218807B2 (en) * | 2010-01-08 | 2015-12-22 | Nuance Communications, Inc. | Calibration of a speech recognition engine using validated text |
US8645136B2 (en) * | 2010-07-20 | 2014-02-04 | Intellisist, Inc. | System and method for efficiently reducing transcription error using hybrid voice transcription |
US8606575B1 (en) * | 2011-09-06 | 2013-12-10 | West Corporation | Method and apparatus of providing semi-automated classifier adaptation for natural language processing |
US9129606B2 (en) * | 2011-09-23 | 2015-09-08 | Microsoft Technology Licensing, Llc | User query history expansion for improving language model adaptation |
US8374865B1 (en) * | 2012-04-26 | 2013-02-12 | Google Inc. | Sampling training data for an automatic speech recognition system based on a benchmark classification distribution |
US9292492B2 (en) * | 2013-02-04 | 2016-03-22 | Microsoft Technology Licensing, Llc | Scaling statistical language understanding systems across domains and intents |
US10235358B2 (en) | 2013-02-21 | 2019-03-19 | Microsoft Technology Licensing, Llc | Exploiting structured content for unsupervised natural language semantic parsing |
US20140365218A1 (en) * | 2013-06-07 | 2014-12-11 | Microsoft Corporation | Language model adaptation using result selection |
US20150006148A1 (en) * | 2013-06-27 | 2015-01-01 | Microsoft Corporation | Automatically Creating Training Data For Language Identifiers |
US20150088511A1 (en) * | 2013-09-24 | 2015-03-26 | Verizon Patent And Licensing Inc. | Named-entity based speech recognition |
US10073840B2 (en) | 2013-12-20 | 2018-09-11 | Microsoft Technology Licensing, Llc | Unsupervised relation detection model training |
US9870356B2 (en) | 2014-02-13 | 2018-01-16 | Microsoft Technology Licensing, Llc | Techniques for inferring the unknown intents of linguistic items |
US20150254233A1 (en) * | 2014-03-06 | 2015-09-10 | Nice-Systems Ltd | Text-based unsupervised learning of language models |
JP6362893B2 (ja) * | 2014-03-20 | 2018-07-25 | Toshiba Corporation | Model update device and model update method |
JP5932869B2 (ja) * | 2014-03-27 | 2016-06-08 | International Business Machines Corporation | Unsupervised learning method, learning apparatus, and learning program for an N-gram language model |
US10108608B2 (en) * | 2014-06-12 | 2018-10-23 | Microsoft Technology Licensing, Llc | Dialog state tracking using web-style ranking and multiple language understanding engines |
US9905224B2 (en) | 2015-06-11 | 2018-02-27 | Nice Ltd. | System and method for automatic language model generation |
US11062228B2 (en) | 2015-07-06 | 2021-07-13 | Microsoft Technology Licensing, LLC | Transfer learning techniques for disparate label sets |
US9858923B2 (en) * | 2015-09-24 | 2018-01-02 | Intel Corporation | Dynamic adaptation of language models and semantic tracking for automatic speech recognition |
CN106683677B (zh) | 2015-11-06 | 2021-11-12 | Alibaba Group Holding Limited | Speech recognition method and device |
US10847152B2 (en) | 2017-03-28 | 2020-11-24 | Samsung Electronics Co., Ltd. | Method for operating speech recognition service, electronic device and system supporting the same |
US10474967B2 (en) | 2017-05-23 | 2019-11-12 | International Business Machines Corporation | Conversation utterance labeling |
US10885900B2 (en) | 2017-08-11 | 2021-01-05 | Microsoft Technology Licensing, Llc | Domain adaptation in speech recognition via teacher-student learning |
US10269376B1 (en) * | 2018-06-28 | 2019-04-23 | Invoca, Inc. | Desired signal spotting in noisy, flawed environments |
KR102281590B1 (ko) | 2019-07-31 | 2021-07-29 | LG Electronics Inc. | Unsupervised weighted learning system and method for improving speech recognition performance, and recording medium |
US11158322B2 (en) | 2019-09-06 | 2021-10-26 | Verbit Software Ltd. | Human resolution of repeated phrases in a hybrid transcription system |
US11373045B2 (en) * | 2019-09-24 | 2022-06-28 | ContactEngine Limited | Determining context and intent in omnichannel communications using machine learning based artificial intelligence (AI) techniques |
US10832656B1 (en) | 2020-02-25 | 2020-11-10 | Fawzi Shaya | Computing device and method for populating digital forms from un-parsed data |
US11741307B2 (en) * | 2020-07-06 | 2023-08-29 | Samsung Electronics Co., Ltd. | System and method for learning new concepts from input utterances |
US11630958B2 (en) * | 2021-06-02 | 2023-04-18 | Microsoft Technology Licensing, Llc | Determining topic labels for communication transcripts based on a trained generative summarization model |
US20230101424A1 (en) * | 2021-09-30 | 2023-03-30 | Uniphore Technologies Inc. | Method and apparatus for active learning based call categorization |
CN113993034B (zh) * | 2021-11-18 | 2023-04-07 | Xiamen University of Technology | Directional sound propagation method and *** for a microphone |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6122613A (en) | 1997-01-30 | 2000-09-19 | Dragon Systems, Inc. | Speech recognition using multiple recognizers (selectively) applied to the same input sample |
US6453307B1 (en) | 1998-03-03 | 2002-09-17 | At&T Corp. | Method and apparatus for multi-class, multi-label information categorization |
EP1088299A2 (fr) | 1999-03-26 | 2001-04-04 | Scansoft, Inc. | Client-server speech recognition |
DE10138408A1 (de) | 2001-08-04 | 2003-02-20 | Philips Corp Intellectual Pty | Method for supporting the proofreading of a speech-recognized text with a playback speed profile adapted to the recognition reliability |
US7257530B2 (en) * | 2002-02-27 | 2007-08-14 | Hongfeng Yin | Method and system of knowledge based search engine using text mining |
US20040138885A1 (en) | 2003-01-09 | 2004-07-15 | Xiaofan Lin | Commercial automatic speech recognition engine combinations |
US20060025995A1 (en) * | 2004-07-29 | 2006-02-02 | Erhart George W | Method and apparatus for natural language call routing using confidence scores |
US8818808B2 (en) * | 2005-02-23 | 2014-08-26 | At&T Intellectual Property Ii, L.P. | Unsupervised and active learning in automatic speech recognition for call classification |
- 2005-02-23: US application US11/063,910, patent US8818808B2 (en), status: Active
- 2006-02-22: CA application CA002537503A, patent CA2537503A1 (fr), status: Abandoned
- 2006-02-23: DE application DE602006002287T, patent DE602006002287D1 (de), status: Active
- 2006-02-23: EP application EP06110328A, patent EP1696421B1 (fr), status: Active
- 2007-02-05: HK application HK07101337.3A, patent HK1095013A1, status: IP Right Cessation
- 2014-08-26: US application US14/468,375, patent US9159318B2 (en), status: Expired - Fee Related
- 2015-10-05: US application US14/874,843, patent US9666182B2 (en), status: Active
Also Published As
Publication number | Publication date |
---|---|
EP1696421A3 (fr) | 2007-02-28 |
US8818808B2 (en) | 2014-08-26 |
DE602006002287D1 (de) | 2008-10-02 |
US20160027434A1 (en) | 2016-01-28 |
US20060190253A1 (en) | 2006-08-24 |
EP1696421A2 (fr) | 2006-08-30 |
US9666182B2 (en) | 2017-05-30 |
US20150046159A1 (en) | 2015-02-12 |
US9159318B2 (en) | 2015-10-13 |
CA2537503A1 (fr) | 2006-08-23 |
EP1696421B1 (fr) | 2008-08-20 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
HK1095013A1 (en) | Learning in automatic speech recognition | |
EP2092514A4 (fr) | Content selection using speech recognition | |
EP1771840A4 (fr) | Speech end-pointer | |
GB2457855B (en) | Speech recognition system and speech recognition system program | |
GB0506528D0 (en) | System and method for automatic speech recognition | |
GB2409750B (en) | Speech recognition system and technique | |
EP1818909A4 (fr) | Speech recognition system | |
GB0719453D0 (en) | Automatic speech recognition method and apparatus | |
EP2171710A4 (fr) | Automatic speech recognition (ASR) data tiling | |
HK1135225A1 (en) | Voice recognition device | |
EP1691344A4 (fr) | Speech recognition device | |
GB2435758B (en) | A Speech recognition circuit and method | |
SG119357A1 (en) | Mixed-lingual text to speech | |
IL196017A0 (en) | Two tiered text recognition | |
GB0513820D0 (en) | Distributed voice recognition system and method | |
EP1880254A4 (fr) | Multimodal biometric analysis | |
EP1949260A4 (fr) | Speech index pruning | |
EP2260264A4 (fr) | Speech recognition grammar selection based on context | |
GB0616070D0 (en) | Speech Recognition Feedback | |
EP1732063A4 (fr) | Speech recognition device and speech recognition method | |
DE602005000896D1 (de) | Speech segmentation | |
GB2409560B (en) | Interactive speech recognition model | |
GB2467067B (en) | VSP pattern recognition in absolute time | |
EP1889464A4 (fr) | Speech recognition control system | |
EP1939585A4 (fr) | Object recognition device | |
Legal Events
Code | Title | Description |
---|---|---|
PC | Patent ceased (i.e. patent has lapsed due to the failure to pay the renewal fee) | Effective date: 2022-02-19 |