DE69811921T2 - Vorrichtung und verfahren zur unterscheidung von ähnlich klingenden wörtern in der spracherkennung - Google Patents

Vorrichtung und verfahren zur unterscheidung von ähnlich klingenden wörtern in der spracherkennung

Info

Publication number: DE69811921T2
Authority: DE; Germany
Prior art keywords: distinating; similar; voice recognition; sounding words; hints
Prior art date: 1997-09-24
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Expired - Lifetime

Application number

DE69811921T

Other languages

English (en)

Other versions

DE69811921D1 (de

Inventor

Koen Schoofs

Guido Gallopyn

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

Lernout and Hauspie Speech Products NV

Original Assignee

Lernout and Hauspie Speech Products NV

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

1997-09-24

Filing date

1998-09-24

Publication date

2003-11-13

1998-09-24 Application filed by Lernout and Hauspie Speech Products NV filed Critical Lernout and Hauspie Speech Products NV

2003-04-10 Publication of DE69811921D1 publication Critical patent/DE69811921D1/de

2003-11-13 Application granted granted Critical

2003-11-13 Publication of DE69811921T2 publication Critical patent/DE69811921T2/de

2018-09-25 Anticipated expiration legal-status Critical

Status Expired - Lifetime legal-status Critical Current

Links

230000000877 morphologic effect Effects 0.000 abstract 1

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1815—Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0631—Creating reference templates; Clustering
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics

Landscapes

Engineering & Computer Science (AREA)
Artificial Intelligence (AREA)
Computational Linguistics (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Machine Translation (AREA)
Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

DE69811921T 1997-09-24 1998-09-24 Vorrichtung und verfahren zur unterscheidung von ähnlich klingenden wörtern in der spracherkennung Expired - Lifetime DE69811921T2 (de)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
US5989597P	1997-09-24	1997-09-24
PCT/IB1998/001717 WO1999016051A1 (en)	1997-09-24	1998-09-24	Apparatus and method for distinguishing similar-sounding utterances in speech recognition

Publications (2)

Publication Number	Publication Date
DE69811921D1 DE69811921D1 (de)	2003-04-10
DE69811921T2 true DE69811921T2 (de)	2003-11-13

Family

ID=22025979

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
DE69811921T Expired - Lifetime DE69811921T2 (de)	1997-09-24	1998-09-24	Vorrichtung und verfahren zur unterscheidung von ähnlich klingenden wörtern in der spracherkennung

Country Status (8)

Country	Link
US (1)	US6487532B1 (de)
EP (1)	EP1018109B1 (de)
JP (1)	JP2001517815A (de)
AT (1)	ATE233935T1 (de)
AU (1)	AU9455498A (de)
CA (1)	CA2303312A1 (de)
DE (1)	DE69811921T2 (de)
WO (1)	WO1999016051A1 (de)

Families Citing this family (45)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
CN1343337B (zh) *	1999-03-05	2013-03-20	佳能株式会社	用于产生包括音素数据和解码的字的注释数据的方法和设备
US6882970B1 (en)	1999-10-28	2005-04-19	Canon Kabushiki Kaisha	Language recognition using sequence frequency
WO2001031627A2 (en) *	1999-10-28	2001-05-03	Canon Kabushiki Kaisha	Pattern matching method and apparatus
US7310600B1 (en)	1999-10-28	2007-12-18	Canon Kabushiki Kaisha	Language recognition using a similarity measure
JP2001249684A (ja) *	2000-03-02	2001-09-14	Sony Corp	音声認識装置および音声認識方法、並びに記録媒体
GB0011798D0 (en) *	2000-05-16	2000-07-05	Canon Kk	Database annotation and retrieval
GB0015233D0 (en) *	2000-06-21	2000-08-16	Canon Kk	Indexing method and apparatus
ATE326754T1 (de) *	2000-09-18	2006-06-15	L & H Holdings Usa Inc	Homophonewahl in der spracherkennung
GB0023930D0 (en)	2000-09-29	2000-11-15	Canon Kk	Database annotation and retrieval
US7085716B1 (en)	2000-10-26	2006-08-01	Nuance Communications, Inc.	Speech recognition using word-in-phrase command
GB0027178D0 (en)	2000-11-07	2000-12-27	Canon Kk	Speech processing system
GB0028277D0 (en)	2000-11-20	2001-01-03	Canon Kk	Speech processing system
US6973427B2 (en) *	2000-12-26	2005-12-06	Microsoft Corporation	Method for adding phonetic descriptions to a speech recognition lexicon
US6934683B2 (en) *	2001-01-31	2005-08-23	Microsoft Corporation	Disambiguation language model
US7729913B1 (en) *	2003-03-18	2010-06-01	A9.Com, Inc.	Generation and selection of voice recognition grammars for conducting database searches
US7676364B2 (en)	2004-03-25	2010-03-09	Ashwin Rao	System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode
US20060136195A1 (en) *	2004-12-22	2006-06-22	International Business Machines Corporation	Text grouping for disambiguation in a speech application
US7865362B2 (en) *	2005-02-04	2011-01-04	Vocollect, Inc.	Method and system for considering information about an expected response when performing speech recognition
US7827032B2 (en) *	2005-02-04	2010-11-02	Vocollect, Inc.	Methods and systems for adapting a model for a speech recognition system
US7895039B2 (en)	2005-02-04	2011-02-22	Vocollect, Inc.	Methods and systems for optimizing model adaptation for a speech recognition system
US7949533B2 (en) *	2005-02-04	2011-05-24	Vococollect, Inc.	Methods and systems for assessing and improving the performance of a speech recognition system
US8200495B2 (en)	2005-02-04	2012-06-12	Vocollect, Inc.	Methods and systems for considering information about an expected response when performing speech recognition
US8170875B2 (en)	2005-06-15	2012-05-01	Qnx Software Systems Limited	Speech end-pointer
US8311819B2 (en) *	2005-06-15	2012-11-13	Qnx Software Systems Limited	System for detecting speech with background voice estimates and noise estimates
JP4734155B2 (ja)	2006-03-24	2011-07-27	株式会社東芝	音声認識装置、音声認識方法および音声認識プログラム
US8374862B2 (en) *	2006-08-30	2013-02-12	Research In Motion Limited	Method, software and device for uniquely identifying a desired contact in a contacts database based on a single utterance
US7831431B2 (en)	2006-10-31	2010-11-09	Honda Motor Co., Ltd.	Voice recognition updates via remote broadcast signal
US9830912B2 (en)	2006-11-30	2017-11-28	Ashwin P Rao	Speak and touch auto correction interface
US7844456B2 (en) *	2007-03-09	2010-11-30	Microsoft Corporation	Grammar confusability metric for speech recognition
US8536976B2 (en) *	2008-06-11	2013-09-17	Veritrix, Inc.	Single-channel multi-factor authentication
US8166297B2 (en) *	2008-07-02	2012-04-24	Veritrix, Inc.	Systems and methods for controlling access to encrypted data stored on a mobile device
US9922640B2 (en)	2008-10-17	2018-03-20	Ashwin P Rao	System and method for multimodal utterance detection
EP2353125A4 (de) *	2008-11-03	2013-06-12	Veritrix Inc	Benutzerauthentifizierung für soziale netzwerke
JP5533042B2 (ja) *	2010-03-04	2014-06-25	富士通株式会社	音声検索装置、音声検索方法、プログラム及び記録媒体
US8812321B2 (en) *	2010-09-30	2014-08-19	At&T Intellectual Property I, L.P.	System and method for combining speech recognition outputs from a plurality of domain-specific speech recognizers via machine learning
KR101828273B1 (ko) *	2011-01-04	2018-02-14	삼성전자주식회사	결합기반의 음성명령 인식 장치 및 그 방법
US8914290B2 (en)	2011-05-20	2014-12-16	Vocollect, Inc.	Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
US8560310B1 (en) *	2012-05-08	2013-10-15	Nuance Communications, Inc.	Method and apparatus providing improved voice activated functions
US10957310B1 (en)	2012-07-23	2021-03-23	Soundhound, Inc.	Integrated programming framework for speech and text understanding with meaning parsing
US8977555B2 (en)	2012-12-20	2015-03-10	Amazon Technologies, Inc.	Identification of utterance subjects
US9978395B2 (en)	2013-03-15	2018-05-22	Vocollect, Inc.	Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
US9202459B2 (en) *	2013-04-19	2015-12-01	GM Global Technology Operations LLC	Methods and systems for managing dialog of speech systems
US11295730B1 (en)	2014-02-27	2022-04-05	Soundhound, Inc.	Using phonetic variants in a local context to improve natural language understanding
US10714121B2 (en)	2016-07-27	2020-07-14	Vocollect, Inc.	Distinguishing user speech from background speech in speech-dense environments
US20180358004A1 (en) *	2017-06-07	2018-12-13	Lenovo (Singapore) Pte. Ltd.	Apparatus, method, and program product for spelling words

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
EP0302614B1 (de) *	1987-07-16	1993-03-10	Fujitsu Limited	Spracherkennungseinrichtung
US5146405A (en) *	1988-02-05	1992-09-08	At&T Bell Laboratories	Methods for part-of-speech determination and usage
US5054074A (en) *	1989-03-02	1991-10-01	International Business Machines Corporation	Optimized speech recognition system and method
GB9511855D0 (en)	1995-06-12	1995-08-09	Dragon Syst Uk Ltd	Speech recognition apparatus and methods
US5828991A (en) *	1995-06-30	1998-10-27	The Research Foundation Of The State University Of New York	Sentence reconstruction using word ambiguity resolution
US5903864A (en) *	1995-08-30	1999-05-11	Dragon Systems	Speech recognition
US5752230A (en) *	1996-08-20	1998-05-12	Ncr Corporation	Method and apparatus for identifying names with a speech recognition program

1998
- 1998-09-24 DE DE69811921T patent/DE69811921T2/de not_active Expired - Lifetime
- 1998-09-24 EP EP98947737A patent/EP1018109B1/de not_active Expired - Lifetime
- 1998-09-24 AU AU94554/98A patent/AU9455498A/en not_active Abandoned
- 1998-09-24 WO PCT/IB1998/001717 patent/WO1999016051A1/en active IP Right Grant
- 1998-09-24 US US09/159,838 patent/US6487532B1/en not_active Expired - Lifetime
- 1998-09-24 CA CA002303312A patent/CA2303312A1/en not_active Abandoned
- 1998-09-24 AT AT98947737T patent/ATE233935T1/de not_active IP Right Cessation
- 1998-09-24 JP JP2000513269A patent/JP2001517815A/ja not_active Withdrawn

Also Published As

Publication number	Publication date
AU9455498A (en)	1999-04-12
ATE233935T1 (de)	2003-03-15
DE69811921D1 (de)	2003-04-10
WO1999016051A1 (en)	1999-04-01
US6487532B1 (en)	2002-11-26
EP1018109B1 (de)	2003-03-05
JP2001517815A (ja)	2001-10-09
CA2303312A1 (en)	1999-04-01
EP1018109A1 (de)	2000-07-12

Legal Events

Date	Code	Title	Description
2004-04-01	8364	No opposition during term of opposition

Publication	Publication Date	Title
DE69811921D1 (de)	2003-04-10	Vorrichtung und verfahren zur unterscheidung von ähnlich klingenden wörtern in der spracherkennung
DE69937176D1 (de)	2007-11-08	Segmentierungsverfahren zur Erweiterung des aktiven Vokabulars von Spracherkennern
ATE349751T1 (de)	2007-01-15	System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen
DE69725802D1 (de)	2003-12-04	Vorfilterung mittels lexikalischer Bäumen für die Spracherkennung
DE69127961T2 (de)	1998-03-05	Verfahren zur Spracherkennung
DE69513314T2 (de)	2000-05-25	Vorrichtung zur Erzeugung von Applaus für Karaoke-Singstimmen
DE69717899D1 (de)	2003-01-30	Verfahren und Vorrichtung zur Spracherkennung
DE69828141D1 (de)	2005-01-20	Verfahren und Vorrichtung zur Spracherkennung
DE69806557D1 (de)	2002-08-22	Verfahren und Vorrichtung zur Spracherkennung
DE69726235D1 (de)	2003-12-24	Verfahren und Vorrichtung zur Spracherkennung
ATE220473T1 (de)	2002-07-15	System, verfahren und programmdatenträger zur darstellung komplexer informationen als klang
DE69518705D1 (de)	2000-10-12	Verfahren und Vorrichtung zur Spracherkennung
DE69524829D1 (de)	2002-02-07	Verfahren und Vorrichtung zur Spracherkennung
DE59707384D1 (de)	2002-07-11	Verfahren und Vorrichtung zur Spracherkennung
DE69321656D1 (de)	1998-11-26	Verfahren zur Spracherkennung
ATE253763T1 (de)	2003-11-15	Verfahren zur spracherkennung
DE69830017D1 (de)	2005-06-09	Verfahren und Vorrichtung zur Spracherkennung
DK0749109T3 (da)	2002-03-25	Talegenkendelse for tonesprog
DE69607913D1 (de)	2000-05-31	Verfahren und vorrichtung zur spracherkennung auf der basis neuer wortmodelle
DE60228716D1 (de)	2008-10-16	Verfahren zum bereitstellen von kontoinformation und system zum aufschreiben von diktiertem text
DE69613293T2 (de)	2002-05-02	Vorrichtung zur Musteranpassung für Sprach- oder Mustererkennung
DE69419223T2 (de)	2000-07-06	Vorrichtung zur Sprachverbesserung
DE69517829T2 (de)	2001-03-08	Vorrichtung und Verfahren zur Spracherkennung
DE3767757D1 (de)	1991-03-07	Einrichtung und verfahren zur spracherkennung.
DE3774200D1 (de)	1991-12-05	Vorrichtung und verfahren zur spracherkennung.