ATE257616T1 - Spracherkennungsverfahren - Google Patents

Spracherkennungsverfahren

Info

Publication number
ATE257616T1
ATE257616T1 AT00909156T AT00909156T ATE257616T1 AT E257616 T1 ATE257616 T1 AT E257616T1 AT 00909156 T AT00909156 T AT 00909156T AT 00909156 T AT00909156 T AT 00909156T AT E257616 T1 ATE257616 T1 AT E257616T1
Authority
AT
Austria
Prior art keywords
speech
client
recognizers
voice recognition
recognition method
Prior art date
Application number
AT00909156T
Other languages
English (en)
Inventor
Stefan Besling
Eric Thelen
Meinhard Ullrich
Original Assignee
Koninkl Philips Electronics Nv
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninkl Philips Electronics Nv filed Critical Koninkl Philips Electronics Nv
Application granted granted Critical
Publication of ATE257616T1 publication Critical patent/ATE257616T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Computer And Data Communications (AREA)
  • Information Transfer Between Computers (AREA)
  • Telephonic Communication Services (AREA)
  • Navigation (AREA)
  • Electric Clocks (AREA)
AT00909156T 1999-03-09 2000-02-10 Spracherkennungsverfahren ATE257616T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE19910236A DE19910236A1 (de) 1999-03-09 1999-03-09 Verfahren zur Spracherkennung
PCT/EP2000/001143 WO2000054251A2 (en) 1999-03-09 2000-02-10 Method of speech recognition

Publications (1)

Publication Number Publication Date
ATE257616T1 true ATE257616T1 (de) 2004-01-15

Family

ID=7900179

Family Applications (1)

Application Number Title Priority Date Filing Date
AT00909156T ATE257616T1 (de) 1999-03-09 2000-02-10 Spracherkennungsverfahren

Country Status (9)

Country Link
US (1) US6757655B1 (de)
EP (1) EP1163661B1 (de)
JP (1) JP4597383B2 (de)
KR (1) KR20020003865A (de)
CN (1) CN1343351A (de)
AT (1) ATE257616T1 (de)
AU (1) AU3153700A (de)
DE (2) DE19910236A1 (de)
WO (1) WO2000054251A2 (de)

Families Citing this family (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB9911971D0 (en) * 1999-05-21 1999-07-21 Canon Kk A system, a server for a system and a machine for use in a system
US7330815B1 (en) 1999-10-04 2008-02-12 Globalenglish Corporation Method and system for network-based speech recognition
US6931376B2 (en) * 2000-07-20 2005-08-16 Microsoft Corporation Speech-related event notification system
FI20001918A (fi) * 2000-08-30 2002-03-01 Nokia Corp Monimodaalinen sisältö ja automaattinen puheen tunnistus langattomassa tietoliikennejärjestelmässä
DE60125597T2 (de) * 2000-08-31 2007-05-03 Hitachi, Ltd. Vorrichtung für die Dienstleistungsvermittlung
CN1404603A (zh) * 2000-09-07 2003-03-19 皇家菲利浦电子有限公司 语音控制及加载的用户控制信息
JP2002116796A (ja) * 2000-10-11 2002-04-19 Canon Inc 音声処理装置、音声処理方法及び記憶媒体
JP3326424B2 (ja) 2000-10-23 2002-09-24 株式会社ジー・エフ 電話応答装置、及び電話応答装置で実現する各種の応答機能を記述した各手順ファイルを取得して電話応答する方法
US7610547B2 (en) * 2001-05-04 2009-10-27 Microsoft Corporation Markup language extensions for web enabled recognition
US7409349B2 (en) * 2001-05-04 2008-08-05 Microsoft Corporation Servers for web enabled speech recognition
US7506022B2 (en) * 2001-05-04 2009-03-17 Microsoft.Corporation Web enabled recognition architecture
US7711570B2 (en) * 2001-10-21 2010-05-04 Microsoft Corporation Application abstraction with dialog purpose
US8229753B2 (en) * 2001-10-21 2012-07-24 Microsoft Corporation Web server controls for web enabled recognition and/or audible prompting
US7133829B2 (en) * 2001-10-31 2006-11-07 Dictaphone Corporation Dynamic insertion of a speech recognition engine within a distributed speech recognition system
US7146321B2 (en) * 2001-10-31 2006-12-05 Dictaphone Corporation Distributed speech recognition system
US7292975B2 (en) * 2002-05-01 2007-11-06 Nuance Communications, Inc. Systems and methods for evaluating speaker suitability for automatic speech recognition aided transcription
US7236931B2 (en) * 2002-05-01 2007-06-26 Usb Ag, Stamford Branch Systems and methods for automatic acoustic speaker adaptation in computer-assisted transcription systems
US7260535B2 (en) * 2003-04-28 2007-08-21 Microsoft Corporation Web server controls for web enabled recognition and/or audible prompting for call controls
US7571102B2 (en) * 2003-04-29 2009-08-04 Ford Motor Company Controller for use with a motor vehicle
US20040230637A1 (en) * 2003-04-29 2004-11-18 Microsoft Corporation Application controls for speech enabled recognition
JP2005031758A (ja) * 2003-07-07 2005-02-03 Canon Inc 音声処理装置及び方法
US7552055B2 (en) 2004-01-10 2009-06-23 Microsoft Corporation Dialog component re-use in recognition systems
US8160883B2 (en) * 2004-01-10 2012-04-17 Microsoft Corporation Focus tracking in dialogs
DE602004008887T2 (de) * 2004-05-18 2008-01-17 Alcatel Lucent Verfahren und Server zur Bereitstellung eines multi-modalen Dialogs
KR100695127B1 (ko) 2004-10-08 2007-03-14 삼성전자주식회사 다 단계 음성 인식 장치 및 방법
GB2424560B (en) * 2005-02-15 2009-04-29 David Llewellyn Rees User interface for systems with automatic conversion from text to an acoustic representation
JP5394738B2 (ja) * 2005-08-09 2014-01-22 モバイル・ヴォイス・コントロール・エルエルシー 音声制御型ワイヤレス通信デバイス・システム
US8032372B1 (en) 2005-09-13 2011-10-04 Escription, Inc. Dictation selection
CN101326571B (zh) * 2005-12-07 2012-05-23 三菱电机株式会社 声音识别装置
US8996379B2 (en) 2007-03-07 2015-03-31 Vlingo Corporation Speech recognition text entry for software applications
US10056077B2 (en) 2007-03-07 2018-08-21 Nuance Communications, Inc. Using speech recognition results based on an unstructured language model with a music system
US8949266B2 (en) 2007-03-07 2015-02-03 Vlingo Corporation Multiple web-based content category searching in mobile search application
US8886545B2 (en) * 2007-03-07 2014-11-11 Vlingo Corporation Dealing with switch latency in speech recognition
US8949130B2 (en) * 2007-03-07 2015-02-03 Vlingo Corporation Internal and external speech recognition use with a mobile communication facility
US8838457B2 (en) 2007-03-07 2014-09-16 Vlingo Corporation Using results of unstructured language model based speech recognition to control a system-level function of a mobile communications facility
US8635243B2 (en) 2007-03-07 2014-01-21 Research In Motion Limited Sending a communications header with voice recording to send metadata for use in speech recognition, formatting, and search mobile search application
US8886540B2 (en) 2007-03-07 2014-11-11 Vlingo Corporation Using speech recognition results based on an unstructured language model in a mobile communication facility application
US20080228493A1 (en) * 2007-03-12 2008-09-18 Chih-Lin Hu Determining voice commands with cooperative voice recognition
US8180641B2 (en) * 2008-09-29 2012-05-15 Microsoft Corporation Sequential speech recognition with two unequal ASR systems
TWI411981B (zh) * 2008-11-10 2013-10-11 Inventec Corp 提供真人引導發音之語言學習系統、伺服器及其方法
US8515762B2 (en) * 2009-01-22 2013-08-20 Microsoft Corporation Markup language-based selection and utilization of recognizers for utterance processing
US8346549B2 (en) * 2009-12-04 2013-01-01 At&T Intellectual Property I, L.P. System and method for supplemental speech recognition by identified idle resources
CN102571882A (zh) * 2010-12-31 2012-07-11 上海博泰悦臻电子设备制造有限公司 基于网络的语音提醒的方法和***
WO2012116110A1 (en) 2011-02-22 2012-08-30 Speak With Me, Inc. Hybridized client-server speech recognition
JP5637131B2 (ja) * 2011-12-26 2014-12-10 株式会社デンソー 音声認識装置
JP6050171B2 (ja) * 2013-03-28 2016-12-21 日本電気株式会社 認識処理制御装置、認識処理制御方法および認識処理制御プログラム
FR3045909B1 (fr) * 2015-12-17 2017-12-29 Delta Dore Procede et dispositif d'analyse et de repartition de commandes vocales
US20180025731A1 (en) * 2016-07-21 2018-01-25 Andrew Lovitt Cascading Specialized Recognition Engines Based on a Recognition Policy
US10748531B2 (en) * 2017-04-13 2020-08-18 Harman International Industries, Incorporated Management layer for multiple intelligent personal assistant services
CN110444196B (zh) * 2018-05-10 2023-04-07 腾讯科技(北京)有限公司 基于同声传译的数据处理方法、装置、***和存储介质

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2818362B2 (ja) * 1992-09-21 1998-10-30 インターナショナル・ビジネス・マシーンズ・コーポレイション 音声認識装置のコンテキスト切換えシステムおよび方法
ZA948426B (en) * 1993-12-22 1995-06-30 Qualcomm Inc Distributed voice recognition system
JPH0863478A (ja) 1994-08-26 1996-03-08 Toshiba Corp 言語処理方法及び言語処理装置
US5745776A (en) * 1995-04-19 1998-04-28 Sheppard, Ii; Charles Bradford Enhanced electronic dictionary
US5890123A (en) * 1995-06-05 1999-03-30 Lucent Technologies, Inc. System and method for voice controlled video screen display
US5710918A (en) * 1995-06-07 1998-01-20 International Business Machines Corporation Method for distributed task fulfillment of web browser requests
US5915001A (en) * 1996-11-14 1999-06-22 Vois Corporation System and method for providing and using universally accessible voice and speech data files
JPH10177468A (ja) * 1996-12-16 1998-06-30 Casio Comput Co Ltd 移動端末音声認識/データベース検索通信システム
US5960399A (en) * 1996-12-24 1999-09-28 Gte Internetworking Incorporated Client/server speech processor/recognizer
US6122613A (en) 1997-01-30 2000-09-19 Dragon Systems, Inc. Speech recognition using multiple recognizers (selectively) applied to the same input sample
US6173259B1 (en) * 1997-03-27 2001-01-09 Speech Machines Plc Speech to text conversion
GB2323693B (en) 1997-03-27 2001-09-26 Forum Technology Ltd Speech to text conversion
US5884266A (en) * 1997-04-02 1999-03-16 Motorola, Inc. Audio interface for document based information resource navigation and method therefor
US6078886A (en) * 1997-04-14 2000-06-20 At&T Corporation System and method for providing remote automatic speech recognition services via a packet network
US6112176A (en) * 1997-05-16 2000-08-29 Compaq Computer Corporation Speech data collection over the world wide web
US6157705A (en) * 1997-12-05 2000-12-05 E*Trade Group, Inc. Voice control of a server
US6233559B1 (en) * 1998-04-01 2001-05-15 Motorola, Inc. Speech control of multiple applications using applets
US6115686A (en) * 1998-04-02 2000-09-05 Industrial Technology Research Institute Hyper text mark up language document to speech converter
GB2343777B (en) * 1998-11-13 2003-07-02 Motorola Ltd Mitigating errors in a distributed speech recognition process

Also Published As

Publication number Publication date
KR20020003865A (ko) 2002-01-15
JP4597383B2 (ja) 2010-12-15
DE60007620D1 (de) 2004-02-12
WO2000054251A3 (en) 2000-12-28
CN1343351A (zh) 2002-04-03
JP2002539480A (ja) 2002-11-19
US6757655B1 (en) 2004-06-29
AU3153700A (en) 2000-09-28
WO2000054251A2 (en) 2000-09-14
DE60007620T2 (de) 2004-11-18
EP1163661A2 (de) 2001-12-19
DE19910236A1 (de) 2000-09-21
EP1163661B1 (de) 2004-01-07

Similar Documents

Publication Publication Date Title
ATE257616T1 (de) Spracherkennungsverfahren
US6208964B1 (en) Method and apparatus for providing unsupervised adaptation of transcriptions
Rabiner Applications of voice processing to telecommunications
DE69811921T2 (de) Vorrichtung und verfahren zur unterscheidung von ähnlich klingenden wörtern in der spracherkennung
US20020138274A1 (en) Server based adaption of acoustic models for client-based speech systems
ATE353463T1 (de) Client/server basiertes spracherkennungssystem
AU2002211438A1 (en) Language independent voice-based search system
WO2003005340A3 (en) Method and apparatus for improving voice recognition performance in a voice application distribution system
WO2001097213A8 (en) Speech recognition using utterance-level confidence estimates
DE69937176D1 (de) Segmentierungsverfahren zur Erweiterung des aktiven Vokabulars von Spracherkennern
EP0398574A3 (de) Spracherkennung unter Anwendung von Stichwörtern und Nichtstichwörter-Modellierung
EP1256936A3 (de) Verfahren zum Training oder zur Adaption eines Spracherkenners
CN107564531A (zh) 基于声纹特征的会议记录方法、装置及计算机设备
DE69827667D1 (de) Vokoder basierter spracherkenner
JP2010102254A (ja) 話者テンプレートを更新する装置及び方法
ATE292524T1 (de) Vorrichtung und verfahren zur telefonie-basierten spracherkennung für das bereitstellen von informationen zum sortieren von poststücken und paketen.
FI20010792A (fi) Käyttäjäriippumattoman puheentunnistuksen järjestäminen
BR9913524A (pt) Reconhecedor de voz, e, processo de reconhecimento de voz
DE60002584D1 (de) Anwendung von Referenzdaten für Spracherkennung
DE60034772D1 (de) Zurückweisungsverfahren in der spracherkennung
TW200304638A (en) Network-accessible speaker-dependent voice models of multiple persons
ATE172317T1 (de) Sprachumsetzungsverfahren
CN109616116B (zh) 通话***及其通话方法
WO2002049004A3 (de) Verfahren und anordnung zur spracherkennung für ein kleingerät
ATE341809T1 (de) Verfahren zur erkennung von sprachinformationen

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties