DE60201939D1 - Device for speaker-independent speech recognition, based on a client-server system

Info

Publication number
DE60201939D1
Authority
DE
Germany
Prior art keywords
electronic device
server
speech recognition
speaker
client
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60201939T
Other languages
English (en)
Other versions
DE60201939T2 (de)
Inventor
Olli Viikki
Kari Laurila
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Oyj
Original Assignee
Nokia Oyj
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2001-04-17
Filing date
2002-03-22
Publication date
2004-12-23

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/30 Distributed recognition, e.g. in client-server systems, for mobile phones or network applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Telephonic Communication Services (AREA)
  • Computer And Data Communications (AREA)
  • Lock And Its Accessories (AREA)
  • Machine Translation (AREA)
  • Navigation (AREA)
  • Document Processing Apparatus (AREA)
  • Manipulator (AREA)
  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)
  • Electrically Operated Instructional Devices (AREA)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FI20010792A FI20010792A (fi) 2001-04-17 Arranging user-independent speech recognition
FI20010792 2001-04-17

Publications (2)

Publication Number Publication Date
DE60201939D1 (de) 2004-12-23
DE60201939T2 (de) 2005-03-31

Family

ID=8561000

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60201939T Expired - Lifetime DE60201939T2 (de) 2001-04-17 2002-03-22 Device for speaker-independent speech recognition, based on a client-server system

Country Status (6)

Country Link
US (1) US7392184B2 (de)
EP (1) EP1251492B1 (de)
CN (2) CN101334997A (de)
AT (1) ATE282882T1 (de)
DE (1) DE60201939T2 (de)
FI (1) FI20010792A (de)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7407384B2 (en) 2003-05-29 2008-08-05 Robert Bosch Gmbh System, method and device for language education through a voice portal server
GB2422276B (en) * 2003-09-11 2007-10-03 Voice Signal Technologies Inc Phone number and name pronunciation interchange via mobile phone
FI20031566A (fi) * 2003-10-27 2005-04-28 Nokia Corp Selecting a language for word recognition
US20050197837A1 (en) * 2004-03-08 2005-09-08 Janne Suontausta Enhanced multilingual speech recognition system
US7925512B2 (en) * 2004-05-19 2011-04-12 Nuance Communications, Inc. Method, system, and apparatus for a voice markup language interpreter and voice browser
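JP4384939B2 (ja) * 2004-05-31 2009-12-16 株式会社インパルスジャパン Language identification device, translation device, translation server, language identification method, and translation processing method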
US7788098B2 (en) * 2004-08-02 2010-08-31 Nokia Corporation Predicting tone pattern information for textual information used in telecommunication systems
EP1810277A1 (de) * 2004-11-08 2007-07-25 France Telecom S.A. Method for the distributed construction of a voice recognition model, and device, server and computer programs for implementing it
US20070088549A1 (en) * 2005-10-14 2007-04-19 Microsoft Corporation Natural input of arbitrary text
US9170120B2 (en) * 2007-03-22 2015-10-27 Panasonic Automotive Systems Company Of America, Division Of Panasonic Corporation Of North America Vehicle navigation playback method
US8175236B2 (en) * 2007-11-16 2012-05-08 At&T Mobility Ii Llc IMS and SMS interworking
US8073693B2 (en) 2008-12-04 2011-12-06 At&T Intellectual Property I, L.P. System and method for pronunciation modeling
US8959232B2 (en) * 2008-12-30 2015-02-17 At&T Mobility Ii Llc IMS and MMS interworking
US20120078635A1 (en) * 2010-09-24 2012-03-29 Apple Inc. Voice control system
US20140136210A1 (en) * 2012-11-14 2014-05-15 At&T Intellectual Property I, L.P. System and method for robust personalization of speech recognition
KR102371697B1 (ko) * 2015-02-11 2022-03-08 삼성전자주식회사 Method for operating a voice function and electronic device supporting the same
US11010179B2 (en) 2018-04-20 2021-05-18 Facebook, Inc. Aggregating semantic information for improved understanding of users
US11307880B2 (en) 2018-04-20 2022-04-19 Meta Platforms, Inc. Assisting users with personalized and contextual communication content
US11886473B2 (en) 2018-04-20 2024-01-30 Meta Platforms, Inc. Intent identification for agent matching by assistant systems
US11676220B2 (en) 2018-04-20 2023-06-13 Meta Platforms, Inc. Processing multimodal user input for assistant systems
US11715042B1 (en) 2018-04-20 2023-08-01 Meta Platforms Technologies, Llc Interpretability of deep reinforcement learning models in assistant systems
US11935523B2 (en) * 2019-11-15 2024-03-19 Master English Oy Detection of correctness of pronunciation
CN112837688B (zh) * 2019-11-22 2024-04-02 阿里巴巴集团控股有限公司 Speech transcription method, apparatus, related *** and device

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5165095A (en) * 1990-09-28 1992-11-17 Texas Instruments Incorporated Voice telephone dialing
DE69232112T2 (de) * 1991-11-12 2002-03-14 Fujitsu Ltd Speech synthesis device
SE9304222L (sv) * 1993-12-21 1995-06-22 Telia Ab Method and device for calls from mobile stations
JPH0816619A (ja) 1994-06-30 1996-01-19 Casio Comput Co Ltd Information processing system
US6041300A (en) * 1997-03-21 2000-03-21 International Business Machines Corporation System and method of using pre-enrolled speech sub-units for efficient speech synthesis
KR100238189B1 (ko) * 1997-10-16 2000-01-15 윤종용 Multi-language TTS device and multi-language TTS processing method
DE19751123C1 (de) 1997-11-19 1999-06-17 Deutsche Telekom Ag Device and method for speaker-independent voice name dialing for telecommunication terminals
US6272456B1 (en) * 1998-03-19 2001-08-07 Microsoft Corporation System and method for identifying the language of written text having a plurality of different length n-gram profiles
GB2338369B (en) 1998-06-09 2003-08-06 Nec Technologies Language selection for voice dialling
US6208964B1 (en) * 1998-08-31 2001-03-27 Nortel Networks Limited Method and apparatus for providing unsupervised adaptation of transcriptions
US6128482A (en) * 1998-12-22 2000-10-03 General Motors Corporation Providing mobile application services with download of speaker independent voice model
US6385586B1 (en) * 1999-01-28 2002-05-07 International Business Machines Corporation Speech recognition text-based language conversion and text-to-speech in a client-server configuration to enable language translation devices
US6463413B1 (en) * 1999-04-20 2002-10-08 Matsushita Electrical Industrial Co., Ltd. Speech recognition training for small hardware devices
DE19918382B4 (de) * 1999-04-22 2004-02-05 Siemens Ag Creating a reference model directory for a voice-controlled communication device
EP1074973B1 (de) * 1999-06-30 2006-03-15 International Business Machines Corporation Method for expanding the vocabulary of a speech recognition system
US6557026B1 (en) * 1999-09-29 2003-04-29 Morphism, L.L.C. System and apparatus for dynamically generating audible notices from an information network
US6625576B2 (en) * 2001-01-29 2003-09-23 Lucent Technologies Inc. Method and apparatus for performing text-to-speech conversion in a client/server environment

Also Published As

Publication number Publication date
US20020152067A1 (en) 2002-10-17
FI20010792A (fi) 2002-10-18
ATE282882T1 (de) 2004-12-15
EP1251492A1 (de) 2002-10-23
FI20010792A0 (fi) 2001-04-17
CN101334997A (zh) 2008-12-31
DE60201939T2 (de) 2005-03-31
US7392184B2 (en) 2008-06-24
EP1251492B1 (de) 2004-11-17
CN1381831A (zh) 2002-11-27

Similar Documents

Publication Publication Date Title
DE60201939D1 (de) Device for speaker-independent speech recognition, based on a client-server system
US5774860A (en) Adaptive knowledge base of complex information through interactive voice dialogue
EP1748421A3 (de) Speech input processing with emotion-based model response generation
EP0977175A3 (de) Method and device for speech recognition using a knowledge base
EP1557821A3 (de) Segment-based tonal modeling for tonal languages
EP0774729A3 (de) System for character recognition and translation and system for speech recognition and translation
ATE139856T1 (de) Speech training
ATE233935T1 (de) Device and method for distinguishing similar-sounding words in speech recognition
EP1047046A3 (de) Distributed architecture for training a speech recognition system
CN107564511A (zh) Electronic device, speech synthesis method, and computer-readable storage medium
JPS6466698A (en) Voice recognition equipment
EP1022722A3 (de) Speaker adaptation based on voice eigenvectors
EP1429313A3 (de) Language model for speech recognition
EP0825586A3 (de) Pre-filtering by means of lexical trees for speech recognition
DE60111329D1 (de) Adaptation of the phonetic context to improve speech recognition
DE60030920D1 (de) Method for determining personality traits using a speech-based dialogue
WO2004090866A3 (en) Phonetically based speech recognition system and method
JP2001296880A5 (de)
EP1094406A3 (de) System and method for accessing television-related information via the Internet
EP0852374A3 (de) Method and system for speaker-independent recognition of user-defined phrases
EP1475777A3 (de) Method and device for keyword recognition, program for keyword recognition, with adaptation of keyword and non-keyword models
EP1326232A3 (de) Method, device, and computer program for producing an acoustic model
DE69623364D1 (de) Device for recognizing continuously spoken speech
EP1054387A3 (de) Method and device for activating voice-controlled devices
EP0949606A3 (de) Method and device for speech recognition using phonetic transcriptions

Legal Events

Date Code Title Description
8364 No opposition during term of opposition