ATE385024T1 - Multilinguale Spracherkennung

Multilinguale Spracherkennung (Multilingual speech recognition)

Info

Publication number
ATE385024T1
Authority
AT
Austria
Prior art keywords
subword
speech recognition
items
list
subword unit
Application number
AT05003670T
Other languages
English (en)
Inventor
Marcus Hennecke
Thomas Krippgans
Original Assignee
Harman Becker Automotive Sys
Priority date (an assumption, not a legal conclusion)
2005-02-21
Filing date
2005-02-21
Publication date
2008-02-15
Application filed by Harman Becker Automotive Sys
Application granted
Publication of ATE385024T1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/32 Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187 Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Machine Translation (AREA)
AT05003670T 2005-02-21 2005-02-21 Multilinguale spracherkennung ATE385024T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP05003670A EP1693828B1 (de) 2005-02-21 2005-02-21 Multilinguale Spracherkennung

Publications (1)

Publication Number Publication Date
ATE385024T1 (de) 2008-02-15

Family

ID=34933852

Family Applications (1)

Application Number Title Priority Date Filing Date
AT05003670T ATE385024T1 (de) 2005-02-21 2005-02-21 Multilinguale spracherkennung

Country Status (4)

Country Link
US (1) US20060206331A1 (de)
EP (1) EP1693828B1 (de)
AT (1) ATE385024T1 (de)
DE (1) DE602005004503T2 (de)

Families Citing this family (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1693829B1 (de) * 2005-02-21 2018-12-05 Harman Becker Automotive Systems GmbH Sprachgesteuertes Datensystem
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
SG133419A1 (en) * 2005-12-12 2007-07-30 Creative Tech Ltd A method and apparatus for accessing a digital file from a collection of digital files
US7873517B2 (en) 2006-11-09 2011-01-18 Volkswagen Of America, Inc. Motor vehicle with a speech interface
DE102006057159A1 (de) * 2006-12-01 2008-06-05 Deutsche Telekom Ag Verfahren zur Klassifizierung der gesprochenen Sprache in Sprachdialogsystemen
EP1975923B1 (de) * 2007-03-28 2016-04-27 Nuance Communications, Inc. Mehrsprachige nicht-muttersprachliche Spracherkennung
DE112009004313B4 (de) * 2009-01-28 2016-09-22 Mitsubishi Electric Corp. Stimmerkennungseinrichtung
US8949125B1 (en) * 2010-06-16 2015-02-03 Google Inc. Annotating maps with user-contributed pronunciations
US8489398B1 (en) 2011-01-14 2013-07-16 Google Inc. Disambiguation of spoken proper names
US9286894B1 (en) 2012-01-31 2016-03-15 Google Inc. Parallel recognition
US9093076B2 (en) * 2012-04-30 2015-07-28 2236008 Ontario Inc. Multipass ASR controlling multiple applications
US9431012B2 (en) 2012-04-30 2016-08-30 2236008 Ontario Inc. Post processing of natural language automatic speech recognition
US20140214401A1 (en) 2013-01-29 2014-07-31 Tencent Technology (Shenzhen) Company Limited Method and device for error correction model training and text error correction
US9471567B2 (en) * 2013-01-31 2016-10-18 Ncr Corporation Automatic language recognition
DE102013005844B3 (de) * 2013-03-28 2014-08-28 Technische Universität Braunschweig Verfahren und Vorrichtung zum Messen der Qualität eines Sprachsignals
KR102084646B1 (ko) 2013-07-04 2020-04-14 삼성전자주식회사 음성 인식 장치 및 음성 인식 방법
EP3040985B1 (de) * 2013-08-26 2023-08-23 Samsung Electronics Co., Ltd. Elektronische vorrichtung und verfahren zur spracherkennung
WO2015075789A1 (ja) 2013-11-20 2015-05-28 三菱電機株式会社 音声認識装置および音声認識方法
US9747897B2 (en) * 2013-12-17 2017-08-29 Google Inc. Identifying substitute pronunciations
US10339920B2 (en) * 2014-03-04 2019-07-02 Amazon Technologies, Inc. Predicting pronunciation in speech recognition
DE102014210716A1 (de) * 2014-06-05 2015-12-17 Continental Automotive Gmbh Assistenzsystem, das mittels Spracheingaben steuerbar ist, mit einer Funktionseinrichtung und mehreren Spracherkennungsmodulen
US9683862B2 (en) * 2015-08-24 2017-06-20 International Business Machines Corporation Internationalization during navigation
DE102015014206B4 (de) 2015-11-04 2020-06-25 Audi Ag Verfahren und Vorrichtung zum Auswählen eines Navigationsziels aus einer von mehreren Sprachregionen mittels Spracheingabe
US9959887B2 (en) * 2016-03-08 2018-05-01 International Business Machines Corporation Multi-pass speech activity detection strategy to improve automatic speech recognition
US10593321B2 (en) * 2017-12-15 2020-03-17 Mitsubishi Electric Research Laboratories, Inc. Method and apparatus for multi-lingual end-to-end speech recognition
US10565320B1 (en) 2018-09-28 2020-02-18 International Business Machines Corporation Dynamic multilingual speech recognition
CN113692616B (zh) * 2019-05-03 2024-01-05 谷歌有限责任公司 用于在端到端模型中的跨语言语音识别的基于音素的场境化
CN112364658A (zh) 2019-07-24 2021-02-12 阿里巴巴集团控股有限公司 翻译以及语音识别方法、装置、设备
CN110634487B (zh) * 2019-10-24 2022-05-17 科大讯飞股份有限公司 一种双语种混合语音识别方法、装置、设备及存储介质
CN111798836B (zh) * 2020-08-03 2023-12-05 上海茂声智能科技有限公司 一种自动切换语种方法、装置、系统、设备和存储介质
CN113035171B (zh) * 2021-03-05 2022-09-02 随锐科技集团股份有限公司 语音识别处理方法及系统

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5602960A (en) * 1994-09-30 1997-02-11 Apple Computer, Inc. Continuous mandarin chinese speech recognition system having an integrated tone classifier
DE19636739C1 (de) * 1996-09-10 1997-07-03 Siemens Ag Verfahren zur Mehrsprachenverwendung eines hidden Markov Lautmodelles in einem Spracherkennungssystem
US6085160A (en) * 1998-07-10 2000-07-04 Lernout & Hauspie Speech Products N.V. Language independent speech recognition
US7120582B1 (en) * 1999-09-07 2006-10-10 Dragon Systems, Inc. Expanding an effective vocabulary of a speech recognition system
US6912499B1 (en) * 1999-08-31 2005-06-28 Nortel Networks Limited Method and apparatus for training a multilingual speech model set
EP1134726A1 (de) * 2000-03-15 2001-09-19 Siemens Aktiengesellschaft Verfahren zur Erkennung von Sprachäusserungen nicht-muttersprachlicher Sprecher in einem Sprachverarbeitungssystem
US7181395B1 (en) * 2000-10-27 2007-02-20 International Business Machines Corporation Methods and apparatus for automatic generation of multiple pronunciations from acoustic data
ATE297588T1 (de) * 2000-11-14 2005-06-15 Ibm Anpassung des phonetischen kontextes zur verbesserung der spracherkennung
GB0028277D0 (en) * 2000-11-20 2001-01-03 Canon Kk Speech processing system
EP1217610A1 (de) * 2000-11-28 2002-06-26 Siemens Aktiengesellschaft Verfahren und System zur multilingualen Spracherkennung
EP1233406A1 (de) * 2001-02-14 2002-08-21 Sony International (Europe) GmbH Angepasste Spracherkennung für ausländische Sprecher
US7043431B2 (en) * 2001-08-31 2006-05-09 Nokia Corporation Multilingual speech recognition system using text derived recognition models
DE10207895B4 (de) * 2002-02-23 2005-11-03 Harman Becker Automotive Systems Gmbh Verfahren zur Spracherkennung und Spracherkennungssystem
US7092883B1 (en) * 2002-03-29 2006-08-15 At&T Generating confidence scores from word lattices
US6932873B2 (en) * 2002-07-30 2005-08-23 Applied Materials Israel, Ltd. Managing work-piece deflection
US7149688B2 (en) * 2002-11-04 2006-12-12 Speechworks International, Inc. Multi-lingual speech recognition with cross-language context modeling
WO2004047077A1 (en) * 2002-11-15 2004-06-03 Voice Signal Technologies, Inc. Multilingual speech recognition
US8285537B2 (en) * 2003-01-31 2012-10-09 Comverse, Inc. Recognition of proper nouns using native-language pronunciation
US7689404B2 (en) * 2004-02-24 2010-03-30 Arkady Khasin Method of multilingual speech recognition by reduction to single-language recognizer engine components
US20050197837A1 (en) * 2004-03-08 2005-09-08 Janne Suontausta Enhanced multilingual speech recognition system
US20050267755A1 (en) * 2004-05-27 2005-12-01 Nokia Corporation Arrangement for speech recognition

Also Published As

Publication number Publication date
DE602005004503D1 (de) 2008-03-13
EP1693828A1 (de) 2006-08-23
EP1693828B1 (de) 2008-01-23
DE602005004503T2 (de) 2009-01-22
US20060206331A1 (en) 2006-09-14

Similar Documents

Publication Publication Date Title
ATE385024T1 (de) Multilinguale spracherkennung
CN105869634B (zh) 一种基于领域的带反馈语音识别后文本纠错方法及系统
ATE527652T1 (de) Mehrstufige spracherkennung
US20160336007A1 (en) Speech search device and speech search method
ATE457510T1 (de) Spracherkennungssystem mit riesigem vokabular
CN108074562B (zh) 语音识别装置、语音识别方法以及存储介质
JP2006048628A5 (de)
US20200273449A1 (en) Method, system and apparatus for multilingual and multimodal keyword search in a mixlingual speech corpus
TW200638337A (en) Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
CN108091334B (zh) 识别装置、识别方法以及存储介质
ATE419616T1 (de) Verfahren, einrichtung und computerprogramm zur spracherkennung
ATE524777T1 (de) Automatische aktualisierung eines sprachmodells
EP1800293A4 (de) System zur identifikation gesprochener sprache und verfahren zu seinem training und betrieb
WO2007015869A3 (en) Spoken language proficiency assessment by computer
DE60005326D1 (de) Erkennungseinheiten mit komplementären sprachmodellen
Deng et al. Improving accent identification and accented speech recognition under a framework of self-supervised learning
WO2006086511A3 (en) Method and apparatus utilizing voice input to resolve ambiguous manually entered text input
WO2009008055A1 (ja) 音声認識装置、音声認識方法、および、音声認識プログラム
Gupta et al. A language independent approach to audio search
Szöke et al. BUT QUESST 2014 system description.
US8682668B2 (en) Language model score look-ahead value imparting device, language model score look-ahead value imparting method, and program storage medium
Rastrow et al. Towards using hybrid word and fragment units for vocabulary independent LVCSR systems
WO2010018453A3 (en) System and method for processing electronically generated text
Dzhambazov et al. Automatic lyrics-to-audio alignment in classical Turkish music
JP5004863B2 (ja) 音声検索装置および音声検索方法

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties