DE60114968D1 - Noise-robust speech recognition - Google Patents

Noise-robust speech recognition (Geräuschrobuste Spracherkennung)

Publication number
DE60114968D1
Authority
DE
Germany
Prior art keywords
sound
speech recognition
proof
proof speech
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
DE60114968T
Other languages
English (en)
Other versions
DE60114968T2 (de)
Inventor
Kiyoshi Yajima
Soichi Toyama
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Pioneer Corp
Original Assignee
Pioneer Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pioneer Corp
Application granted
Publication of DE60114968D1
Publication of DE60114968T2
Anticipated expiration
Expired - Fee Related (current legal status)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/10 Speech classification or search using distance or distortion measures between unknown speech and reference templates
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/14 Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142 Hidden Markov Models [HMMs]
    • G10L15/144 Training of HMMs

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Complex Calculations (AREA)
  • Image Analysis (AREA)

DE60114968T (priority date 2000-09-29, filing date 2001-09-27): Noise-robust speech recognition. Expired - Fee Related. DE60114968T2 (de)

Applications Claiming Priority (2)

Application Number | Priority Date | Filing Date | Title
JP2000298536A, JP4169921B2 (ja) | 2000-09-29 | 2000-09-29 | Speech recognition system
JP2000298536 | 2000-09-29

Publications (2)

Publication Number | Publication Date
DE60114968D1 (de) | 2005-12-22
DE60114968T2 (de) | 2006-07-27

Family

ID=18780481

Family Applications (1)

Application Number | Priority Date | Filing Date | Title
DE60114968T (Expired - Fee Related), DE60114968T2 (de) | 2000-09-29 | 2001-09-27 | Noise-robust speech recognition

Country Status (5)

Country | Link
US (1) | US7065488B2 (de)
EP (1) | EP1195744B1 (de)
JP (1) | JP4169921B2 (de)
CN (1) | CN1236421C (de)
DE (1) | DE60114968T2 (de)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
JP2002123285A (ja) * | 2000-10-13 | 2002-04-26 | Sony Corp | Speaker adaptation device, speaker adaptation method, recording medium, and speech recognition device
GB2391679B (en) * | 2002-02-04 | 2004-03-24 | Zentian Ltd | Speech recognition circuit using parallel processors
DE102004008225B4 (de) * | 2004-02-19 | 2006-02-16 | Infineon Technologies Ag | Method and device for determining feature vectors from a signal for pattern recognition, method and device for pattern recognition, and computer-readable storage media
US7865362B2 (en) | 2005-02-04 | 2011-01-04 | Vocollect, Inc. | Method and system for considering information about an expected response when performing speech recognition
US7895039B2 (en) | 2005-02-04 | 2011-02-22 | Vocollect, Inc. | Methods and systems for optimizing model adaptation for a speech recognition system
US7949533B2 (en) | 2005-02-04 | 2011-05-24 | Vocollect, Inc. | Methods and systems for assessing and improving the performance of a speech recognition system
US8200495B2 (en) * | 2005-02-04 | 2012-06-12 | Vocollect, Inc. | Methods and systems for considering information about an expected response when performing speech recognition
US7827032B2 (en) | 2005-02-04 | 2010-11-02 | Vocollect, Inc. | Methods and systems for adapting a model for a speech recognition system
JP4332129B2 (ja) * | 2005-04-20 | 2009-09-16 | 富士通株式会社 | Document classification program, document classification method, and document classification device
US20070112630A1 (en) * | 2005-11-07 | 2007-05-17 | Scanscout, Inc. | Techniques for rendering advertisments with rich media
US8762148B2 (en) * | 2006-02-27 | 2014-06-24 | Nec Corporation | Reference pattern adaptation apparatus, reference pattern adaptation method and reference pattern adaptation program
JP4245617B2 (ja) * | 2006-04-06 | 2009-03-25 | 株式会社東芝 | Feature value correction device, feature value correction method, and feature value correction program
JP4316583B2 (ja) * | 2006-04-07 | 2009-08-19 | 株式会社東芝 | Feature value correction device, feature value correction method, and feature value correction program
US20080109391A1 (en) * | 2006-11-07 | 2008-05-08 | Scanscout, Inc. | Classifying content based on mood
US8549550B2 (en) | 2008-09-17 | 2013-10-01 | Tubemogul, Inc. | Method and apparatus for passively monitoring online video viewing and viewer behavior
US8577996B2 (en) * | 2007-09-18 | 2013-11-05 | Tremor Video, Inc. | Method and apparatus for tracing users of online video web sites
US9612995B2 (en) | 2008-09-17 | 2017-04-04 | Adobe Systems Incorporated | Video viewer targeting based on preference similarity
US20110093783A1 (en) * | 2009-10-16 | 2011-04-21 | Charles Parra | Method and system for linking media components
EP2502195A2 (de) * | 2009-11-20 | 2012-09-26 | Tadashi Yonezaki | Method and device for optimizing an allocation of advertising content
KR101060183B1 (ko) * | 2009-12-11 | 2011-08-30 | 한국과학기술연구원 | Embedded auditory system and method for processing voice signals
US20110150270A1 (en) * | 2009-12-22 | 2011-06-23 | Carpenter Michael D | Postal processing including voice training
JP5494468B2 (ja) * | 2010-12-27 | 2014-05-14 | 富士通株式会社 | State detection device, state detection method, and program for state detection
US8914290B2 (en) | 2011-05-20 | 2014-12-16 | Vocollect, Inc. | Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
CN102760436B (zh) * | 2012-08-09 | 2014-06-11 | 河南省烟草公司开封市公司 | Method for screening a speech lexicon
US9978395B2 (en) | 2013-03-15 | 2018-05-22 | Vocollect, Inc. | Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
US10714121B2 (en) | 2016-07-27 | 2020-07-14 | Vocollect, Inc. | Distinguishing user speech from background speech in speech-dense environments
KR102637339B1 (ko) | 2018-08-31 | 2024-02-16 | 삼성전자주식회사 | Method and apparatus for personalizing a speech recognition model
CN110197670B (zh) * | 2019-06-04 | 2022-06-07 | 大众问问(北京)信息科技有限公司 | Audio noise reduction method, apparatus, and electronic device
CN112863483B (zh) * | 2021-01-05 | 2022-11-08 | 杭州一知智能科技有限公司 | Speech synthesis device supporting multi-speaker styles and language switching with controllable prosody

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US5721808A (en) * | 1995-03-06 | 1998-02-24 | Nippon Telegraph And Telephone Corporation | Method for the composition of noise-resistant hidden markov models for speech recognition and speech recognizer using the same
JP2780676B2 (ja) * | 1995-06-23 | 1998-07-30 | 日本電気株式会社 | Speech recognition device and speech recognition method
JP3001037B2 (ja) * | 1995-12-13 | 2000-01-17 | 日本電気株式会社 | Speech recognition device
US6026359A (en) * | 1996-09-20 | 2000-02-15 | Nippon Telegraph And Telephone Corporation | Scheme for model adaptation in pattern recognition based on Taylor expansion
US5924065A (en) * | 1997-06-16 | 1999-07-13 | Digital Equipment Corporation | Environmently compensated speech processing
EP0997003A2 (de) * | 1997-07-01 | 2000-05-03 | Partran APS | Method and circuit for noise reduction in speech signals
US6658385B1 (en) * | 1999-03-12 | 2003-12-02 | Texas Instruments Incorporated | Method for transforming HMMs for speaker-independent recognition in a noisy environment
US6529866B1 (en) * | 1999-11-24 | 2003-03-04 | The United States Of America As Represented By The Secretary Of The Navy | Speech recognition system and associated methods
US7089182B2 (en) * | 2000-04-18 | 2006-08-08 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus for feature domain joint channel and additive noise compensation
US7003455B1 (en) * | 2000-10-16 | 2006-02-21 | Microsoft Corporation | Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech

Also Published As

Publication number | Publication date
EP1195744A3 (de) | 2003-01-22
EP1195744A2 (de) | 2002-04-10
US20020042712A1 (en) | 2002-04-11
DE60114968T2 (de) | 2006-07-27
CN1346125A (zh) | 2002-04-24
CN1236421C (zh) | 2006-01-11
US7065488B2 (en) | 2006-06-20
JP2002108383A (ja) | 2002-04-10
JP4169921B2 (ja) | 2008-10-22
EP1195744B1 (de) | 2005-11-16

Similar Documents

Publication | Publication Date | Title
DE60114968D1 (de) | | Noise-robust speech recognition
FI19992351A (fi) | | Speech recognition
DE60123747D1 (de) | | Speech-recognition-based captioning system
DE69933623D1 (de) | | Speech recognition
DE60204374D1 (de) | | Speech recognition device
DE602004021716D1 (de) | | Speech recognition system
DE60119647D1 (de) | | Key recognition system
DE60233561D1 (de) | | Voice response system
DE60136156D1 (de) | | Loudspeaker
DE60323362D1 (de) | | Speech recognition apparatus
FI20001577A (fi) | | Speech coding
DE60044154D1 (de) | | Speech decoding
DE60128706D1 (de) | | Character recognition system
FI20001327A (fi) | | Microphone structure
DE60138503D1 (de) | | Loudspeaker
DE69920714D1 (de) | | Speech recognition
DE60032068D1 (de) | | Speech decoding
DE60014031D1 (de) | | Speaker recognition by correlation of spectrograms
DE60132357D1 (de) | | Loudspeaker
DE60125084D1 (de) | | Loudspeaker
ATA7972001A (de) | | Electrostatic microphone
DE60225215D1 (de) | | Improved speech recognition
DE60109240D1 (de) | | Speaker verification and recognition
DE60142729D1 (de) | | Speech recognition system
DE60000244D1 (de) | | Recognition device

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8339 Ceased/non-payment of the annual fee