DE60114968D1 - Noise-robust speech recognition (Geräuschrobuste Spracherkennung) - Google Patents
Noise-robust speech recognition
Info
- Publication number
- DE60114968D1
- Authority
- DE
- Germany
- Prior art keywords
- sound
- speech recognition
- proof
- proof speech
- recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/10—Speech classification or search using distance or distortion measures between unknown speech and reference templates
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Complex Calculations (AREA)
- Image Analysis (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2000298536A JP4169921B2 (ja) | 2000-09-29 | 2000-09-29 | Speech recognition system |
JP2000298536 | 2000-09-29 |
Publications (2)
Publication Number | Publication Date |
---|---|
DE60114968D1 (de) | 2005-12-22 |
DE60114968T2 (de) | 2006-07-27 |
Family
ID=18780481
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE60114968T Expired - Fee Related DE60114968T2 (de) | 2000-09-29 | 2001-09-27 | Noise-robust speech recognition |
Country Status (5)
Country | Link |
---|---|
US (1) | US7065488B2 (de) |
EP (1) | EP1195744B1 (de) |
JP (1) | JP4169921B2 (de) |
CN (1) | CN1236421C (de) |
DE (1) | DE60114968T2 (de) |
Families Citing this family (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002123285A (ja) * | 2000-10-13 | 2002-04-26 | Sony Corp | Speaker adaptation device, speaker adaptation method, recording medium, and speech recognition device |
GB2391679B (en) * | 2002-02-04 | 2004-03-24 | Zentian Ltd | Speech recognition circuit using parallel processors |
DE102004008225B4 (de) * | 2004-02-19 | 2006-02-16 | Infineon Technologies Ag | Method and device for determining feature vectors from a signal for pattern recognition, method and device for pattern recognition, and computer-readable storage media |
US7865362B2 (en) | 2005-02-04 | 2011-01-04 | Vocollect, Inc. | Method and system for considering information about an expected response when performing speech recognition |
US7895039B2 (en) | 2005-02-04 | 2011-02-22 | Vocollect, Inc. | Methods and systems for optimizing model adaptation for a speech recognition system |
US7949533B2 (en) | 2005-02-04 | 2011-05-24 | Vocollect, Inc. | Methods and systems for assessing and improving the performance of a speech recognition system |
US8200495B2 (en) * | 2005-02-04 | 2012-06-12 | Vocollect, Inc. | Methods and systems for considering information about an expected response when performing speech recognition |
US7827032B2 (en) | 2005-02-04 | 2010-11-02 | Vocollect, Inc. | Methods and systems for adapting a model for a speech recognition system |
JP4332129B2 (ja) * | 2005-04-20 | 2009-09-16 | 富士通株式会社 | Document classification program, document classification method, and document classification device |
US20070112630A1 (en) * | 2005-11-07 | 2007-05-17 | Scanscout, Inc. | Techniques for rendering advertisments with rich media |
US8762148B2 (en) * | 2006-02-27 | 2014-06-24 | Nec Corporation | Reference pattern adaptation apparatus, reference pattern adaptation method and reference pattern adaptation program |
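JP4245617B2 (ja) * | 2006-04-06 | 2009-03-25 | 株式会社東芝 | Feature correction device, feature correction method, and feature correction program |
JP4316583B2 (ja) * | 2006-04-07 | 2009-08-19 | 株式会社東芝 | Feature correction device, feature correction method, and feature correction program |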
US20080109391A1 (en) * | 2006-11-07 | 2008-05-08 | Scanscout, Inc. | Classifying content based on mood |
US8549550B2 (en) | 2008-09-17 | 2013-10-01 | Tubemogul, Inc. | Method and apparatus for passively monitoring online video viewing and viewer behavior |
US8577996B2 (en) * | 2007-09-18 | 2013-11-05 | Tremor Video, Inc. | Method and apparatus for tracing users of online video web sites |
US9612995B2 (en) | 2008-09-17 | 2017-04-04 | Adobe Systems Incorporated | Video viewer targeting based on preference similarity |
US20110093783A1 (en) * | 2009-10-16 | 2011-04-21 | Charles Parra | Method and system for linking media components |
EP2502195A2 (de) * | 2009-11-20 | 2012-09-26 | Tadashi Yonezaki | Method and apparatus for optimizing an allocation of advertising content |
KR101060183B1 (ko) * | 2009-12-11 | 2011-08-30 | 한국과학기술연구원 | Embedded auditory system and method for processing voice signals |
US20110150270A1 (en) * | 2009-12-22 | 2011-06-23 | Carpenter Michael D | Postal processing including voice training |
JP5494468B2 (ja) * | 2010-12-27 | 2014-05-14 | 富士通株式会社 | State detection device, state detection method, and program for state detection |
US8914290B2 (en) | 2011-05-20 | 2014-12-16 | Vocollect, Inc. | Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment |
CN102760436B (zh) * | 2012-08-09 | 2014-06-11 | 河南省烟草公司开封市公司 | Method for screening a speech lexicon |
US9978395B2 (en) | 2013-03-15 | 2018-05-22 | Vocollect, Inc. | Method and system for mitigating delay in receiving audio stream during production of sound from audio stream |
US10714121B2 (en) | 2016-07-27 | 2020-07-14 | Vocollect, Inc. | Distinguishing user speech from background speech in speech-dense environments |
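KR102637339B1 (ko) | 2018-08-31 | 2024-02-16 | 삼성전자주식회사 | Method and apparatus for personalizing a speech recognition model |
CN110197670B (zh) * | 2019-06-04 | 2022-06-07 | 大众问问(北京)信息科技有限公司 | Audio noise reduction method, apparatus, and electronic device |
CN112863483B (zh) * | 2021-01-05 | 2022-11-08 | 杭州一知智能科技有限公司 | Speech synthesis device supporting multi-speaker styles and language switching with controllable prosody |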
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5721808A (en) * | 1995-03-06 | 1998-02-24 | Nippon Telegraph And Telephone Corporation | Method for the composition of noise-resistant hidden markov models for speech recognition and speech recognizer using the same |
JP2780676B2 (ja) * | 1995-06-23 | 1998-07-30 | 日本電気株式会社 | Speech recognition device and speech recognition method |
JP3001037B2 (ja) * | 1995-12-13 | 2000-01-17 | 日本電気株式会社 | Speech recognition device |
US6026359A (en) * | 1996-09-20 | 2000-02-15 | Nippon Telegraph And Telephone Corporation | Scheme for model adaptation in pattern recognition based on Taylor expansion |
US5924065A (en) * | 1997-06-16 | 1999-07-13 | Digital Equipment Corporation | Environmently compensated speech processing |
EP0997003A2 (de) * | 1997-07-01 | 2000-05-03 | Partran APS | Method and circuit for noise reduction in speech signals |
US6658385B1 (en) * | 1999-03-12 | 2003-12-02 | Texas Instruments Incorporated | Method for transforming HMMs for speaker-independent recognition in a noisy environment |
US6529866B1 (en) * | 1999-11-24 | 2003-03-04 | The United States Of America As Represented By The Secretary Of The Navy | Speech recognition system and associated methods |
US7089182B2 (en) * | 2000-04-18 | 2006-08-08 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus for feature domain joint channel and additive noise compensation |
US7003455B1 (en) * | 2000-10-16 | 2006-02-21 | Microsoft Corporation | Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech |
-
2000
- 2000-09-29 JP JP2000298536A, patent JP4169921B2 (ja), not active: Expired - Fee Related
-
2001
- 2001-09-27 EP EP01308268A, patent EP1195744B1 (de), not active: Expired - Lifetime
- 2001-09-27 DE DE60114968T, patent DE60114968T2 (de), not active: Expired - Fee Related
- 2001-09-28 US US09/964,677, patent US7065488B2 (en), not active: Expired - Fee Related
- 2001-09-29 CN CN01137997.9A, patent CN1236421C (zh), not active: Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
EP1195744A3 (de) | 2003-01-22 |
EP1195744A2 (de) | 2002-04-10 |
US20020042712A1 (en) | 2002-04-11 |
DE60114968T2 (de) | 2006-07-27 |
CN1346125A (zh) | 2002-04-24 |
CN1236421C (zh) | 2006-01-11 |
US7065488B2 (en) | 2006-06-20 |
JP2002108383A (ja) | 2002-04-10 |
JP4169921B2 (ja) | 2008-10-22 |
EP1195744B1 (de) | 2005-11-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60114968D1 (de) | | Noise-robust speech recognition |
FI19992351A (fi) | | Speech recognition |
DE60123747D1 (de) | | Speech-recognition-based captioning system |
DE69933623D1 (de) | | Speech recognition |
DE60204374D1 (de) | | Speech recognition device |
DE602004021716D1 (de) | | Speech recognition system |
DE60119647D1 (de) | | Key recognition system |
DE60233561D1 (de) | | Voice response system |
DE60136156D1 (de) | | Loudspeaker |
DE60323362D1 (de) | | Speech recognition apparatus |
FI20001577A (fi) | | Speech coding |
DE60044154D1 (de) | | Speech decoding |
DE60128706D1 (de) | | Character recognition system |
FI20001327A (fi) | | Microphone structure |
DE60138503D1 (de) | | Loudspeaker |
DE69920714D1 (de) | | Speech recognition |
DE60032068D1 (de) | | Speech decoding |
DE60014031D1 (de) | | Speaker recognition by correlating spectrograms |
DE60132357D1 (de) | | Loudspeaker |
DE60125084D1 (de) | | Loudspeaker |
ATA7972001A (de) | | Electrostatic microphone |
DE60225215D1 (de) | | Improved speech recognition |
DE60109240D1 (de) | | Speaker verification and recognition |
DE60142729D1 (de) | | Speech recognition system |
DE60000244D1 (de) | | Recognition device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| 8364 | No opposition during term of opposition | |
| 8339 | Ceased/non-payment of the annual fee | |