DE60114968D1 - Noise-robust speech recognition - Google Patents

Noise-robust speech recognition (Geräuschrobuste Spracherkennung)

Info

Publication number: DE60114968D1
Authority: DE; Germany
Prior art keywords: sound; speech recognition; proof; proof speech; recognition
Prior art date: 2000-09-29
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Expired - Fee Related

Application number

DE60114968T

Other versions

DE60114968T2 (de)

Inventor

Kiyoshi Yajima

Soichi Toyama

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

Pioneer Corp

Original Assignee

Pioneer Corp

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2000-09-29

Filing date

2001-09-27

Publication date

2005-12-22

2001-09-27 Application filed by Pioneer Corp

2005-12-22 Application granted

2005-12-22 Publication of DE60114968D1

2006-07-27 Publication of DE60114968T2

2021-09-28 Anticipated expiration

Status: Expired - Fee Related

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/10—Speech classification or search using distance or distortion measures between unknown speech and reference templates
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs

Landscapes

Engineering & Computer Science (AREA)
Computational Linguistics (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Quality & Reliability (AREA)
Signal Processing (AREA)
Complex Calculations (AREA)
Image Analysis (AREA)

DE60114968T, priority 2000-09-29, filed 2001-09-27: Noise-robust speech recognition, Expired - Fee Related, DE60114968T2 (de)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
JP2000298536A JP4169921B2 (ja)	2000-09-29	2000-09-29	Speech recognition system
JP2000298536		2000-09-29

Publications (2)

Publication Number	Publication Date
DE60114968D1 (de)	2005-12-22
DE60114968T2 (de)	2006-07-27

Family

ID=18780481

Family Applications (1)

Application Number	Priority Date	Filing Date	Title
DE60114968T (DE60114968T2 (de), Expired - Fee Related)	2000-09-29	2001-09-27	Noise-robust speech recognition

Country Status (5)

Country	Link
US (1)	US7065488B2 (de)
EP (1)	EP1195744B1 (de)
JP (1)	JP4169921B2 (de)
CN (1)	CN1236421C (de)
DE (1)	DE60114968T2 (de)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
JP2002123285A (ja) *	2000-10-13	2002-04-26	Sony Corp	Speaker adaptation device and speaker adaptation method, recording medium, and speech recognition device
GB2391679B (en) *	2002-02-04	2004-03-24	Zentian Ltd	Speech recognition circuit using parallel processors
DE102004008225B4 (de) *	2004-02-19	2006-02-16	Infineon Technologies Ag	Method and device for determining feature vectors from a signal for pattern recognition, method and device for pattern recognition, and computer-readable storage media
US7865362B2 (en)	2005-02-04	2011-01-04	Vocollect, Inc.	Method and system for considering information about an expected response when performing speech recognition
US7895039B2 (en)	2005-02-04	2011-02-22	Vocollect, Inc.	Methods and systems for optimizing model adaptation for a speech recognition system
US7949533B2 (en)	2005-02-04	2011-05-24	Vocollect, Inc.	Methods and systems for assessing and improving the performance of a speech recognition system
US8200495B2 (en) *	2005-02-04	2012-06-12	Vocollect, Inc.	Methods and systems for considering information about an expected response when performing speech recognition
US7827032B2 (en)	2005-02-04	2010-11-02	Vocollect, Inc.	Methods and systems for adapting a model for a speech recognition system
JP4332129B2 (ja) *	2005-04-20	2009-09-16	富士通株式会社	Document classification program, document classification method, and document classification device
US20070112630A1 (en) *	2005-11-07	2007-05-17	Scanscout, Inc.	Techniques for rendering advertisments with rich media
US8762148B2 (en) *	2006-02-27	2014-06-24	Nec Corporation	Reference pattern adaptation apparatus, reference pattern adaptation method and reference pattern adaptation program
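JP4245617B2 (ja) *	2006-04-06	2009-03-25	株式会社東芝	Feature correction device, feature correction method, and feature correction program
JP4316583B2 (ja) *	2006-04-07	2009-08-19	株式会社東芝	Feature correction device, feature correction method, and feature correction program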
US20080109391A1 (en) *	2006-11-07	2008-05-08	Scanscout, Inc.	Classifying content based on mood
US8549550B2 (en)	2008-09-17	2013-10-01	Tubemogul, Inc.	Method and apparatus for passively monitoring online video viewing and viewer behavior
US8577996B2 (en) *	2007-09-18	2013-11-05	Tremor Video, Inc.	Method and apparatus for tracing users of online video web sites
US9612995B2 (en)	2008-09-17	2017-04-04	Adobe Systems Incorporated	Video viewer targeting based on preference similarity
US20110093783A1 (en) *	2009-10-16	2011-04-21	Charles Parra	Method and system for linking media components
EP2502195A2 (de) *	2009-11-20	2012-09-26	Tadashi Yonezaki	Method and apparatus for optimizing the allocation of advertising content
KR101060183B1 (ko) *	2009-12-11	2011-08-30	한국과학기술연구원	Embedded auditory system and method for processing speech signals
US20110150270A1 (en) *	2009-12-22	2011-06-23	Carpenter Michael D	Postal processing including voice training
JP5494468B2 (ja) *	2010-12-27	2014-05-14	富士通株式会社	State detection device, state detection method, and program for state detection
US8914290B2 (en)	2011-05-20	2014-12-16	Vocollect, Inc.	Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
CN102760436B (zh) *	2012-08-09	2014-06-11	河南省烟草公司开封市公司	Method for screening a speech lexicon
US9978395B2 (en)	2013-03-15	2018-05-22	Vocollect, Inc.	Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
US10714121B2 (en)	2016-07-27	2020-07-14	Vocollect, Inc.	Distinguishing user speech from background speech in speech-dense environments
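KR102637339B1 (ko)	2018-08-31	2024-02-16	삼성전자주식회사	Method and apparatus for personalizing a speech recognition model
CN110197670B (zh) *	2019-06-04	2022-06-07	大众问问(北京)信息科技有限公司	Audio noise reduction method, apparatus, and electronic device
CN112863483B (zh) *	2021-01-05	2022-11-08	杭州一知智能科技有限公司	Speech synthesis device supporting multi-speaker style and language switching with controllable prosody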

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US5721808A (en) *	1995-03-06	1998-02-24	Nippon Telegraph And Telephone Corporation	Method for the composition of noise-resistant hidden markov models for speech recognition and speech recognizer using the same
JP2780676B2 (ja) *	1995-06-23	1998-07-30	日本電気株式会社	Speech recognition device and speech recognition method
JP3001037B2 (ja) *	1995-12-13	2000-01-17	日本電気株式会社	Speech recognition device
US6026359A (en) *	1996-09-20	2000-02-15	Nippon Telegraph And Telephone Corporation	Scheme for model adaptation in pattern recognition based on Taylor expansion
US5924065A (en) *	1997-06-16	1999-07-13	Digital Equipment Corporation	Environmently compensated speech processing
EP0997003A2 (de) *	1997-07-01	2000-05-03	Partran APS	Method and circuit for noise reduction in speech signals
US6658385B1 (en) *	1999-03-12	2003-12-02	Texas Instruments Incorporated	Method for transforming HMMs for speaker-independent recognition in a noisy environment
US6529866B1 (en) *	1999-11-24	2003-03-04	The United States Of America As Represented By The Secretary Of The Navy	Speech recognition system and associated methods
US7089182B2 (en) *	2000-04-18	2006-08-08	Matsushita Electric Industrial Co., Ltd.	Method and apparatus for feature domain joint channel and additive noise compensation
US7003455B1 (en) *	2000-10-16	2006-02-21	Microsoft Corporation	Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech

2000
- 2000-09-29 JP: application JP2000298536A, patent JP4169921B2 (ja), not active, Expired - Fee Related
2001
- 2001-09-27 EP: application EP01308268A, patent EP1195744B1 (de), not active, Expired - Lifetime
- 2001-09-27 DE: application DE60114968T, patent DE60114968T2 (de), not active, Expired - Fee Related
- 2001-09-28 US: application US09/964,677, patent US7065488B2 (en), not active, Expired - Fee Related
- 2001-09-29 CN: application CN01137997.9A, patent CN1236421C (zh), not active, Expired - Fee Related

Also Published As

Publication number	Publication date
EP1195744A3 (de)	2003-01-22
EP1195744A2 (de)	2002-04-10
US20020042712A1 (en)	2002-04-11
DE60114968T2 (de)	2006-07-27
CN1346125A (zh)	2002-04-24
CN1236421C (zh)	2006-01-11
US7065488B2 (en)	2006-06-20
JP2002108383A (ja)	2002-04-10
JP4169921B2 (ja)	2008-10-22
EP1195744B1 (de)	2005-11-16

Legal Events

Date	Code	Title	Description
2006-12-07	8364	No opposition during term of opposition
2008-07-17	8339	Ceased/non-payment of the annual fee

Similar Documents

Publication	Publication Date	Title
DE60114968D1 (de)	2005-12-22	Noise-robust speech recognition
FI19992351A (fi)	2001-04-30	Speech recognition
DE60123747D1 (de)	2006-11-23	Speech recognition-based captioning system
DE69933623D1 (de)	2006-11-30	Speech recognition
DE60204374D1 (de)	2005-07-07	Speech recognition device
DE602004021716D1 (de)	2009-08-06	Speech recognition system
DE60119647D1 (de)	2006-06-22	Key recognition system
DE60233561D1 (de)	2009-10-15	Voice response system
DE60136156D1 (de)	2008-11-27	Loudspeaker
DE60323362D1 (de)	2008-10-16	Speech recognition apparatus
FI20001577A (fi)	2001-12-31	Speech coding
DE60044154D1 (de)	2010-05-20	Speech decoding
DE60128706D1 (de)	2007-07-12	Character recognition system
FI20001327A (fi)	2001-09-11	Microphone structure
DE60138503D1 (de)	2009-06-10	Loudspeaker
DE69920714D1 (de)	2004-11-04	Speech recognition
DE60032068D1 (de)	2007-01-11	Speech decoding
DE60014031D1 (de)	2004-10-28	Speaker recognition by correlation of spectrograms
DE60132357D1 (de)	2008-02-21	Loudspeaker
DE60125084D1 (de)	2007-01-25	Loudspeaker
ATA7972001A (de)	2002-02-15	Electrostatic microphone
DE60225215D1 (de)	2008-04-10	Improved speech recognition
DE60109240D1 (de)	2005-04-14	Speaker verification and recognition
DE60142729D1 (de)	2010-09-16	Speech recognition system
DE60000244D1 (de)	2002-08-08	Recognition device