DE69926851D1 - Method and apparatus for voice activity detection - Google Patents
Method and apparatus for voice activity detection
- Publication number
- DE69926851D1
- Authority
- DE
- Germany
- Prior art keywords
- voice activity
- activity detection
- voice
- detection
- activity
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Telephonic Communication Services (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Telephone Function (AREA)
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB9822928.9A GB9822928D0 (en) | 1998-10-20 | 1998-10-20 | Speech processing apparatus and method |
GB9822932 | 1998-10-20 | | |
GBGB9822932.1A GB9822932D0 (en) | 1998-10-20 | 1998-10-20 | Speech processing apparatus and method |
GB9822928 | 1998-10-20 | | |
Publications (2)
Publication Number | Publication Date |
---|---|
DE69926851D1 (de) | 2005-09-29 |
DE69926851T2 (de) | 2006-06-08 |
Family
ID=26314539
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE69926851T Expired - Lifetime DE69926851T2 (de) | 1998-10-20 | 1999-10-18 | Method and apparatus for voice activity detection |
Country Status (4)
Country | Link |
---|---|
US (2) | US6711536B2 (de) |
EP (1) | EP0996110B1 (de) |
JP (1) | JP4484283B2 (de) |
DE (1) | DE69926851T2 (de) |
Families Citing this family (58)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6711536B2 (en) * | 1998-10-20 | 2004-03-23 | Canon Kabushiki Kaisha | Speech processing apparatus and method |
US6327564B1 (en) * | 1999-03-05 | 2001-12-04 | Matsushita Electric Corporation Of America | Speech detection using stochastic confidence measures on the frequency spectrum |
US6868380B2 (en) * | 2000-03-24 | 2005-03-15 | Eliza Corporation | Speech recognition system and method for generating phonotic estimates |
AU2001294989A1 (en) * | 2000-10-04 | 2002-04-15 | Clarity, L.L.C. | Speech detection |
JP2002132287A (ja) * | 2000-10-20 | 2002-05-09 | Canon Inc | Voice recording method, voice recording apparatus, and storage medium |
US6850887B2 (en) * | 2001-02-28 | 2005-02-01 | International Business Machines Corporation | Speech recognition in noisy environments |
WO2002073600A1 (en) * | 2001-03-14 | 2002-09-19 | International Business Machines Corporation | Method and processor system for processing of an audio signal |
GB2380644A (en) * | 2001-06-07 | 2003-04-09 | Canon Kk | Speech detection |
US6959276B2 (en) * | 2001-09-27 | 2005-10-25 | Microsoft Corporation | Including the category of environmental noise when processing speech signals |
US7299173B2 (en) * | 2002-01-30 | 2007-11-20 | Motorola Inc. | Method and apparatus for speech detection using time-frequency variance |
KR101047194B1 (ko) * | 2002-05-03 | 2011-07-06 | 하만인터내셔날인더스트리스인코포레이티드 | Sound detection and localization system |
US7072828B2 (en) * | 2002-05-13 | 2006-07-04 | Avaya Technology Corp. | Apparatus and method for improved voice activity detection |
US20040064314A1 (en) * | 2002-09-27 | 2004-04-01 | Aubert Nicolas De Saint | Methods and apparatus for speech end-point detection |
US8326621B2 (en) | 2003-02-21 | 2012-12-04 | Qnx Software Systems Limited | Repetitive transient noise removal |
US8271279B2 (en) | 2003-02-21 | 2012-09-18 | Qnx Software Systems Limited | Signature noise removal |
US7885420B2 (en) * | 2003-02-21 | 2011-02-08 | Qnx Software Systems Co. | Wind noise suppression system |
US8073689B2 (en) * | 2003-02-21 | 2011-12-06 | Qnx Software Systems Co. | Repetitive transient noise removal |
US7725315B2 (en) * | 2003-02-21 | 2010-05-25 | Qnx Software Systems (Wavemakers), Inc. | Minimization of transient noises in a voice signal |
US7895036B2 (en) * | 2003-02-21 | 2011-02-22 | Qnx Software Systems Co. | System for suppressing wind noise |
US7949522B2 (en) | 2003-02-21 | 2011-05-24 | Qnx Software Systems Co. | System for suppressing rain noise |
JP4348970B2 (ja) * | 2003-03-06 | 2009-10-21 | ソニー株式会社 | Information detection apparatus and method, and program |
US8918316B2 (en) * | 2003-07-29 | 2014-12-23 | Alcatel Lucent | Content identification system |
GB2405949A (en) * | 2003-09-12 | 2005-03-16 | Canon Kk | Voice activated device with periodicity determination |
GB2405948B (en) * | 2003-09-12 | 2006-06-28 | Canon Res Ct Europ Ltd | Voice activated device |
US7756709B2 (en) * | 2004-02-02 | 2010-07-13 | Applied Voice & Speech Technologies, Inc. | Detection of voice inactivity within a sound stream |
JP4460580B2 (ja) * | 2004-07-21 | 2010-05-12 | 富士通株式会社 | Speed conversion apparatus, speed conversion method, and program |
US20060100866A1 (en) * | 2004-10-28 | 2006-05-11 | International Business Machines Corporation | Influencing automatic speech recognition signal-to-noise levels |
EP1840877A4 (de) * | 2005-01-18 | 2008-05-21 | Fujitsu Ltd | Speech speed changing method and speech speed changing device |
FR2881867A1 (fr) * | 2005-02-04 | 2006-08-11 | France Telecom | Method for transmitting end-of-speech marks in a speech recognition system |
US8219391B2 (en) * | 2005-02-15 | 2012-07-10 | Raytheon Bbn Technologies Corp. | Speech analyzing system with speech codebook |
US7962340B2 (en) * | 2005-08-22 | 2011-06-14 | Nuance Communications, Inc. | Methods and apparatus for buffering data for use in accordance with a speech recognition system |
US7697827B2 (en) | 2005-10-17 | 2010-04-13 | Konicek Jeffrey C | User-friendlier interfaces for a camera |
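JPWO2008007616A1 (ja) * | 2006-07-13 | 2009-12-10 | 日本電気株式会社 | Apparatus, method, and program for warning of unvoiced-utterance input |
KR100883652B1 (ko) * | 2006-08-03 | 2009-02-18 | 삼성전자주식회사 | Method and apparatus for detecting speech segments, and speech recognition system using the same |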
US8775168B2 (en) * | 2006-08-10 | 2014-07-08 | Stmicroelectronics Asia Pacific Pte, Ltd. | Yule walker based low-complexity voice activity detector in noise suppression systems |
KR100897554B1 (ko) * | 2007-02-21 | 2009-05-15 | 삼성전자주식회사 | Distributed speech recognition system and method, and terminal for distributed speech recognition |
JP5089295B2 (ja) * | 2007-08-31 | 2012-12-05 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Speech processing system, method, and program |
US8473282B2 (en) | 2008-01-25 | 2013-06-25 | Yamaha Corporation | Sound processing device and program |
JP5169297B2 (ja) * | 2008-02-22 | 2013-03-27 | ヤマハ株式会社 | Sound processing device and program |
US8190440B2 (en) * | 2008-02-29 | 2012-05-29 | Broadcom Corporation | Sub-band codec with native voice activity detection |
US8762150B2 (en) | 2010-09-16 | 2014-06-24 | Nuance Communications, Inc. | Using codec parameters for endpoint detection in speech recognition |
US8942975B2 (en) * | 2010-11-10 | 2015-01-27 | Broadcom Corporation | Noise suppression in a Mel-filtered spectral domain |
US8719019B2 (en) * | 2011-04-25 | 2014-05-06 | Microsoft Corporation | Speaker identification |
US8972256B2 (en) * | 2011-10-17 | 2015-03-03 | Nuance Communications, Inc. | System and method for dynamic noise adaptation for robust automatic speech recognition |
WO2013124862A1 (en) * | 2012-02-21 | 2013-08-29 | Tata Consultancy Services Limited | Modified mel filter bank structure using spectral characteristics for sound analysis |
US9060052B2 (en) | 2013-03-13 | 2015-06-16 | Accusonus S.A. | Single channel, binaural and multi-channel dereverberation |
WO2016028495A1 (en) | 2014-08-22 | 2016-02-25 | Sri International | Systems for speech-based assessment of a patient's state-of-mind |
CN104599675A (zh) * | 2015-02-09 | 2015-05-06 | 宇龙计算机通信科技(深圳)有限公司 | Speech processing method, speech processing device, and terminal |
US10134425B1 (en) * | 2015-06-29 | 2018-11-20 | Amazon Technologies, Inc. | Direction-based speech endpointing |
US10706873B2 (en) * | 2015-09-18 | 2020-07-07 | Sri International | Real-time speaker state analytics platform |
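CN106373592B (zh) * | 2016-08-31 | 2019-04-23 | 北京华科飞扬科技股份公司 | Noise-tolerant audio sentence-segmentation processing method and *** |
CN106157951B (zh) * | 2016-08-31 | 2019-04-23 | 北京华科飞扬科技股份公司 | Automatic splitting method for audio sentence segmentation and *** |
JP2018072723A (ja) * | 2016-11-02 | 2018-05-10 | ヤマハ株式会社 | Acoustic processing method and acoustic processing device |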
US11216724B2 (en) * | 2017-12-07 | 2022-01-04 | Intel Corporation | Acoustic event detection based on modelling of sequence of event subparts |
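JP6838588B2 (ja) * | 2018-08-28 | 2021-03-03 | 横河電機株式会社 | Speech analysis device, speech analysis method, program, and recording medium |
CN110136715B (zh) | 2019-05-16 | 2021-04-06 | 北京百度网讯科技有限公司 | Speech recognition method and device |
CN113593539A (zh) * | 2020-04-30 | 2021-11-02 | 阿里巴巴集团控股有限公司 | Streaming end-to-end speech recognition method, apparatus, and electronic device |
TWI748587B (zh) * | 2020-08-04 | 2021-12-01 | 瑞昱半導體股份有限公司 | Sound event detection system and method |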
Family Cites Families (34)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3873925A (en) | 1974-03-07 | 1975-03-25 | Motorola Inc | Audio frequency squelch system |
US3873926A (en) | 1974-05-03 | 1975-03-25 | Motorola Inc | Audio frequency squelch system |
US4187396A (en) | 1977-06-09 | 1980-02-05 | Harris Corporation | Voice detector circuit |
US4481593A (en) * | 1981-10-05 | 1984-11-06 | Exxon Corporation | Continuous speech recognition |
US4489434A (en) * | 1981-10-05 | 1984-12-18 | Exxon Corporation | Speech recognition method and apparatus |
JPS5868097A (ja) * | 1981-10-20 | 1983-04-22 | 日産自動車株式会社 | Speech recognition device for vehicles |
US4484344A (en) | 1982-03-01 | 1984-11-20 | Rockwell International Corporation | Voice operated switch |
JPS6048100A (ja) * | 1983-08-26 | 1985-03-15 | 松下電器産業株式会社 | Speech recognition device |
JPS60200300A (ja) * | 1984-03-23 | 1985-10-09 | 松下電器産業株式会社 | Speech start and end point detection device |
US4718092A (en) * | 1984-03-27 | 1988-01-05 | Exxon Research And Engineering Company | Speech recognition activation and deactivation method |
JPS6148898A (ja) * | 1984-08-16 | 1986-03-10 | 松下電器産業株式会社 | Voiced/unvoiced speech determination device |
US4956865A (en) * | 1985-01-30 | 1990-09-11 | Northern Telecom Limited | Speech recognition |
US4870686A (en) * | 1987-10-19 | 1989-09-26 | Motorola, Inc. | Method for entering digit sequences by voice command |
US5305422A (en) * | 1992-02-28 | 1994-04-19 | Panasonic Technologies, Inc. | Method for determining boundaries of isolated words within a speech signal |
JPH0619498A (ja) * | 1992-07-01 | 1994-01-28 | Fujitsu Ltd | Speech detector |
US5617508A (en) | 1992-10-05 | 1997-04-01 | Panasonic Technologies Inc. | Speech detection device for the detection of speech end points based on variance of frequency band limited energy |
FR2697101B1 (fr) * | 1992-10-21 | 1994-11-25 | Sextant Avionique | Speech detection method |
US5692104A (en) * | 1992-12-31 | 1997-11-25 | Apple Computer, Inc. | Method and apparatus for detecting end points of speech activity |
US5459814A (en) * | 1993-03-26 | 1995-10-17 | Hughes Aircraft Company | Voice activity detector for speech signals in variable background noise |
US5473726A (en) * | 1993-07-06 | 1995-12-05 | The United States Of America As Represented By The Secretary Of The Air Force | Audio and amplitude modulated photo data collection for speech recognition |
JPH07273738A (ja) * | 1994-03-28 | 1995-10-20 | Toshiba Corp | Voice transmission control circuit |
DE4422545A1 (de) | 1994-06-28 | 1996-01-04 | Sel Alcatel Ag | Start/end point detection for word recognition |
US5594834A (en) * | 1994-09-30 | 1997-01-14 | Motorola, Inc. | Method and system for recognizing a boundary between sounds in continuous speech |
US5638487A (en) * | 1994-12-30 | 1997-06-10 | Purespeech, Inc. | Automatic speech recognition |
US5778342A (en) * | 1996-02-01 | 1998-07-07 | Dspc Israel Ltd. | Pattern recognition system and method |
US5842161A (en) * | 1996-06-25 | 1998-11-24 | Lucent Technologies Inc. | Telecommunications instrument employing variable criteria speech recognition |
US6570991B1 (en) | 1996-12-18 | 2003-05-27 | Interval Research Corporation | Multi-feature speech/music discrimination system |
JP2000047697A (ja) * | 1998-07-30 | 2000-02-18 | Nec Eng Ltd | Noise canceller |
US6138095A (en) * | 1998-09-03 | 2000-10-24 | Lucent Technologies Inc. | Speech recognition |
JP3310225B2 (ja) * | 1998-09-29 | 2002-08-05 | 松下電器産業株式会社 | Method and apparatus for calculating the temporal rate of change of noise level, and noise reduction method and apparatus |
GB9822931D0 (en) * | 1998-10-20 | 1998-12-16 | Canon Kk | Speech processing apparatus and method |
US6711536B2 (en) * | 1998-10-20 | 2004-03-23 | Canon Kabushiki Kaisha | Speech processing apparatus and method |
GB9822930D0 (en) * | 1998-10-20 | 1998-12-16 | Canon Kk | Speech processing apparatus and method |
US6249757B1 (en) * | 1999-02-16 | 2001-06-19 | 3Com Corporation | System for detecting voice activity |
1999
- 1999-09-30: US application US09/409,247, patent US6711536B2, Expired - Lifetime
- 1999-10-18: DE application DE69926851T, patent DE69926851T2, Expired - Lifetime
- 1999-10-18: EP application EP99308210A, patent EP0996110B1, Expired - Lifetime
- 1999-10-20: JP application JP29876899A, patent JP4484283B2, Expired - Fee Related
2004
- 2004-02-04: US application US10/770,421, publication US20040158465A1, Abandoned
Also Published As
Publication number | Publication date |
---|---|
EP0996110A1 (de) | 2000-04-26 |
EP0996110B1 (de) | 2005-08-24 |
US6711536B2 (en) | 2004-03-23 |
US20030055639A1 (en) | 2003-03-20 |
DE69926851T2 (de) | 2006-06-08 |
US20040158465A1 (en) | 2004-08-12 |
JP2000132177A (ja) | 2000-05-12 |
JP4484283B2 (ja) | 2010-06-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE69926851D1 (de) | | Method and apparatus for voice activity detection |
DE69831991D1 (de) | | Method and apparatus for speech detection |
DE69822687D1 (de) | | Apparatus and method for summarization |
DE69930560D1 (de) | | Method and apparatus for pattern recognition |
DE69926195D1 (de) | | Apparatus and method for imaging |
DE60032669D1 (de) | | Apparatus and method for bandwidth monitoring |
DE69927328T2 (de) | | Apparatus and method for signal peak limiting |
DE60018733D1 (de) | | Apparatus and method for sample analysis |
DE60021077D1 (de) | | Apparatus and method for sample dispensing |
DE69918005D1 (de) | | Method and apparatus for the selective removal of carbon monoxide |
DE59601862D1 (de) | | Method and apparatus for electrolysis |
DE69834320D1 (de) | | Method and apparatus for sequence estimation |
DE69715071T2 (de) | | Method and apparatus for speech processing |
DE69942553D1 (de) | | Apparatus and method for time measurement |
DE69942295D1 (de) | | Apparatus and method for information processing |
DE69928852D1 (de) | | Apparatus and method for programming support |
DE69926451D1 (de) | | Method and apparatus for suppressing multi-channel echoes |
DE69803202D1 (de) | | Method and apparatus for speech detection |
DE69823416D1 (de) | | Method and apparatus for fire detection |
DE60031812D1 (de) | | Apparatus and method for sound synthesis |
DE10081176D2 (de) | | Method and apparatus for object scanning |
DE69943234D1 (de) | | Apparatus and method for speech decoding |
DE69921066D1 (de) | | Method and apparatus for speech coding |
DE69922769D1 (de) | | Apparatus and method for speech processing |
DE69928456D1 (de) | | Method and apparatus for pattern recognition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | 8364 | No opposition during term of opposition | |