DE69926851D1 - Verfahren und Vorrichtung zur Sprachaktivitätsdetektion - Google Patents

Verfahren und Vorrichtung zur Sprachaktivitätsdetektion

Info

Publication number
DE69926851D1
DE69926851D1 DE69926851T DE69926851T DE69926851D1 DE 69926851 D1 DE69926851 D1 DE 69926851D1 DE 69926851 T DE69926851 T DE 69926851T DE 69926851 T DE69926851 T DE 69926851T DE 69926851 D1 DE69926851 D1 DE 69926851D1
Authority
DE
Germany
Prior art keywords
voice activity
activity detection
voice
detection
activity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69926851T
Other languages
English (en)
Other versions
DE69926851T2 (de
Inventor
David Llewellyn Rees
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from GBGB9822928.9A external-priority patent/GB9822928D0/en
Priority claimed from GBGB9822932.1A external-priority patent/GB9822932D0/en
Application filed by Canon Inc filed Critical Canon Inc
Application granted granted Critical
Publication of DE69926851D1 publication Critical patent/DE69926851D1/de
Publication of DE69926851T2 publication Critical patent/DE69926851T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/04Segmentation; Word boundary detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephone Function (AREA)
DE69926851T 1998-10-20 1999-10-18 Verfahren und Vorrichtung zur Sprachaktivitätsdetektion Expired - Lifetime DE69926851T2 (de)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
GBGB9822928.9A GB9822928D0 (en) 1998-10-20 1998-10-20 Speech processing apparatus and method
GB9822932 1998-10-20
GBGB9822932.1A GB9822932D0 (en) 1998-10-20 1998-10-20 Speech processing apparatus and method
GB9822928 1998-10-20

Publications (2)

Publication Number Publication Date
DE69926851D1 true DE69926851D1 (de) 2005-09-29
DE69926851T2 DE69926851T2 (de) 2006-06-08

Family

ID=26314539

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69926851T Expired - Lifetime DE69926851T2 (de) 1998-10-20 1999-10-18 Verfahren und Vorrichtung zur Sprachaktivitätsdetektion

Country Status (4)

Country Link
US (2) US6711536B2 (de)
EP (1) EP0996110B1 (de)
JP (1) JP4484283B2 (de)
DE (1) DE69926851T2 (de)

Families Citing this family (58)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6711536B2 (en) * 1998-10-20 2004-03-23 Canon Kabushiki Kaisha Speech processing apparatus and method
US6327564B1 (en) * 1999-03-05 2001-12-04 Matsushita Electric Corporation Of America Speech detection using stochastic confidence measures on the frequency spectrum
US6868380B2 (en) * 2000-03-24 2005-03-15 Eliza Corporation Speech recognition system and method for generating phonotic estimates
AU2001294989A1 (en) * 2000-10-04 2002-04-15 Clarity, L.L.C. Speech detection
JP2002132287A (ja) * 2000-10-20 2002-05-09 Canon Inc 音声収録方法および音声収録装置および記憶媒体
US6850887B2 (en) * 2001-02-28 2005-02-01 International Business Machines Corporation Speech recognition in noisy environments
WO2002073600A1 (en) * 2001-03-14 2002-09-19 International Business Machines Corporation Method and processor system for processing of an audio signal
GB2380644A (en) * 2001-06-07 2003-04-09 Canon Kk Speech detection
US6959276B2 (en) * 2001-09-27 2005-10-25 Microsoft Corporation Including the category of environmental noise when processing speech signals
US7299173B2 (en) * 2002-01-30 2007-11-20 Motorola Inc. Method and apparatus for speech detection using time-frequency variance
KR101047194B1 (ko) * 2002-05-03 2011-07-06 하만인터내셔날인더스트리스인코포레이티드 사운드 검출 및 위치측정 시스템
US7072828B2 (en) * 2002-05-13 2006-07-04 Avaya Technology Corp. Apparatus and method for improved voice activity detection
US20040064314A1 (en) * 2002-09-27 2004-04-01 Aubert Nicolas De Saint Methods and apparatus for speech end-point detection
US8326621B2 (en) 2003-02-21 2012-12-04 Qnx Software Systems Limited Repetitive transient noise removal
US8271279B2 (en) 2003-02-21 2012-09-18 Qnx Software Systems Limited Signature noise removal
US7885420B2 (en) * 2003-02-21 2011-02-08 Qnx Software Systems Co. Wind noise suppression system
US8073689B2 (en) * 2003-02-21 2011-12-06 Qnx Software Systems Co. Repetitive transient noise removal
US7725315B2 (en) * 2003-02-21 2010-05-25 Qnx Software Systems (Wavemakers), Inc. Minimization of transient noises in a voice signal
US7895036B2 (en) * 2003-02-21 2011-02-22 Qnx Software Systems Co. System for suppressing wind noise
US7949522B2 (en) 2003-02-21 2011-05-24 Qnx Software Systems Co. System for suppressing rain noise
JP4348970B2 (ja) * 2003-03-06 2009-10-21 ソニー株式会社 情報検出装置及び方法、並びにプログラム
US8918316B2 (en) * 2003-07-29 2014-12-23 Alcatel Lucent Content identification system
GB2405949A (en) * 2003-09-12 2005-03-16 Canon Kk Voice activated device with periodicity determination
GB2405948B (en) * 2003-09-12 2006-06-28 Canon Res Ct Europ Ltd Voice activated device
US7756709B2 (en) * 2004-02-02 2010-07-13 Applied Voice & Speech Technologies, Inc. Detection of voice inactivity within a sound stream
JP4460580B2 (ja) * 2004-07-21 2010-05-12 富士通株式会社 速度変換装置、速度変換方法及びプログラム
US20060100866A1 (en) * 2004-10-28 2006-05-11 International Business Machines Corporation Influencing automatic speech recognition signal-to-noise levels
EP1840877A4 (de) * 2005-01-18 2008-05-21 Fujitsu Ltd Sprachgeschwindigkeits-änderungsverfahren, und sprachgeschwindigkeits-änderungseinrichtung
FR2881867A1 (fr) * 2005-02-04 2006-08-11 France Telecom Procede de transmission de marques de fin de parole dans un systeme de reconnaissance de la parole
US8219391B2 (en) * 2005-02-15 2012-07-10 Raytheon Bbn Technologies Corp. Speech analyzing system with speech codebook
US7962340B2 (en) * 2005-08-22 2011-06-14 Nuance Communications, Inc. Methods and apparatus for buffering data for use in accordance with a speech recognition system
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
JPWO2008007616A1 (ja) * 2006-07-13 2009-12-10 日本電気株式会社 無音声発声の入力警告装置と方法並びにプログラム
KR100883652B1 (ko) * 2006-08-03 2009-02-18 삼성전자주식회사 음성 구간 검출 방법 및 장치, 및 이를 이용한 음성 인식시스템
US8775168B2 (en) * 2006-08-10 2014-07-08 Stmicroelectronics Asia Pacific Pte, Ltd. Yule walker based low-complexity voice activity detector in noise suppression systems
KR100897554B1 (ko) * 2007-02-21 2009-05-15 삼성전자주식회사 분산 음성인식시스템 및 방법과 분산 음성인식을 위한 단말기
JP5089295B2 (ja) * 2007-08-31 2012-12-05 インターナショナル・ビジネス・マシーンズ・コーポレーション 音声処理システム、方法及びプログラム
US8473282B2 (en) 2008-01-25 2013-06-25 Yamaha Corporation Sound processing device and program
JP5169297B2 (ja) * 2008-02-22 2013-03-27 ヤマハ株式会社 音処理装置およびプログラム
US8190440B2 (en) * 2008-02-29 2012-05-29 Broadcom Corporation Sub-band codec with native voice activity detection
US8762150B2 (en) 2010-09-16 2014-06-24 Nuance Communications, Inc. Using codec parameters for endpoint detection in speech recognition
US8942975B2 (en) * 2010-11-10 2015-01-27 Broadcom Corporation Noise suppression in a Mel-filtered spectral domain
US8719019B2 (en) * 2011-04-25 2014-05-06 Microsoft Corporation Speaker identification
US8972256B2 (en) * 2011-10-17 2015-03-03 Nuance Communications, Inc. System and method for dynamic noise adaptation for robust automatic speech recognition
WO2013124862A1 (en) * 2012-02-21 2013-08-29 Tata Consultancy Services Limited Modified mel filter bank structure using spectral characteristics for sound analysis
US9060052B2 (en) 2013-03-13 2015-06-16 Accusonus S.A. Single channel, binaural and multi-channel dereverberation
WO2016028495A1 (en) 2014-08-22 2016-02-25 Sri International Systems for speech-based assessment of a patient's state-of-mind
CN104599675A (zh) * 2015-02-09 2015-05-06 宇龙计算机通信科技(深圳)有限公司 语音处理方法、语音处理装置和终端
US10134425B1 (en) * 2015-06-29 2018-11-20 Amazon Technologies, Inc. Direction-based speech endpointing
US10706873B2 (en) * 2015-09-18 2020-07-07 Sri International Real-time speaker state analytics platform
CN106373592B (zh) * 2016-08-31 2019-04-23 北京华科飞扬科技股份公司 音频容噪断句处理方法及***
CN106157951B (zh) * 2016-08-31 2019-04-23 北京华科飞扬科技股份公司 进行音频断句的自动拆分方法及***
JP2018072723A (ja) * 2016-11-02 2018-05-10 ヤマハ株式会社 音響処理方法および音響処理装置
US11216724B2 (en) * 2017-12-07 2022-01-04 Intel Corporation Acoustic event detection based on modelling of sequence of event subparts
JP6838588B2 (ja) * 2018-08-28 2021-03-03 横河電機株式会社 音声分析装置、音声分析方法、プログラム、および記録媒体
CN110136715B (zh) 2019-05-16 2021-04-06 北京百度网讯科技有限公司 语音识别方法和装置
CN113593539A (zh) * 2020-04-30 2021-11-02 阿里巴巴集团控股有限公司 流式端到端语音识别方法、装置及电子设备
TWI748587B (zh) * 2020-08-04 2021-12-01 瑞昱半導體股份有限公司 聲音事件偵測系統及方法

Family Cites Families (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3873925A (en) 1974-03-07 1975-03-25 Motorola Inc Audio frequency squelch system
US3873926A (en) 1974-05-03 1975-03-25 Motorola Inc Audio frequency squelch system
US4187396A (en) 1977-06-09 1980-02-05 Harris Corporation Voice detector circuit
US4481593A (en) * 1981-10-05 1984-11-06 Exxon Corporation Continuous speech recognition
US4489434A (en) * 1981-10-05 1984-12-18 Exxon Corporation Speech recognition method and apparatus
JPS5868097A (ja) * 1981-10-20 1983-04-22 日産自動車株式会社 車両用音声認識装置
US4484344A (en) 1982-03-01 1984-11-20 Rockwell International Corporation Voice operated switch
JPS6048100A (ja) * 1983-08-26 1985-03-15 松下電器産業株式会社 音声認識装置
JPS60200300A (ja) * 1984-03-23 1985-10-09 松下電器産業株式会社 音声の始端・終端検出装置
US4718092A (en) * 1984-03-27 1988-01-05 Exxon Research And Engineering Company Speech recognition activation and deactivation method
JPS6148898A (ja) * 1984-08-16 1986-03-10 松下電器産業株式会社 音声の有声無声判定装置
US4956865A (en) * 1985-01-30 1990-09-11 Northern Telecom Limited Speech recognition
US4870686A (en) * 1987-10-19 1989-09-26 Motorola, Inc. Method for entering digit sequences by voice command
US5305422A (en) * 1992-02-28 1994-04-19 Panasonic Technologies, Inc. Method for determining boundaries of isolated words within a speech signal
JPH0619498A (ja) * 1992-07-01 1994-01-28 Fujitsu Ltd 音声検出器
US5617508A (en) 1992-10-05 1997-04-01 Panasonic Technologies Inc. Speech detection device for the detection of speech end points based on variance of frequency band limited energy
FR2697101B1 (fr) * 1992-10-21 1994-11-25 Sextant Avionique Procédé de détection de la parole.
US5692104A (en) * 1992-12-31 1997-11-25 Apple Computer, Inc. Method and apparatus for detecting end points of speech activity
US5459814A (en) * 1993-03-26 1995-10-17 Hughes Aircraft Company Voice activity detector for speech signals in variable background noise
US5473726A (en) * 1993-07-06 1995-12-05 The United States Of America As Represented By The Secretary Of The Air Force Audio and amplitude modulated photo data collection for speech recognition
JPH07273738A (ja) * 1994-03-28 1995-10-20 Toshiba Corp 音声送信制御回路
DE4422545A1 (de) 1994-06-28 1996-01-04 Sel Alcatel Ag Start-/Endpunkt-Detektion zur Worterkennung
US5594834A (en) * 1994-09-30 1997-01-14 Motorola, Inc. Method and system for recognizing a boundary between sounds in continuous speech
US5638487A (en) * 1994-12-30 1997-06-10 Purespeech, Inc. Automatic speech recognition
US5778342A (en) * 1996-02-01 1998-07-07 Dspc Israel Ltd. Pattern recognition system and method
US5842161A (en) * 1996-06-25 1998-11-24 Lucent Technologies Inc. Telecommunications instrument employing variable criteria speech recognition
US6570991B1 (en) 1996-12-18 2003-05-27 Interval Research Corporation Multi-feature speech/music discrimination system
JP2000047697A (ja) * 1998-07-30 2000-02-18 Nec Eng Ltd ノイズキャンセラ
US6138095A (en) * 1998-09-03 2000-10-24 Lucent Technologies Inc. Speech recognition
JP3310225B2 (ja) * 1998-09-29 2002-08-05 松下電器産業株式会社 雑音レベル時間変動率計算方法及び装置と雑音低減方法及び装置
GB9822931D0 (en) * 1998-10-20 1998-12-16 Canon Kk Speech processing apparatus and method
US6711536B2 (en) * 1998-10-20 2004-03-23 Canon Kabushiki Kaisha Speech processing apparatus and method
GB9822930D0 (en) * 1998-10-20 1998-12-16 Canon Kk Speech processing apparatus and method
US6249757B1 (en) * 1999-02-16 2001-06-19 3Com Corporation System for detecting voice activity

Also Published As

Publication number Publication date
EP0996110A1 (de) 2000-04-26
EP0996110B1 (de) 2005-08-24
US6711536B2 (en) 2004-03-23
US20030055639A1 (en) 2003-03-20
DE69926851T2 (de) 2006-06-08
US20040158465A1 (en) 2004-08-12
JP2000132177A (ja) 2000-05-12
JP4484283B2 (ja) 2010-06-16

Similar Documents

Publication Publication Date Title
DE69926851D1 (de) Verfahren und Vorrichtung zur Sprachaktivitätsdetektion
DE69831991D1 (de) Verfahren und Vorrichtung zur Sprachdetektion
DE69822687D1 (de) Vorrichtung und Verfahren zur Zusammenfassung
DE69930560D1 (de) Verfahren und Vorrichtung zur Mustererkennung
DE69926195D1 (de) Vorrichtung und Verfahren zur Bildgebung
DE60032669D1 (de) Vorrichtung und Verfahren zur Bandbreitenüberwachung
DE69927328T2 (de) Vorrichtung und Verfahren zur Signalspitzenbegrenzung
DE60018733D1 (de) Vorrichtung und verfahren zur probenanalyse
DE60021077D1 (de) Vorrichtung und verfahren zur probenabgabe
DE69918005D1 (de) Verfahren und Vorrichtung zur selektiven Entfernung von Kohlenmonoxid
DE59601862D1 (de) Verfahren und Vorrichtung zur Elektrolyse
DE69834320D1 (de) Verfahren und Vorrichtung zur Folgeschätzung
DE69715071T2 (de) Verfahren und Vorrichtung zur Sprachverarbeitung
DE69942553D1 (de) Vorrichtung und Verfahren zur Zeitmessung
DE69942295D1 (de) Vorrichtung und verfahren zur informationsverarbeitung
DE69928852D1 (de) Vorrichtung und Verfahren zur Unterstützung der Programmierung
DE69926451D1 (de) Verfahren und Vorrichtung zur Unterdrückung von Mehrkanalechos
DE69803202D1 (de) Verfahren und vorrichtung zur sprachdetektion
DE69823416D1 (de) Verfahren und Vorrichtung zur Branderkennung
DE60031812D1 (de) Vorrichtung und Verfahren zur Klangsynthesierung
DE10081176D2 (de) Verfahren und Vorrichtung zur Objektabtastung
DE69943234D1 (de) Vorrichtung und verfahren zur sprachdekodierung
DE69921066D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69922769D1 (de) Vorrichtung und Verfahren zur Sprachverarbeitung
DE69928456D1 (de) Verfahren und Vorrichtung zur Mustererkennung

Legal Events

Date Code Title Description
8364 No opposition during term of opposition