CN101359472A - Human voice distinguishing method and device - Google Patents
Human voice distinguishing method and device
- Publication number
- CN101359472A (application number CN200810167142.1A)
- Authority
- CN
- China
- Prior art keywords
- transition
- maximum value
- voice
- segmentation
- sound signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Telephone Function (AREA)
Abstract
Description
Claims (13)
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200810167142.1A CN101359472B (zh) | 2008-09-26 | 2008-09-26 | Human voice distinguishing method and device |
EP09817165.5A EP2328143B8 (en) | 2008-09-26 | 2009-09-15 | Human voice distinguishing method and device |
PCT/CN2009/001037 WO2010037251A1 (zh) | 2008-09-26 | 2009-09-15 | Human voice distinguishing method and device |
US13/001,596 US20110166857A1 (en) | 2008-09-26 | 2009-09-15 | Human Voice Distinguishing Method and Device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200810167142.1A CN101359472B (zh) | 2008-09-26 | 2008-09-26 | Human voice distinguishing method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101359472A (zh) | 2009-02-04 |
CN101359472B (zh) | 2011-07-20 |
Family
ID=40331902
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200810167142.1A Active CN101359472B (zh) | Human voice distinguishing method and device | 2008-09-26 | 2008-09-26 |
Country Status (4)
Country | Link |
---|---|
US (1) | US20110166857A1 (zh) |
EP (1) | EP2328143B8 (zh) |
CN (1) | CN101359472B (zh) |
WO (1) | WO2010037251A1 (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
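WO2010037251A1 (zh) * | 2008-09-26 | 2010-04-08 | 炬力集成电路设计有限公司 | Human voice distinguishing method and device |
CN104916288A (zh) * | 2014-03-14 | 2015-09-16 | 深圳Tcl新技术有限公司 | Method and device for highlighting the human voice in audio |
CN109545191A (zh) * | 2018-11-15 | 2019-03-29 | 电子科技大学 | Real-time detection method for the starting position of the human voice in a song |
CN113131965A (zh) * | 2021-04-16 | 2021-07-16 | 成都天奥信息科技有限公司 | Remote control device for civil aviation VHF ground-air communication radio stations and human voice distinguishing method |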
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110890104B (zh) * | 2019-11-26 | 2022-05-03 | 思必驰科技股份有限公司 | Voice endpoint detection method and *** |
Family Cites Families (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6236964B1 (en) * | 1990-02-01 | 2001-05-22 | Canon Kabushiki Kaisha | Speech recognition apparatus and method for matching inputted speech and a word generated from stored referenced phoneme data |
US6411928B2 (en) * | 1990-02-09 | 2002-06-25 | Sanyo Electric | Apparatus and method for recognizing voice with reduced sensitivity to ambient noise |
US5457769A (en) * | 1993-03-30 | 1995-10-10 | Earmark, Inc. | Method and apparatus for detecting the presence of human voice signals in audio signals |
JPH07287589A (ja) * | 1994-04-15 | 1995-10-31 | Toyo Commun Equip Co Ltd | Voice section detection device |
US5768263A (en) * | 1995-10-20 | 1998-06-16 | Vtel Corporation | Method for talk/listen determination and multipoint conferencing system using such method |
US6314392B1 (en) * | 1996-09-20 | 2001-11-06 | Digital Equipment Corporation | Method and apparatus for clustering-based signal segmentation |
US6507814B1 (en) * | 1998-08-24 | 2003-01-14 | Conexant Systems, Inc. | Pitch determination using speech classification and prior pitch estimation |
JP2001166783A (ja) * | 1999-12-10 | 2001-06-22 | Sanyo Electric Co Ltd | Voice section detection method |
US7127392B1 (en) * | 2003-02-12 | 2006-10-24 | The United States Of America As Represented By The National Security Agency | Device for and method of detecting voice activity |
JP3963850B2 (ja) * | 2003-03-11 | 2007-08-22 | 富士通株式会社 | Voice section detection device |
DE10327239A1 (de) * | 2003-06-17 | 2005-01-27 | Opticom Dipl.-Ing. Michael Keyhl Gmbh | Device and method for extracting a test signal section from an audio signal |
CN100375996C (zh) * | 2003-08-19 | 2008-03-19 | 联发科技股份有限公司 | Method and related device for determining whether a low-frequency sound signal is mixed into a sound signal |
FI118704B (fi) * | 2003-10-07 | 2008-02-15 | Nokia Corp | Method and device for performing source coding |
US20050096900A1 (en) * | 2003-10-31 | 2005-05-05 | Bossemeyer Robert W. | Locating and confirming glottal events within human speech signals |
US7672835B2 (en) * | 2004-12-24 | 2010-03-02 | Casio Computer Co., Ltd. | Voice analysis/synthesis apparatus and program |
CA2613145A1 (en) * | 2005-06-24 | 2006-12-28 | Monash University | Speech analysis system |
CN102222498B (zh) * | 2005-10-20 | 2013-05-01 | 日本电气株式会社 | Sound discrimination ***, sound discrimination method, and sound discrimination program |
US8121835B2 (en) * | 2007-03-21 | 2012-02-21 | Texas Instruments Incorporated | Automatic level control of speech signals |
GB2450886B (en) * | 2007-07-10 | 2009-12-16 | Motorola Inc | Voice activity detector and a method of operation |
US8630848B2 (en) * | 2008-05-30 | 2014-01-14 | Digital Rise Technology Co., Ltd. | Audio signal transient detection |
US20100017203A1 (en) * | 2008-07-15 | 2010-01-21 | Texas Instruments Incorporated | Automatic level control of speech signals |
CN101359472B (zh) * | 2008-09-26 | 2011-07-20 | 炬力集成电路设计有限公司 | Human voice distinguishing method and device |
JP2011065093A (ja) * | 2009-09-18 | 2011-03-31 | Toshiba Corp | Audio signal correction device and audio signal correction method |
2008
- 2008-09-26 CN CN200810167142.1A (published as CN101359472B): Active
2009
- 2009-09-15 WO PCT/CN2009/001037 (published as WO2010037251A1): Application Filing
- 2009-09-15 US US13/001,596 (published as US20110166857A1): Abandoned
- 2009-09-15 EP EP09817165.5A (published as EP2328143B8): Active
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
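WO2010037251A1 (zh) * | 2008-09-26 | 2010-04-08 | 炬力集成电路设计有限公司 | Human voice distinguishing method and device |
CN104916288A (zh) * | 2014-03-14 | 2015-09-16 | 深圳Tcl新技术有限公司 | Method and device for highlighting the human voice in audio |
CN104916288B (zh) * | 2014-03-14 | 2019-01-18 | 深圳Tcl新技术有限公司 | Method and device for highlighting the human voice in audio |
CN109545191A (zh) * | 2018-11-15 | 2019-03-29 | 电子科技大学 | Real-time detection method for the starting position of the human voice in a song |
CN109545191B (zh) * | 2018-11-15 | 2022-11-25 | 电子科技大学 | Real-time detection method for the starting position of the human voice in a song |
CN113131965A (zh) * | 2021-04-16 | 2021-07-16 | 成都天奥信息科技有限公司 | Remote control device for civil aviation VHF ground-air communication radio stations and human voice distinguishing method |
CN113131965B (zh) * | 2021-04-16 | 2023-11-07 | 成都天奥信息科技有限公司 | Remote control device for civil aviation VHF ground-air communication radio stations and human voice distinguishing method |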
Also Published As
Publication number | Publication date |
---|---|
EP2328143A4 (en) | 2012-06-13 |
CN101359472B (zh) | 2011-07-20 |
WO2010037251A1 (zh) | 2010-04-08 |
EP2328143B8 (en) | 2016-06-22 |
EP2328143A1 (en) | 2011-06-01 |
EP2328143B1 (en) | 2016-04-13 |
US20110166857A1 (en) | 2011-07-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10789290B2 (en) | Audio data processing method and apparatus, and computer storage medium | |
Dean et al. | The QUT-NOISE-TIMIT corpus for evaluation of voice activity detection algorithms | |
JP4568371B2 (ja) | Computerized method and computer program for distinguishing between at least two event classes | |
Kennedy et al. | Laughter detection in meetings | |
JP5331784B2 (ja) | Speech end-pointer | |
US8442833B2 (en) | Speech processing with source location estimation using signals from two or more microphones | |
CN110706690A (zh) | Speech recognition method and device | |
EP1909263A1 (en) | Exploitation of language identification of media file data in speech dialog systems | |
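CN101359472B (zh) | Human voice distinguishing method and device | |
CN107274906A (zh) | Voice information processing method, device, terminal, and storage medium | |
CN102446504B (zh) | Speech/music recognition method and device | |
CN104079247A (zh) | Equalizer controller and control method | |
CN103915093B (zh) | Method and device for converting speech into singing | |
CN101578659A (zh) | Voice quality conversion device and voice quality conversion method | |
CN112133277B (zh) | Sample generation method and device | |
CN105706167B (zh) | Method and device for detecting voiced speech | |
Rossignol et al. | Feature extraction and temporal segmentation of acoustic signals | |
JP2002136764A (ja) | Entertainment device, method, and storage medium for reflecting input voice in a character's motion | |
CN102237085A (zh) | Audio signal classification method and device | |
CN107274892A (zh) | Speaker recognition method and device | |
CN104364845A (zh) | Processing device, processing method, program, computer-readable information recording medium, and processing *** | |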
US20050159942A1 (en) | Classification of speech and music using linear predictive coding coefficients | |
JP4696418B2 (ja) | Information detection device and method | |
WO2007049879A1 (en) | Apparatus for vocal-cord signal recognition and method thereof | |
US20150112687A1 (en) | Method for rerecording audio materials and device for implementation thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | Effective date of registration: 2017-06-12; Address after: 519085 C District, 1# workshop, No. 1, science and technology No. four road, hi tech Zone, Zhuhai, Guangdong, China; Patentee after: ACTIONS (ZHUHAI) TECHNOLOGY CO., LTD.; Address before: 519085 No. 1, unit 15, building 1, 1 Da Ha Road, Tang Wan Town, Guangdong, Zhuhai; Patentee before: Juli Integrated Circuit Design Co., Ltd. |
TR01 | Transfer of patent right | Effective date of registration: 2019-10-10; Address after: Room 1101, Wanguo building office, intersection of Tongling North Road and North 2nd Ring Road, Xinzhan District, Hefei City, Anhui Province, 230000; Patentee after: Hefei Torch Core Intelligent Technology Co., Ltd.; Address before: 519085 High-tech Zone, Tangjiawan Town, Zhuhai City, Guangdong Province; Patentee before: Torch Core (Zhuhai) Technology Co., Ltd. |