CN1251194A - 识别*** - Google Patents
识别*** Download PDFInfo
- Publication number
- CN1251194A CN1251194A CN98803644A CN98803644A CN1251194A CN 1251194 A CN1251194 A CN 1251194A CN 98803644 A CN98803644 A CN 98803644A CN 98803644 A CN98803644 A CN 98803644A CN 1251194 A CN1251194 A CN 1251194A
- Authority
- CN
- China
- Prior art keywords
- data vector
- model
- vector
- compensation
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 239000013598 vector Substances 0.000 claims abstract description 208
- 230000009466 transformation Effects 0.000 claims abstract description 17
- 239000011159 matrix material Substances 0.000 claims description 50
- 238000000034 method Methods 0.000 claims description 47
- 238000006243 chemical reaction Methods 0.000 claims description 42
- 238000012937 correction Methods 0.000 claims description 18
- 238000009795 derivation Methods 0.000 claims description 18
- 238000009826 distribution Methods 0.000 claims description 16
- 238000006073 displacement reaction Methods 0.000 claims description 8
- 230000011218 segmentation Effects 0.000 claims description 5
- 238000012935 Averaging Methods 0.000 claims description 4
- 238000012986 modification Methods 0.000 claims description 2
- 230000004048 modification Effects 0.000 claims description 2
- 230000009467 reduction Effects 0.000 claims description 2
- 230000005540 biological transmission Effects 0.000 claims 2
- 238000004364 calculation method Methods 0.000 claims 1
- 230000004044 response Effects 0.000 abstract description 12
- 230000003595 spectral effect Effects 0.000 abstract description 11
- 230000008878 coupling Effects 0.000 description 25
- 238000010168 coupling process Methods 0.000 description 25
- 238000005859 coupling reaction Methods 0.000 description 25
- 230000008569 process Effects 0.000 description 17
- 238000001914 filtration Methods 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 8
- 238000003860 storage Methods 0.000 description 8
- 230000000694 effects Effects 0.000 description 7
- 230000006978 adaptation Effects 0.000 description 6
- 230000008901 benefit Effects 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 238000012549 training Methods 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 230000006872 improvement Effects 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 230000014509 gene expression Effects 0.000 description 3
- 230000002889 sympathetic effect Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000003491 array Methods 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 238000005266 casting Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000004870 electrical engineering Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 238000005286 illumination Methods 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 238000002407 reforming Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- GOLXNESZZPUPJE-UHFFFAOYSA-N spiromesifen Chemical compound CC1=CC(C)=CC(C)=C1C(C(O1)=O)=C(OC(=O)CC(C)(C)C)C11CCCC1 GOLXNESZZPUPJE-UHFFFAOYSA-N 0.000 description 1
- 238000005309 stochastic process Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Complex Calculations (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
Description
麦克风位置 | 不具有SSA的出错率 | 具有SSA的出错率 |
标准中央低颏 | Mic A Mic B22.6% 47.0%37.8% 73.8%31.7% 71.3%22.0% 76.2% | Mic A Mic B20.7% 22.0%26.2% 33.5%23.8% 20.1%13.4% 25.6% |
滤波器组频道号码 | 频道中心频率(Hz) | Bi,i | Bi,i+1 | Bi,i+2 |
0 | 0 | 1 | 0 | 0 |
1 | 120 | 0.7 | 0.3 | 0 |
2 | 240 | 0.5 | 0.5 | 0 |
3 | 360 | 0.3 | 0.6 | 0.1 |
4 | 481 | 0.2 | 0.7 | 0.1 |
5 | 603 | 0.1 | 0.5 | 0.4 |
6 | 729 | 0.1 | 0.5 | 0.4 |
7 | 859 | 0.1 | 0.5 | 0.4 |
8 | 994 | 0.1 | 0.5 | 0.4 |
9 | 1136 | 0.1 | 0.5 | 0.4 |
10 | 1286 | 0.1 | 0.5 | 0.4 |
11 | 1445 | 0.1 | 0.5 | 0.4 |
12 | 1615 | 0.1 | 0.5 | 0.4 |
13 | 1796 | 0.1 | 0.5 | 0.4 |
14 | 1990 | 0.1 | 0.5 | 0.4 |
15 | 2198 | 0.1 | 0.6 | 0.3 |
16 | 2421 | 0.1 | 0.6 | 0.3 |
17 | 2670 | 0.2 | 0.6 | 0.2 |
18 | 2962 | 0.3 | 0.6 | 0.1 |
19 | 3315 | 0.4 | 0.6 | 0 |
20 | 3747 | 0.7 | 0.3 | 0 |
21 | 4277 | 1 | 0 | 0 |
22 | 4921 | 1 | 0 | 0 |
23 | 5700 | 1 | 0 | 0 |
24 | 6629 | 1 | 0 | 0 |
25 | 7728 | 1 | 0 | 0 |
Claims (23)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB9706174.1A GB9706174D0 (en) | 1997-03-25 | 1997-03-25 | Recognition system |
GB9706174.1 | 1997-03-25 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1251194A true CN1251194A (zh) | 2000-04-19 |
CN1168069C CN1168069C (zh) | 2004-09-22 |
Family
ID=10809832
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB988036444A Expired - Fee Related CN1168069C (zh) | 1997-03-25 | 1998-02-24 | 识别***和识别方法 |
Country Status (9)
Country | Link |
---|---|
US (1) | US6671666B1 (zh) |
EP (1) | EP0970462B1 (zh) |
JP (1) | JP2001517325A (zh) |
KR (2) | KR20010005674A (zh) |
CN (1) | CN1168069C (zh) |
CA (1) | CA2284484A1 (zh) |
DE (1) | DE69836580D1 (zh) |
GB (2) | GB9706174D0 (zh) |
WO (1) | WO1998043237A1 (zh) |
Families Citing this family (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6505160B1 (en) | 1995-07-27 | 2003-01-07 | Digimarc Corporation | Connected audio and other media objects |
US6182036B1 (en) * | 1999-02-23 | 2001-01-30 | Motorola, Inc. | Method of extracting features in a voice recognition system |
GB9913773D0 (en) * | 1999-06-14 | 1999-08-11 | Simpson Mark C | Speech signal processing |
GB2355834A (en) | 1999-10-29 | 2001-05-02 | Nokia Mobile Phones Ltd | Speech recognition |
US7006787B1 (en) | 2000-02-14 | 2006-02-28 | Lucent Technologies Inc. | Mobile to mobile digital wireless connection having enhanced voice quality |
US6990446B1 (en) * | 2000-10-10 | 2006-01-24 | Microsoft Corporation | Method and apparatus using spectral addition for speaker recognition |
US7457750B2 (en) * | 2000-10-13 | 2008-11-25 | At&T Corp. | Systems and methods for dynamic re-configurable speech recognition |
EP1229516A1 (en) * | 2001-01-26 | 2002-08-07 | Telefonaktiebolaget L M Ericsson (Publ) | Method, device, terminal and system for the automatic recognition of distorted speech data |
US6985858B2 (en) * | 2001-03-20 | 2006-01-10 | Microsoft Corporation | Method and apparatus for removing noise from feature vectors |
JP2005249816A (ja) | 2004-03-01 | 2005-09-15 | Internatl Business Mach Corp <Ibm> | 信号強調装置、方法及びプログラム、並びに音声認識装置、方法及びプログラム |
US7512536B2 (en) * | 2004-05-14 | 2009-03-31 | Texas Instruments Incorporated | Efficient filter bank computation for audio coding |
US7643686B2 (en) * | 2004-11-17 | 2010-01-05 | Eastman Kodak Company | Multi-tiered image clustering by event |
US7567903B1 (en) | 2005-01-12 | 2009-07-28 | At&T Intellectual Property Ii, L.P. | Low latency real-time vocal tract length normalization |
US8219391B2 (en) * | 2005-02-15 | 2012-07-10 | Raytheon Bbn Technologies Corp. | Speech analyzing system with speech codebook |
US7970613B2 (en) | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation |
US8010358B2 (en) | 2006-02-21 | 2011-08-30 | Sony Computer Entertainment Inc. | Voice recognition with parallel gender and age normalization |
US7778831B2 (en) | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
US7831431B2 (en) | 2006-10-31 | 2010-11-09 | Honda Motor Co., Ltd. | Voice recognition updates via remote broadcast signal |
JP4591793B2 (ja) * | 2008-04-22 | 2010-12-01 | ソニー株式会社 | 推定装置および方法、並びにプログラム |
WO2009133719A1 (ja) * | 2008-04-30 | 2009-11-05 | 日本電気株式会社 | 音響モデル学習装置および音声認識装置 |
US8543393B2 (en) | 2008-05-20 | 2013-09-24 | Calabrio, Inc. | Systems and methods of improving automated speech recognition accuracy using statistical analysis of search terms |
US8645135B2 (en) * | 2008-09-12 | 2014-02-04 | Rosetta Stone, Ltd. | Method for creating a speech model |
US8442829B2 (en) | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory |
US8788256B2 (en) | 2009-02-17 | 2014-07-22 | Sony Computer Entertainment Inc. | Multiple language voice recognition |
US8442833B2 (en) | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones |
CN101566999B (zh) * | 2009-06-02 | 2010-11-17 | 哈尔滨工业大学 | 一种快速音频检索的方法 |
US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis |
JP5844921B2 (ja) * | 2012-11-21 | 2016-01-20 | パナソニック株式会社 | 複合材料中の繊維状フィラーの3次元画像処理方法および3次元画像処理装置 |
US10685131B1 (en) * | 2017-02-03 | 2020-06-16 | Rockloans Marketplace Llc | User authentication |
KR20200140571A (ko) * | 2019-06-07 | 2020-12-16 | 삼성전자주식회사 | 데이터 인식 방법 및 장치 |
CN112104340B (zh) * | 2020-09-08 | 2024-04-16 | 华北电力大学 | 一种基于HMM模型和Kalman滤波技术的开关量输入模块BIT降虚警方法 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2737624B2 (ja) * | 1993-12-27 | 1998-04-08 | 日本電気株式会社 | 音声認識装置 |
JP2780676B2 (ja) * | 1995-06-23 | 1998-07-30 | 日本電気株式会社 | 音声認識装置及び音声認識方法 |
US5796924A (en) * | 1996-03-19 | 1998-08-18 | Motorola, Inc. | Method and system for selecting pattern recognition training vectors |
CA2281746A1 (en) * | 1997-03-25 | 1998-10-01 | Robert William Series | Speech analysis system |
-
1997
- 1997-03-25 GB GBGB9706174.1A patent/GB9706174D0/en not_active Ceased
- 1997-07-09 GB GBGB9714345.7A patent/GB9714345D0/en active Pending
-
1998
- 1998-02-24 CA CA002284484A patent/CA2284484A1/en not_active Abandoned
- 1998-02-24 US US09/381,571 patent/US6671666B1/en not_active Expired - Lifetime
- 1998-02-24 KR KR1019997008742A patent/KR20010005674A/ko not_active Application Discontinuation
- 1998-02-24 WO PCT/GB1998/000593 patent/WO1998043237A1/en active IP Right Grant
- 1998-02-24 CN CNB988036444A patent/CN1168069C/zh not_active Expired - Fee Related
- 1998-02-24 DE DE69836580T patent/DE69836580D1/de not_active Expired - Lifetime
- 1998-02-24 EP EP98907056A patent/EP0970462B1/en not_active Expired - Lifetime
- 1998-02-24 JP JP54444798A patent/JP2001517325A/ja active Pending
- 1998-02-26 KR KR1019997008753A patent/KR20010005685A/ko not_active Application Discontinuation
Also Published As
Publication number | Publication date |
---|---|
GB9706174D0 (en) | 1997-11-19 |
GB9714345D0 (en) | 1997-11-19 |
CA2284484A1 (en) | 1998-10-01 |
DE69836580D1 (de) | 2007-01-18 |
EP0970462A1 (en) | 2000-01-12 |
KR20010005685A (ko) | 2001-01-15 |
WO1998043237A1 (en) | 1998-10-01 |
JP2001517325A (ja) | 2001-10-02 |
CN1168069C (zh) | 2004-09-22 |
US6671666B1 (en) | 2003-12-30 |
KR20010005674A (ko) | 2001-01-15 |
EP0970462B1 (en) | 2006-12-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1168069C (zh) | 识别***和识别方法 | |
US7995767B2 (en) | Sound signal processing method and apparatus | |
CN1121681C (zh) | 语言处理 | |
CN1188831C (zh) | 具有多个话音识别引擎的话音识别***和方法 | |
AU751333B2 (en) | Method and device for blind equalizing of transmission channel effects on a digital speech signal | |
EP0470245B1 (en) | Method for spectral estimation to improve noise robustness for speech recognition | |
US20070223731A1 (en) | Sound source separating device, method, and program | |
US5890113A (en) | Speech adaptation system and speech recognizer | |
CN1199488A (zh) | 模式识别 | |
US5734793A (en) | System for recognizing spoken sounds from continuous speech and method of using same | |
CN111128211B (zh) | 一种语音分离方法及装置 | |
Sugamura et al. | Isolated word recognition using phoneme-like templates | |
US6470314B1 (en) | Method and apparatus for rapid adapt via cumulative distribution function matching for continuous speech | |
CN1251193A (zh) | 语音分析*** | |
JP2010049249A (ja) | 音声認識装置及び音声認識装置のマスク生成方法 | |
CN109637555B (zh) | 一种商务会议用日语语音识别翻译*** | |
CN107919136B (zh) | 一种基于高斯混合模型的数字语音采样频率估计方法 | |
JPH01202798A (ja) | 音声認識方法 | |
JPH04369698A (ja) | 音声認識方式 | |
CN117854540B (zh) | 基于神经网络和多维特征融合的水声目标识别方法及*** | |
Sahoo et al. | Word extraction from speech recognition using correlation coefficients | |
CN112863525B (zh) | 一种语音波达方向的估计方法、装置及电子设备 | |
EP4171064A1 (en) | Spatial dependent feature extraction in neural network based audio processing | |
Zhang et al. | Acoustic Simulation in Dynamic Environments for Robot Audition | |
Bilmes | Joint distributional modeling with cross-correlation based features |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: JINNITICK CO., LTD. Free format text: FORMER OWNER: ENGLAND MINISTRY OF NATIONAL DEFENCE Effective date: 20041224 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20041224 Address after: London, England Patentee after: Qinitik Co., Ltd. Address before: England Hampshire Patentee before: British Ministry of Defence |
|
ASS | Succession or assignment of patent right |
Owner name: AIRONIX CO., LTD. Free format text: FORMER OWNER: JINNITICK CO., LTD. Effective date: 20061124 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20061124 Address after: Wooster County, England Patentee after: Aurius Co. Ltd. Address before: London, England Patentee before: Qinitik Co., Ltd. |
|
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20040922 |