CN1120371A - 连续语音识别 - Google Patents
连续语音识别 Download PDFInfo
- Publication number
- CN1120371A CN1120371A CN94191651A CN94191651A CN1120371A CN 1120371 A CN1120371 A CN 1120371A CN 94191651 A CN94191651 A CN 94191651A CN 94191651 A CN94191651 A CN 94191651A CN 1120371 A CN1120371 A CN 1120371A
- Authority
- CN
- China
- Prior art keywords
- parameter
- network
- node
- speech recognition
- recognition system
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000000644 propagated effect Effects 0.000 claims abstract description 6
- 238000009825 accumulation Methods 0.000 claims description 21
- 230000014509 gene expression Effects 0.000 claims description 21
- 238000000034 method Methods 0.000 claims description 18
- 238000012545 processing Methods 0.000 claims description 16
- 230000005540 biological transmission Effects 0.000 claims description 7
- 230000008878 coupling Effects 0.000 claims description 6
- 238000010168 coupling process Methods 0.000 claims description 6
- 238000005859 coupling reaction Methods 0.000 claims description 6
- 230000008569 process Effects 0.000 description 8
- 238000013138 pruning Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 5
- 238000012546 transfer Methods 0.000 description 5
- 230000001186 cumulative effect Effects 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 230000009471 action Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000008676 import Effects 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000005039 memory span Effects 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000001902 propagating effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
- Navigation (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (23)
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP93302539 | 1993-03-31 | ||
EP93302539.7 | 1993-03-31 | ||
EP93304503.1 | 1993-06-10 | ||
EP9304503.1 | 1993-06-10 | ||
EP93304503 | 1993-06-10 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1120371A true CN1120371A (zh) | 1996-04-10 |
CN1058097C CN1058097C (zh) | 2000-11-01 |
Family
ID=26134253
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN94191651A Expired - Fee Related CN1058097C (zh) | 1993-03-31 | 1994-03-31 | 连续语音识别 |
Country Status (13)
Country | Link |
---|---|
US (1) | US5819222A (zh) |
EP (1) | EP0695453B1 (zh) |
JP (1) | JPH08508583A (zh) |
KR (1) | KR100312920B1 (zh) |
CN (1) | CN1058097C (zh) |
AU (1) | AU672895B2 (zh) |
CA (1) | CA2157496C (zh) |
DE (1) | DE69421077T2 (zh) |
FI (1) | FI954573A0 (zh) |
NO (1) | NO953894L (zh) |
NZ (1) | NZ263230A (zh) |
SG (1) | SG50489A1 (zh) |
WO (1) | WO1994023425A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112259082A (zh) * | 2020-11-03 | 2021-01-22 | 苏州思必驰信息科技有限公司 | 实时语音识别方法及*** |
Families Citing this family (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8209184B1 (en) | 1997-04-14 | 2012-06-26 | At&T Intellectual Property Ii, L.P. | System and method of providing generated speech via a network |
US6078886A (en) * | 1997-04-14 | 2000-06-20 | At&T Corporation | System and method for providing remote automatic speech recognition services via a packet network |
GB9802836D0 (en) * | 1998-02-10 | 1998-04-08 | Canon Kk | Pattern matching method and apparatus |
US6243695B1 (en) * | 1998-03-18 | 2001-06-05 | Motorola, Inc. | Access control system and method therefor |
US7117149B1 (en) | 1999-08-30 | 2006-10-03 | Harman Becker Automotive Systems-Wavemakers, Inc. | Sound source classification |
US6473735B1 (en) * | 1999-10-21 | 2002-10-29 | Sony Corporation | System and method for speech verification using a confidence measure |
WO2002029615A1 (en) * | 2000-09-30 | 2002-04-11 | Intel Corporation | Search method based on single triphone tree for large vocabulary continuous speech recognizer |
US7191130B1 (en) * | 2002-09-27 | 2007-03-13 | Nuance Communications | Method and system for automatically optimizing recognition configuration parameters for speech recognition systems |
US7885420B2 (en) | 2003-02-21 | 2011-02-08 | Qnx Software Systems Co. | Wind noise suppression system |
US8271279B2 (en) | 2003-02-21 | 2012-09-18 | Qnx Software Systems Limited | Signature noise removal |
US7895036B2 (en) | 2003-02-21 | 2011-02-22 | Qnx Software Systems Co. | System for suppressing wind noise |
US8326621B2 (en) | 2003-02-21 | 2012-12-04 | Qnx Software Systems Limited | Repetitive transient noise removal |
US8073689B2 (en) | 2003-02-21 | 2011-12-06 | Qnx Software Systems Co. | Repetitive transient noise removal |
US7949522B2 (en) | 2003-02-21 | 2011-05-24 | Qnx Software Systems Co. | System for suppressing rain noise |
US7725315B2 (en) | 2003-02-21 | 2010-05-25 | Qnx Software Systems (Wavemakers), Inc. | Minimization of transient noises in a voice signal |
US9117460B2 (en) * | 2004-05-12 | 2015-08-25 | Core Wireless Licensing S.A.R.L. | Detection of end of utterance in speech recognition system |
US8306821B2 (en) | 2004-10-26 | 2012-11-06 | Qnx Software Systems Limited | Sub-band periodic signal enhancement system |
US8543390B2 (en) | 2004-10-26 | 2013-09-24 | Qnx Software Systems Limited | Multi-channel periodic signal enhancement system |
US7680652B2 (en) | 2004-10-26 | 2010-03-16 | Qnx Software Systems (Wavemakers), Inc. | Periodic signal enhancement system |
US7716046B2 (en) | 2004-10-26 | 2010-05-11 | Qnx Software Systems (Wavemakers), Inc. | Advanced periodic signal enhancement |
US7949520B2 (en) | 2004-10-26 | 2011-05-24 | QNX Software Sytems Co. | Adaptive filter pitch extraction |
US8170879B2 (en) | 2004-10-26 | 2012-05-01 | Qnx Software Systems Limited | Periodic signal enhancement system |
US8284947B2 (en) | 2004-12-01 | 2012-10-09 | Qnx Software Systems Limited | Reverberation estimation and suppression system |
US8027833B2 (en) | 2005-05-09 | 2011-09-27 | Qnx Software Systems Co. | System for suppressing passing tire hiss |
US8170875B2 (en) * | 2005-06-15 | 2012-05-01 | Qnx Software Systems Limited | Speech end-pointer |
US8311819B2 (en) | 2005-06-15 | 2012-11-13 | Qnx Software Systems Limited | System for detecting speech with background voice estimates and noise estimates |
KR100737343B1 (ko) * | 2005-12-08 | 2007-07-09 | 한국전자통신연구원 | 음성 인식 장치 및 방법 |
US7844453B2 (en) | 2006-05-12 | 2010-11-30 | Qnx Software Systems Co. | Robust noise estimation |
US8335685B2 (en) | 2006-12-22 | 2012-12-18 | Qnx Software Systems Limited | Ambient noise compensation system robust to high excitation noise |
US8326620B2 (en) | 2008-04-30 | 2012-12-04 | Qnx Software Systems Limited | Robust downlink speech and noise detector |
US8904400B2 (en) | 2007-09-11 | 2014-12-02 | 2236008 Ontario Inc. | Processing system having a partitioning component for resource partitioning |
US8850154B2 (en) | 2007-09-11 | 2014-09-30 | 2236008 Ontario Inc. | Processing system having memory partitioning |
US8694310B2 (en) | 2007-09-17 | 2014-04-08 | Qnx Software Systems Limited | Remote control server protocol system |
US8209514B2 (en) | 2008-02-04 | 2012-06-26 | Qnx Software Systems Limited | Media processing system having resource partitioning |
US8374868B2 (en) | 2009-08-21 | 2013-02-12 | General Motors Llc | Method of recognizing speech |
US9953646B2 (en) | 2014-09-02 | 2018-04-24 | Belleau Technologies | Method and system for dynamic speech recognition and tracking of prewritten script |
CN109215679A (zh) * | 2018-08-06 | 2019-01-15 | 百度在线网络技术(北京)有限公司 | 基于用户情绪的对话方法和装置 |
CN113076335B (zh) * | 2021-04-02 | 2024-05-24 | 西安交通大学 | 一种网络模因检测方法、***、设备及存储介质 |
Family Cites Families (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
USRE31188E (en) * | 1978-10-31 | 1983-03-22 | Bell Telephone Laboratories, Incorporated | Multiple template speech recognition system |
US4348553A (en) * | 1980-07-02 | 1982-09-07 | International Business Machines Corporation | Parallel pattern verifier with dynamic time warping |
US4783804A (en) * | 1985-03-21 | 1988-11-08 | American Telephone And Telegraph Company, At&T Bell Laboratories | Hidden Markov model speech recognition arrangement |
US4980918A (en) * | 1985-05-09 | 1990-12-25 | International Business Machines Corporation | Speech recognition system with efficient storage and rapid assembly of phonological graphs |
DE3750199T2 (de) * | 1986-06-02 | 1995-01-19 | Motorola Inc | System zur Erkennung kontinuierlicher Sprache. |
JP2717652B2 (ja) * | 1986-06-02 | 1998-02-18 | モトローラ・インコーポレーテッド | 連続音声認識システム |
JPH0760318B2 (ja) * | 1986-09-29 | 1995-06-28 | 株式会社東芝 | 連続音声認識方式 |
US4829578A (en) * | 1986-10-02 | 1989-05-09 | Dragon Systems, Inc. | Speech detection and recognition apparatus for use with background noise of varying levels |
US4837831A (en) * | 1986-10-15 | 1989-06-06 | Dragon Systems, Inc. | Method for creating and using multiple-word sound models in speech recognition |
US4803729A (en) * | 1987-04-03 | 1989-02-07 | Dragon Systems, Inc. | Speech recognition method |
US5228110A (en) * | 1989-09-15 | 1993-07-13 | U.S. Philips Corporation | Method for recognizing N different word strings in a speech signal |
DE69128990T2 (de) * | 1990-09-07 | 1998-08-27 | Toshiba Kawasaki Kk | Sprecherkennungsvorrichtung |
FR2677828B1 (fr) * | 1991-06-14 | 1993-08-20 | Sextant Avionique | Procede de detection d'un signal utile bruite. |
JP2870224B2 (ja) * | 1991-06-19 | 1999-03-17 | 松下電器産業株式会社 | 音声認識方法 |
US5388183A (en) * | 1991-09-30 | 1995-02-07 | Kurzwell Applied Intelligence, Inc. | Speech recognition providing multiple outputs |
US5390278A (en) * | 1991-10-08 | 1995-02-14 | Bell Canada | Phoneme based speech recognition |
US5583961A (en) * | 1993-03-25 | 1996-12-10 | British Telecommunications Public Limited Company | Speaker recognition using spectral coefficients normalized with respect to unequal frequency bands |
US5524169A (en) * | 1993-12-30 | 1996-06-04 | International Business Machines Incorporated | Method and system for location-specific speech recognition |
US5621859A (en) * | 1994-01-19 | 1997-04-15 | Bbn Corporation | Single tree method for grammar directed, very large vocabulary speech recognizer |
-
1994
- 1994-03-31 AU AU63836/94A patent/AU672895B2/en not_active Ceased
- 1994-03-31 US US08/530,170 patent/US5819222A/en not_active Expired - Lifetime
- 1994-03-31 NZ NZ263230A patent/NZ263230A/en unknown
- 1994-03-31 EP EP94911279A patent/EP0695453B1/en not_active Expired - Lifetime
- 1994-03-31 DE DE69421077T patent/DE69421077T2/de not_active Expired - Lifetime
- 1994-03-31 CN CN94191651A patent/CN1058097C/zh not_active Expired - Fee Related
- 1994-03-31 WO PCT/GB1994/000714 patent/WO1994023425A1/en active IP Right Grant
- 1994-03-31 SG SG1996002710A patent/SG50489A1/en unknown
- 1994-03-31 KR KR1019950704302A patent/KR100312920B1/ko not_active IP Right Cessation
- 1994-03-31 JP JP6521863A patent/JPH08508583A/ja not_active Ceased
- 1994-03-31 CA CA002157496A patent/CA2157496C/en not_active Expired - Fee Related
-
1995
- 1995-09-27 FI FI954573A patent/FI954573A0/fi unknown
- 1995-09-29 NO NO953894A patent/NO953894L/no unknown
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112259082A (zh) * | 2020-11-03 | 2021-01-22 | 苏州思必驰信息科技有限公司 | 实时语音识别方法及*** |
Also Published As
Publication number | Publication date |
---|---|
AU6383694A (en) | 1994-10-24 |
NO953894L (no) | 1995-11-30 |
KR100312920B1 (ko) | 2001-12-28 |
DE69421077T2 (de) | 2000-07-06 |
WO1994023425A1 (en) | 1994-10-13 |
NZ263230A (en) | 1997-07-27 |
EP0695453B1 (en) | 1999-10-06 |
NO953894D0 (no) | 1995-09-29 |
CN1058097C (zh) | 2000-11-01 |
CA2157496C (en) | 2000-08-15 |
DE69421077D1 (de) | 1999-11-11 |
KR960702144A (ko) | 1996-03-28 |
EP0695453A1 (en) | 1996-02-07 |
JPH08508583A (ja) | 1996-09-10 |
FI954573A (fi) | 1995-09-27 |
AU672895B2 (en) | 1996-10-17 |
FI954573A0 (fi) | 1995-09-27 |
US5819222A (en) | 1998-10-06 |
CA2157496A1 (en) | 1994-10-13 |
SG50489A1 (en) | 1998-07-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1058097C (zh) | 连续语音识别 | |
CN1196104C (zh) | 语音处理 | |
Soong et al. | A Tree. Trellis based fast search for finding the n best sentence hypotheses in continuous speech recognition | |
CN1303582C (zh) | 自动语音归类方法 | |
US6587844B1 (en) | System and methods for optimizing networks of weighted unweighted directed graphs | |
CN1169115C (zh) | 语音合成***及方法 | |
CN1211779C (zh) | 语音识别***中确定非目标语言的方法和装置 | |
EP0387602B1 (en) | Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system | |
US8527273B2 (en) | Systems and methods for determining the N-best strings | |
US20080294433A1 (en) | Automatic Text-Speech Mapping Tool | |
US20020087311A1 (en) | Computer-implemented dynamic language model generation method and system | |
CN1667699A (zh) | 为字母-声音转换生成有互信息标准的大文法音素单元 | |
EP1168199A3 (en) | Indexing method and apparatus | |
KR20080069990A (ko) | 음성 세그먼트 색인 및 검색 방법과 컴퓨터 실행 가능명령어를 갖는 컴퓨터 판독 가능 매체 | |
CN1748245A (zh) | 三级单个单词识别 | |
CN104040626A (zh) | 多译码模式信号分类 | |
CN1949211A (zh) | 一种新的汉语口语解析方法及装置 | |
US6230128B1 (en) | Path link passing speech recognition with vocabulary node being capable of simultaneously processing plural path links | |
US10402492B1 (en) | Processing natural language grammar | |
Švec et al. | Semantic entity detection from multiple ASR hypotheses within the WFST framework | |
CN1126052C (zh) | 采用多个文法网络的语音识别的方法 | |
JP2000075892A (ja) | 音声認識のための統計的言語モデル作成方法および装置 | |
EP0692134B1 (en) | Speech processing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CI01 | Publication of corrected invention patent application |
Correction item: Priority number Correct: 93304503.1 False: 9304503.1 Number: 15 Page: 171 Volume: 12 Correction item: Agency Correct: Yongxin Patent and Trademark Agency Co., Ltd. False: China Patent Agent (H.K.) Ltd. Number: 15 Page: 171 Volume: 12 |
|
ERR | Gazette correction |
Free format text: CORRECT: NUMBER OF PRIORITY AGENCY; FROM: 9304503.1 CHINA PATENT AGENT(XIANG GANG)LTD. TO: 93304503.1 YONGXIN PATENT AND TRADEMARK AGENT CO. LTD. |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20001101 Termination date: 20130331 |