ATE282881T1 - Vocoder-based voice recognizer - Google Patents
Vocoder-based voice recognizer
- Publication number
- ATE282881T1 (application AT98933871T)
- Authority
- AT
- Austria
- Prior art keywords
- word
- vocoder
- lpc
- data
- recognition features
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Mobile Radio Communication Systems (AREA)
- Telephonic Communication Services (AREA)
- Photoreceptors In Electrophotography (AREA)
- Steering Control In Accordance With Driving Conditions (AREA)
- Telephone Function (AREA)
- Diaphragms For Electromechanical Transducers (AREA)
- Character Discrimination (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/002,616 US6003004A (en) | 1998-01-08 | 1998-01-08 | Speech recognition method and system using compressed speech data |
PCT/IL1998/000341 WO1999035639A1 (en) | 1998-01-08 | 1998-07-22 | A vocoder-based voice recognizer |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE282881T1 (de) | 2004-12-15 |
Family
ID=21701631
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AT98933871T ATE282881T1 (de) | 1998-01-08 | 1998-07-22 | Vocoder-based voice recognizer |
Country Status (12)
Country | Link |
---|---|
US (3) | US6003004A (de) |
EP (1) | EP1046154B1 (de) |
JP (1) | JP2001510595A (de) |
KR (1) | KR100391287B1 (de) |
CN (1) | CN1125432C (de) |
AT (1) | ATE282881T1 (de) |
AU (1) | AU8355398A (de) |
DE (1) | DE69827667T2 (de) |
IL (1) | IL132449A (de) |
RU (1) | RU99124623A (de) |
TW (1) | TW394925B (de) |
WO (1) | WO1999035639A1 (de) |
Families Citing this family (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6370504B1 (en) * | 1997-05-29 | 2002-04-09 | University Of Washington | Speech recognition on MPEG/Audio encoded files |
US6134283A (en) * | 1997-11-18 | 2000-10-17 | Amati Communications Corporation | Method and system for synchronizing time-division-duplexed transceivers |
US6003004A (en) * | 1998-01-08 | 1999-12-14 | Advanced Recognition Technologies, Inc. | Speech recognition method and system using compressed speech data |
KR100277105B1 (ko) * | 1998-02-27 | 2001-01-15 | 윤종용 | Apparatus and method for determining speech recognition data |
US6223157B1 (en) * | 1998-05-07 | 2001-04-24 | Dsc Telecom, L.P. | Method for direct recognition of encoded speech data |
JP4081858B2 (ja) * | 1998-06-04 | 2008-04-30 | ソニー株式会社 | Computer system, computer terminal device, and recording medium |
US6321197B1 (en) * | 1999-01-22 | 2001-11-20 | Motorola, Inc. | Communication device and method for endpointing speech utterances |
US6411926B1 (en) * | 1999-02-08 | 2002-06-25 | Qualcomm Incorporated | Distributed voice recognition system |
US6792405B2 (en) * | 1999-12-10 | 2004-09-14 | At&T Corp. | Bitstream-based feature extraction method for a front-end speech recognizer |
US6795698B1 (en) * | 2000-04-12 | 2004-09-21 | Northrop Grumman Corporation | Method and apparatus for embedding global positioning system (GPS) data in mobile telephone call data |
US6564182B1 (en) | 2000-05-12 | 2003-05-13 | Conexant Systems, Inc. | Look-ahead pitch determination |
US6999923B1 (en) * | 2000-06-23 | 2006-02-14 | International Business Machines Corporation | System and method for control of lights, signals, alarms using sound detection |
US7203651B2 (en) * | 2000-12-07 | 2007-04-10 | Art-Advanced Recognition Technologies, Ltd. | Voice control system with multiple voice recognition engines |
US7155387B2 (en) * | 2001-01-08 | 2006-12-26 | Art - Advanced Recognition Technologies Ltd. | Noise spectrum subtraction method and system |
US7089184B2 (en) * | 2001-03-22 | 2006-08-08 | Nurv Center Technologies, Inc. | Speech recognition for recognizing speaker-independent, continuous speech |
US20030028386A1 (en) * | 2001-04-02 | 2003-02-06 | Zinser Richard L. | Compressed domain universal transcoder |
US7319703B2 (en) * | 2001-09-04 | 2008-01-15 | Nokia Corporation | Method and apparatus for reducing synchronization delay in packet-based voice terminals by resynchronizing during talk spurts |
US7050969B2 (en) * | 2001-11-27 | 2006-05-23 | Mitsubishi Electric Research Laboratories, Inc. | Distributed speech recognition with codec parameters |
US7079657B2 (en) * | 2002-02-26 | 2006-07-18 | Broadcom Corporation | System and method of performing digital multi-channel audio signal decoding |
US7024353B2 (en) * | 2002-08-09 | 2006-04-04 | Motorola, Inc. | Distributed speech recognition with back-end voice activity detection apparatus and method |
US20040073428A1 (en) * | 2002-10-10 | 2004-04-15 | Igor Zlokarnik | Apparatus, methods, and programming for speech synthesis via bit manipulations of compressed database |
FI20021936A (fi) * | 2002-10-31 | 2004-05-01 | Nokia Corp | Variable-rate speech codec |
CN1302454C (zh) * | 2003-07-11 | 2007-02-28 | 中国科学院声学研究所 | Probability-weighted-average missing feature data reconstruction method for speech recognition |
US7558736B2 (en) * | 2003-12-31 | 2009-07-07 | United States Cellular Corporation | System and method for providing talker arbitration in point-to-point/group communication |
KR100647290B1 (ko) * | 2004-09-22 | 2006-11-23 | 삼성전자주식회사 | Speech encoding/decoding apparatus and method that select quantization/dequantization using characteristics of synthesized speech |
US7533018B2 (en) * | 2004-10-19 | 2009-05-12 | Motorola, Inc. | Tailored speaker-independent voice recognition system |
US20060095261A1 (en) * | 2004-10-30 | 2006-05-04 | Ibm Corporation | Voice packet identification based on celp compression parameters |
US20060224381A1 (en) * | 2005-04-04 | 2006-10-05 | Nokia Corporation | Detecting speech frames belonging to a low energy sequence |
US7697827B2 (en) | 2005-10-17 | 2010-04-13 | Konicek Jeffrey C | User-friendlier interfaces for a camera |
GB0710211D0 (en) * | 2007-05-29 | 2007-07-11 | Intrasonics Ltd | AMR Spectrography |
US20090094026A1 (en) * | 2007-10-03 | 2009-04-09 | Binshi Cao | Method of determining an estimated frame energy of a communication |
US9208796B2 (en) | 2011-08-22 | 2015-12-08 | Genband Us Llc | Estimation of speech energy based on code excited linear prediction (CELP) parameters extracted from a partially-decoded CELP-encoded bit stream and applications of same |
MX342965B (es) * | 2013-04-05 | 2016-10-19 | Dolby Laboratories Licensing Corp | Companding system and method to reduce quantization noise using advanced spectral extension |
CN104683959B (zh) * | 2013-11-27 | 2018-09-18 | 深圳市盛天龙视听科技有限公司 | Instant-messaging portable audio device and account loading method thereof |
KR20150096217A (ko) * | 2014-02-14 | 2015-08-24 | 한국전자통신연구원 | Digital data compression method and apparatus |
TWI631556B (zh) * | 2017-05-05 | 2018-08-01 | 英屬開曼群島商捷鼎創新股份有限公司 | Data compression device and data compression method thereof |
US10460749B1 (en) * | 2018-06-28 | 2019-10-29 | Nuvoton Technology Corporation | Voice activity detection using vocal tract area information |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3909532A (en) * | 1974-03-29 | 1975-09-30 | Bell Telephone Labor Inc | Apparatus and method for determining the beginning and the end of a speech utterance |
US4475189A (en) * | 1982-05-27 | 1984-10-02 | At&T Bell Laboratories | Automatic interactive conference arrangement |
US4519094A (en) * | 1982-08-26 | 1985-05-21 | At&T Bell Laboratories | LPC Word recognizer utilizing energy features |
US4866777A (en) * | 1984-11-09 | 1989-09-12 | Alcatel Usa Corporation | Apparatus for extracting features from a speech signal |
US4908865A (en) * | 1984-12-27 | 1990-03-13 | Texas Instruments Incorporated | Speaker independent speech recognition method and system |
US5548647A (en) * | 1987-04-03 | 1996-08-20 | Texas Instruments Incorporated | Fixed text speaker verification method and apparatus |
US5208897A (en) * | 1990-08-21 | 1993-05-04 | Emerson & Stern Associates, Inc. | Method and apparatus for speech recognition based on subsyllable spellings |
US5371853A (en) * | 1991-10-28 | 1994-12-06 | University Of Maryland At College Park | Method and system for CELP speech coding and codebook for use therewith |
US5305422A (en) * | 1992-02-28 | 1994-04-19 | Panasonic Technologies, Inc. | Method for determining boundaries of isolated words within a speech signal |
GB2272554A (en) * | 1992-11-13 | 1994-05-18 | Creative Tech Ltd | Recognizing speech by using wavelet transform and transient response therefrom |
ZA948426B (en) * | 1993-12-22 | 1995-06-30 | Qualcomm Inc | Distributed voice recognition system |
AU684872B2 (en) * | 1994-03-10 | 1998-01-08 | Cable And Wireless Plc | Communication system |
US5704009A (en) * | 1995-06-30 | 1997-12-30 | International Business Machines Corporation | Method and apparatus for transmitting a voice sample to a voice activated data processing system |
US6003004A (en) * | 1998-01-08 | 1999-12-14 | Advanced Recognition Technologies, Inc. | Speech recognition method and system using compressed speech data |
1998
- 1998-01-08: US, application US09/002,616, publication US6003004A (en); not active, Expired - Lifetime
- 1998-07-13: TW, application TW087111338A, publication TW394925B (zh); not active, IP Right Cessation
- 1998-07-22: DE, application DE69827667T, publication DE69827667T2 (de); not active, Expired - Lifetime
- 1998-07-22: IL, application IL13244998A, publication IL132449A; not active, IP Right Cessation
- 1998-07-22: EP, application EP98933871A, publication EP1046154B1 (de); not active, Expired - Lifetime
- 1998-07-22: CN, application CN98808942A, publication CN1125432C (zh); not active, Expired - Fee Related
- 1998-07-22: AT, application AT98933871T, publication ATE282881T1 (de); not active, IP Right Cessation
- 1998-07-22: WO, application PCT/IL1998/000341, publication WO1999035639A1 (en); active, IP Right Grant
- 1998-07-22: AU, application AU83553/98A, publication AU8355398A (en); not active, Abandoned
- 1998-07-22: RU, application RU99124623/09A, publication RU99124623A (ru); not active, Application Discontinuation
- 1998-07-22: KR, application KR10-1999-7009488A, publication KR100391287B1 (ko); not active, IP Right Cessation
- 1998-07-22: JP, application JP53591099A, publication JP2001510595A (ja); not active, Ceased
1999
- 1999-10-05: US, application US09/412,406, publication US6377923B1 (en); not active, Expired - Lifetime
2002
- 2002-01-22: US, application US10/051,350, publication US20030018472A1 (en); not active, Abandoned
Also Published As
Publication number | Publication date |
---|---|
EP1046154A4 (de) | 2001-02-07 |
CN1125432C (zh) | 2003-10-22 |
IL132449A (en) | 2005-07-25 |
WO1999035639A1 (en) | 1999-07-15 |
RU99124623A (ru) | 2001-09-27 |
DE69827667D1 (de) | 2004-12-23 |
US20030018472A1 (en) | 2003-01-23 |
US6003004A (en) | 1999-12-14 |
KR100391287B1 (ko) | 2003-07-12 |
JP2001510595A (ja) | 2001-07-31 |
DE69827667T2 (de) | 2005-10-06 |
US6377923B1 (en) | 2002-04-23 |
AU8355398A (en) | 1999-07-26 |
IL132449A0 (en) | 2001-03-19 |
EP1046154B1 (de) | 2004-11-17 |
EP1046154A1 (de) | 2000-10-25 |
TW394925B (en) | 2000-06-21 |
CN1273662A (zh) | 2000-11-15 |
KR20010006401A (ko) | 2001-01-26 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
ATE282881T1 (de) | Vocoder-based voice recognizer | |
US10332517B1 (en) | Privacy mode based on speaker identifier | |
EP1960997B1 (de) | Speech recognition system with huge vocabulary | |
US6243680B1 (en) | Method and apparatus for obtaining a transcription of phrases through text and spoken utterances | |
US20160379638A1 (en) | Input speech quality matching | |
Jelinek et al. | 25 Continuous speech recognition: Statistical methods | |
EP1629464A4 (de) | Phonetically based speech recognition system and method | |
DE69922971D1 (de) | Network interactive user interface using speech recognition and natural language processing | |
WO2001097213A8 (en) | Speech recognition using utterance-level confidence estimates | |
CA2069675A1 (en) | Flexible vocabulary recognition | |
ATE395685T1 (de) | Speech recognition by word-in-phrase command | |
EP1220197A3 (de) | System and method for speech recognition | |
BR9913524A (pt) | Voice recognizer and voice recognition process | |
DE60002584D1 (de) | Use of reference data for speech recognition | |
ATE449401T1 (de) | Automatic generation of a word pronunciation for speech recognition | |
Kim et al. | Robust DTW-based recognition algorithm for hand-held consumer devices | |
FR2857528B1 (fr) | Speech recognition for large dynamic vocabularies | |
Chen et al. | Large vocabulary word recognition based on tree-trellis search | |
Hofmann et al. | Improving spontaneous English ASR using a joint-sequence pronunciation model | |
Tolba et al. | Speech recognition by intelligent machines | |
JP3727436B2 (ja) | Apparatus and method for optimal matching of speech and manuscript | |
Choi et al. | Lexical tree decoding with a class-based language model for Chinese speech recognition | |
Nakagawa | A machine understanding system for spoken Japanese sentences | |
Tajchman et al. | Learning phonological rule probabilities from speech corpora with exploratory computational phonology | |
WO2000026901A3 (en) | Performing spoken recorded actions |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |