ATE282881T1 - Vokoder basierter spracherkenner - Google Patents

Vokoder basierter spracherkenner

Info

Publication number
ATE282881T1
ATE282881T1 AT98933871T AT98933871T ATE282881T1 AT E282881 T1 ATE282881 T1 AT E282881T1 AT 98933871 T AT98933871 T AT 98933871T AT 98933871 T AT98933871 T AT 98933871T AT E282881 T1 ATE282881 T1 AT E282881T1
Authority
AT
Austria
Prior art keywords
word
vocoder
lpc
data
recognition features
Prior art date
Application number
AT98933871T
Other languages
English (en)
Inventor
Yehuda Hershkovits
Gabriel Ilan
Original Assignee
Art Advanced Recognition Tech
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Art Advanced Recognition Tech filed Critical Art Advanced Recognition Tech
Application granted granted Critical
Publication of ATE282881T1 publication Critical patent/ATE282881T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Telephonic Communication Services (AREA)
  • Photoreceptors In Electrophotography (AREA)
  • Steering Control In Accordance With Driving Conditions (AREA)
  • Telephone Function (AREA)
  • Diaphragms For Electromechanical Transducers (AREA)
  • Character Discrimination (AREA)
  • Machine Translation (AREA)
AT98933871T 1998-01-08 1998-07-22 Vokoder basierter spracherkenner ATE282881T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/002,616 US6003004A (en) 1998-01-08 1998-01-08 Speech recognition method and system using compressed speech data
PCT/IL1998/000341 WO1999035639A1 (en) 1998-01-08 1998-07-22 A vocoder-based voice recognizer

Publications (1)

Publication Number Publication Date
ATE282881T1 true ATE282881T1 (de) 2004-12-15

Family

ID=21701631

Family Applications (1)

Application Number Title Priority Date Filing Date
AT98933871T ATE282881T1 (de) 1998-01-08 1998-07-22 Vokoder basierter spracherkenner

Country Status (12)

Country Link
US (3) US6003004A (de)
EP (1) EP1046154B1 (de)
JP (1) JP2001510595A (de)
KR (1) KR100391287B1 (de)
CN (1) CN1125432C (de)
AT (1) ATE282881T1 (de)
AU (1) AU8355398A (de)
DE (1) DE69827667T2 (de)
IL (1) IL132449A (de)
RU (1) RU99124623A (de)
TW (1) TW394925B (de)
WO (1) WO1999035639A1 (de)

Families Citing this family (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6370504B1 (en) * 1997-05-29 2002-04-09 University Of Washington Speech recognition on MPEG/Audio encoded files
US6134283A (en) * 1997-11-18 2000-10-17 Amati Communications Corporation Method and system for synchronizing time-division-duplexed transceivers
US6003004A (en) * 1998-01-08 1999-12-14 Advanced Recognition Technologies, Inc. Speech recognition method and system using compressed speech data
KR100277105B1 (ko) * 1998-02-27 2001-01-15 윤종용 음성 인식 데이터 결정 장치 및 방법
US6223157B1 (en) * 1998-05-07 2001-04-24 Dsc Telecom, L.P. Method for direct recognition of encoded speech data
JP4081858B2 (ja) * 1998-06-04 2008-04-30 ソニー株式会社 コンピュータシステム、コンピュータ端末装置、及び記録媒体
US6321197B1 (en) * 1999-01-22 2001-11-20 Motorola, Inc. Communication device and method for endpointing speech utterances
US6411926B1 (en) * 1999-02-08 2002-06-25 Qualcomm Incorporated Distributed voice recognition system
US6792405B2 (en) * 1999-12-10 2004-09-14 At&T Corp. Bitstream-based feature extraction method for a front-end speech recognizer
US6795698B1 (en) * 2000-04-12 2004-09-21 Northrop Grumman Corporation Method and apparatus for embedding global positioning system (GPS) data in mobile telephone call data
US6564182B1 (en) 2000-05-12 2003-05-13 Conexant Systems, Inc. Look-ahead pitch determination
US6999923B1 (en) * 2000-06-23 2006-02-14 International Business Machines Corporation System and method for control of lights, signals, alarms using sound detection
US7203651B2 (en) * 2000-12-07 2007-04-10 Art-Advanced Recognition Technologies, Ltd. Voice control system with multiple voice recognition engines
US7155387B2 (en) * 2001-01-08 2006-12-26 Art - Advanced Recognition Technologies Ltd. Noise spectrum subtraction method and system
US7089184B2 (en) * 2001-03-22 2006-08-08 Nurv Center Technologies, Inc. Speech recognition for recognizing speaker-independent, continuous speech
US20030028386A1 (en) * 2001-04-02 2003-02-06 Zinser Richard L. Compressed domain universal transcoder
US7319703B2 (en) * 2001-09-04 2008-01-15 Nokia Corporation Method and apparatus for reducing synchronization delay in packet-based voice terminals by resynchronizing during talk spurts
US7050969B2 (en) * 2001-11-27 2006-05-23 Mitsubishi Electric Research Laboratories, Inc. Distributed speech recognition with codec parameters
US7079657B2 (en) * 2002-02-26 2006-07-18 Broadcom Corporation System and method of performing digital multi-channel audio signal decoding
US7024353B2 (en) * 2002-08-09 2006-04-04 Motorola, Inc. Distributed speech recognition with back-end voice activity detection apparatus and method
US20040073428A1 (en) * 2002-10-10 2004-04-15 Igor Zlokarnik Apparatus, methods, and programming for speech synthesis via bit manipulations of compressed database
FI20021936A (fi) * 2002-10-31 2004-05-01 Nokia Corp Vaihtuvanopeuksinen puhekoodekki
CN1302454C (zh) * 2003-07-11 2007-02-28 中国科学院声学研究所 语音识别的概率加权平均缺失特征数据重建方法
US7558736B2 (en) * 2003-12-31 2009-07-07 United States Cellular Corporation System and method for providing talker arbitration in point-to-point/group communication
KR100647290B1 (ko) * 2004-09-22 2006-11-23 삼성전자주식회사 합성된 음성의 특성을 이용하여 양자화/역양자화를선택하는 음성 부호화/복호화 장치 및 그 방법
US7533018B2 (en) * 2004-10-19 2009-05-12 Motorola, Inc. Tailored speaker-independent voice recognition system
US20060095261A1 (en) * 2004-10-30 2006-05-04 Ibm Corporation Voice packet identification based on celp compression parameters
US20060224381A1 (en) * 2005-04-04 2006-10-05 Nokia Corporation Detecting speech frames belonging to a low energy sequence
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
GB0710211D0 (en) * 2007-05-29 2007-07-11 Intrasonics Ltd AMR Spectrography
US20090094026A1 (en) * 2007-10-03 2009-04-09 Binshi Cao Method of determining an estimated frame energy of a communication
US9208796B2 (en) 2011-08-22 2015-12-08 Genband Us Llc Estimation of speech energy based on code excited linear prediction (CELP) parameters extracted from a partially-decoded CELP-encoded bit stream and applications of same
MX342965B (es) * 2013-04-05 2016-10-19 Dolby Laboratories Licensing Corp Sistema y método de compansión para reducir el ruido de cuantificación usando extensión espectral avanzada.
CN104683959B (zh) * 2013-11-27 2018-09-18 深圳市盛天龙视听科技有限公司 即时通讯型便携式音频装置及其账号载入方法
KR20150096217A (ko) * 2014-02-14 2015-08-24 한국전자통신연구원 디지털 데이터 압축 방법 및 장치
TWI631556B (zh) * 2017-05-05 2018-08-01 英屬開曼群島商捷鼎創新股份有限公司 資料壓縮裝置及其資料壓縮方法
US10460749B1 (en) * 2018-06-28 2019-10-29 Nuvoton Technology Corporation Voice activity detection using vocal tract area information

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3909532A (en) * 1974-03-29 1975-09-30 Bell Telephone Labor Inc Apparatus and method for determining the beginning and the end of a speech utterance
US4475189A (en) * 1982-05-27 1984-10-02 At&T Bell Laboratories Automatic interactive conference arrangement
US4519094A (en) * 1982-08-26 1985-05-21 At&T Bell Laboratories LPC Word recognizer utilizing energy features
US4866777A (en) * 1984-11-09 1989-09-12 Alcatel Usa Corporation Apparatus for extracting features from a speech signal
US4908865A (en) * 1984-12-27 1990-03-13 Texas Instruments Incorporated Speaker independent speech recognition method and system
US5548647A (en) * 1987-04-03 1996-08-20 Texas Instruments Incorporated Fixed text speaker verification method and apparatus
US5208897A (en) * 1990-08-21 1993-05-04 Emerson & Stern Associates, Inc. Method and apparatus for speech recognition based on subsyllable spellings
US5371853A (en) * 1991-10-28 1994-12-06 University Of Maryland At College Park Method and system for CELP speech coding and codebook for use therewith
US5305422A (en) * 1992-02-28 1994-04-19 Panasonic Technologies, Inc. Method for determining boundaries of isolated words within a speech signal
GB2272554A (en) * 1992-11-13 1994-05-18 Creative Tech Ltd Recognizing speech by using wavelet transform and transient response therefrom
ZA948426B (en) * 1993-12-22 1995-06-30 Qualcomm Inc Distributed voice recognition system
AU684872B2 (en) * 1994-03-10 1998-01-08 Cable And Wireless Plc Communication system
US5704009A (en) * 1995-06-30 1997-12-30 International Business Machines Corporation Method and apparatus for transmitting a voice sample to a voice activated data processing system
US6003004A (en) * 1998-01-08 1999-12-14 Advanced Recognition Technologies, Inc. Speech recognition method and system using compressed speech data

Also Published As

Publication number Publication date
EP1046154A4 (de) 2001-02-07
CN1125432C (zh) 2003-10-22
IL132449A (en) 2005-07-25
WO1999035639A1 (en) 1999-07-15
RU99124623A (ru) 2001-09-27
DE69827667D1 (de) 2004-12-23
US20030018472A1 (en) 2003-01-23
US6003004A (en) 1999-12-14
KR100391287B1 (ko) 2003-07-12
JP2001510595A (ja) 2001-07-31
DE69827667T2 (de) 2005-10-06
US6377923B1 (en) 2002-04-23
AU8355398A (en) 1999-07-26
IL132449A0 (en) 2001-03-19
EP1046154B1 (de) 2004-11-17
EP1046154A1 (de) 2000-10-25
TW394925B (en) 2000-06-21
CN1273662A (zh) 2000-11-15
KR20010006401A (ko) 2001-01-26

Similar Documents

Publication Publication Date Title
ATE282881T1 (de) Vokoder basierter spracherkenner
US10332517B1 (en) Privacy mode based on speaker identifier
EP1960997B1 (de) Spracherkennungssystem mit riesigem vokabular
US6243680B1 (en) Method and apparatus for obtaining a transcription of phrases through text and spoken utterances
US20160379638A1 (en) Input speech quality matching
Jelinek et al. 25 Continuous speech recognition: Statistical methods
EP1629464A4 (de) Spracherkennungssystem und verfahren auf phonetischer basis
DE69922971D1 (de) Netzwerk-interaktive benutzerschnittstelle mittels spracherkennung und verarbeitung natürlicher sprache
WO2001097213A8 (en) Speech recognition using utterance-level confidence estimates
CA2069675A1 (en) Flexible vocabulary recognition
ATE395685T1 (de) Spracherkennung durch wort-in-phrase-befehl
EP1220197A3 (de) System und Verfahren zur Spracherkennung
BR9913524A (pt) Reconhecedor de voz, e, processo de reconhecimento de voz
DE60002584D1 (de) Anwendung von Referenzdaten für Spracherkennung
ATE449401T1 (de) Automatische erzeugung einer wortaussprache für die spracherkennung
Kim et al. Robust DTW-based recognition algorithm for hand-held consumer devices
FR2857528B1 (fr) Reconnaissance vocale pour les larges vocabulaires dynamiques
Chen et al. Large vocabulary word recognition based on tree-trellis search
Hofmann et al. Improving spontaneous English ASR using a joint-sequence pronunciation model
Tolba et al. Speech recognition by intelligent machines
JP3727436B2 (ja) 音声原稿最適照合装置および方法
Choi et al. Lexical tree decoding with a class-based language model for Chinese speech recognition
Nakagawa A machine understanding system for spoken Japanese sentences
Tajchman et al. Learning phonological rule probabilities from speech corpora with exploratory computational phonology
WO2000026901A3 (en) Performing spoken recorded actions

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties