ATE457510T1 - Speech recognition system with huge vocabulary - Google Patents

Speech recognition system with huge vocabulary

Info

Publication number: ATE457510T1
Authority: AT; Austria
Prior art keywords: words; word; recognition system; speech recognition; speech
Prior art date: 2005-12-08

Application number

AT06832122T

Other languages

English (en)

Inventor

Zsolt Saffer

Original Assignee

Nuance Comm Austria Gmbh

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2005-12-08

Filing date

2006-12-06

Publication date

2010-02-15

2006-12-06 Application filed by Nuance Comm Austria Gmbh

2010-02-15 Application granted

2010-02-15 Publication of ATE457510T1

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units

Landscapes

Engineering & Computer Science (AREA)
Computational Linguistics (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Artificial Intelligence (AREA)
Machine Translation (AREA)
Document Processing Apparatus (AREA)
Telephonic Communication Services (AREA)

AT06832122T 2005-12-08 2006-12-06 Speech recognition system with huge vocabulary ATE457510T1 (de)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
EP05111839		2005-12-08
PCT/IB2006/054637 WO2007066297A1 (en)	2005-12-08	2006-12-06	Speech recognition system with huge vocabulary

Publications (1)

Publication Number	Publication Date
ATE457510T1 (de)	2010-02-15

Family

ID=37907345

Family Applications (1)

Application Number	Priority Date	Filing Date	Title
AT06832122T ATE457510T1 (de)	2005-12-08	2006-12-06	Speech recognition system with huge vocabulary

Country Status (8)

Country	Link
US (3)	US8140336B2 (de)
EP (1)	EP1960997B1 (de)
JP (2)	JP5322655B2 (de)
CN (2)	CN102176310B (de)
AT (1)	ATE457510T1 (de)
DE (1)	DE602006012218D1 (de)
RU (1)	RU2008127509A (de)
WO (1)	WO2007066297A1 (de)

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
CN102176310B (zh)	2005-12-08	2013-08-21	纽昂斯奥地利通讯有限公司	Speech recognition system with huge vocabulary
US8135590B2 (en) *	2007-01-11	2012-03-13	Microsoft Corporation	Position-dependent phonetic models for reliable pronunciation identification
JP4973731B2 (ja) *	2007-07-09	2012-07-11	富士通株式会社	Speech recognition device, speech recognition method, and speech recognition program
US8738360B2 (en) *	2008-06-06	2014-05-27	Apple Inc.	Data detection of a character sequence having multiple possible data types
US8788256B2 (en) *	2009-02-17	2014-07-22	Sony Computer Entertainment Inc.	Multiple language voice recognition
US9646603B2 (en) *	2009-02-27	2017-05-09	Longsand Limited	Various apparatus and methods for a speech recognition system
US9659559B2 (en) *	2009-06-25	2017-05-23	Adacel Systems, Inc.	Phonetic distance measurement system and related methods
JP5660441B2 (ja) *	2010-09-22	2015-01-28	独立行政法人情報通信研究機構	Speech recognition device, speech recognition method, and program
KR20120046627A (ko) *	2010-11-02	2012-05-10	삼성전자주식회사	Speaker adaptation method and apparatus
CN102479508B (zh) *	2010-11-30	2015-02-11	国际商业机器公司	Method and system for converting text to speech
WO2012104708A1 (en) *	2011-01-31	2012-08-09	Walter Rosenbaum	Method and system for information recognition
CN102737638B (zh) *	2012-06-30	2015-06-03	北京百度网讯科技有限公司	Method and device for speech decoding
KR20140028174A (ko) *	2012-07-13	2014-03-10	삼성전자주식회사	Speech recognition method and electronic device applying the same
WO2014035394A1 (en) *	2012-08-30	2014-03-06	Interactive Intelligence, Inc.	Method and system for predicting speech recognition performance using accuracy scores
US10019983B2 (en) *	2012-08-30	2018-07-10	Aravind Ganapathiraju	Method and system for predicting speech recognition performance using accuracy scores
US9035884B2 (en)	2012-10-17	2015-05-19	Nuance Communications, Inc.	Subscription updates in multiple device language models
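CN103810997B (zh) *	2012-11-14	2018-04-03	北京百度网讯科技有限公司	Method and device for determining the confidence of a speech recognition result
CN103903619B (zh) *	2012-12-28	2016-12-28	科大讯飞股份有限公司	Method and system for improving speech recognition accuracy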
US9953646B2 (en)	2014-09-02	2018-04-24	Belleau Technologies	Method and system for dynamic speech recognition and tracking of prewritten script
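CN105895091B (zh) *	2016-04-06	2020-01-03	普强信息技术（北京）有限公司	ESWFST construction method
JP7102710B2 (ja) *	2017-11-22	2022-07-20	富士通株式会社	Information generation program, word extraction program, information processing device, information generation method, and word extraction method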
US10964311B2 (en) *	2018-02-23	2021-03-30	Kabushiki Kaisha Toshiba	Word detection system, word detection method, and storage medium
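JP7124358B2 (ja) *	2018-03-13	2022-08-24	富士通株式会社	Output program, information processing device, and output control method
CN109002454B (zh) *	2018-04-28	2022-05-27	陈逸天	Method and electronic device for determining the phonics partitions of a target word
CN109376358B (zh) *	2018-10-25	2021-07-16	陈逸天	Word learning method, apparatus, and electronic device drawing on historical phonics experience
CN110176230B (zh) *	2018-12-11	2021-10-08	腾讯科技（深圳）有限公司	Speech recognition method, apparatus, device, and storage medium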
US11217245B2 (en) *	2019-08-29	2022-01-04	Sony Interactive Entertainment Inc.	Customizable keyword spotting system with keyword adaptation

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US4980918A (en) *	1985-05-09	1990-12-25	International Business Machines Corporation	Speech recognition system with efficient storage and rapid assembly of phonological graphs
JPS63155263A (ja) *	1986-12-18	1988-06-28	Fujitsu Ltd	Voice word processor
US5033087A (en)	1989-03-14	1991-07-16	International Business Machines Corp.	Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system
JP3042455B2 (ja) *	1997-07-28	2000-05-15	日本電気株式会社	Continuous speech recognition method
US6757652B1 (en) *	1998-03-03	2004-06-29	Koninklijke Philips Electronics N.V.	Multiple stage speech recognizer
US6243680B1 (en) *	1998-06-15	2001-06-05	Nortel Networks Limited	Method and apparatus for obtaining a transcription of phrases through text and spoken utterances
US20020116196A1 (en) *	1998-11-12	2002-08-22	Tran Bao Q.	Speech recognizer
JP2000267693A (ja) *	1999-03-12	2000-09-29	Fuji Xerox Co Ltd	Speech processing device and index creation device
US6542866B1 (en)	1999-09-22	2003-04-01	Microsoft Corporation	Speech recognition method and apparatus utilizing multiple feature streams
CN1329861C (zh) *	1999-10-28	2007-08-01	佳能株式会社	Pattern matching method and apparatus
US20030009331A1 (en) *	2001-07-05	2003-01-09	Johan Schalkwyk	Grammars for speech recognition
DE10207895B4 (de) *	2002-02-23	2005-11-03	Harman Becker Automotive Systems Gmbh	Method for speech recognition and speech recognition system
US7181398B2 (en) *	2002-03-27	2007-02-20	Hewlett-Packard Development Company, L.P.	Vocabulary independent speech recognition system and method using subword units
US6879954B2 (en)	2002-04-22	2005-04-12	Matsushita Electric Industrial Co., Ltd.	Pattern matching for large vocabulary speech recognition systems
US7149688B2 (en) *	2002-11-04	2006-12-12	Speechworks International, Inc.	Multi-lingual speech recognition with cross-language context modeling
JP4072718B2 (ja) *	2002-11-21	2008-04-09	ソニー株式会社	Speech processing apparatus and method, recording medium, and program
US7409345B2 (en) *	2003-04-04	2008-08-05	International Business Machines Corporation	Methods for reducing spurious insertions in speech recognition
KR100612839B1 (ko) *	2004-02-18	2006-08-18	삼성전자주식회사	Domain-based dialog speech recognition method and apparatus
US20080103771A1 (en) *	2004-11-08	2008-05-01	France Telecom	Method for the Distributed Construction of a Voice Recognition Model, and Device, Server and Computer Programs Used to Implement Same
CN102176310B (zh)	2005-12-08	2013-08-21	纽昂斯奥地利通讯有限公司	Speech recognition system with huge vocabulary

2006
- 2006-12-06 CN CN2011101288722A (CN102176310B): Expired - Fee Related (not active)
- 2006-12-06 WO PCT/IB2006/054637 (WO2007066297A1): Application Filing (active)
- 2006-12-06 CN CN2006800460259A (CN101326572B): Expired - Fee Related (not active)
- 2006-12-06 JP JP2008543980A (JP5322655B2): Expired - Fee Related (not active)
- 2006-12-06 RU RU2008127509/09A (RU2008127509A): Application Discontinuation (not active)
- 2006-12-06 US US12/096,046 (US8140336B2): Expired - Fee Related (not active)
- 2006-12-06 AT AT06832122T (ATE457510T1): IP Right Cessation (not active)
- 2006-12-06 EP EP06832122A (EP1960997B1): Not-in-force (not active)
- 2006-12-06 DE DE602006012218T (DE602006012218D1): Active
2012
- 2012-02-03 US US13/366,096 (US8417528B2): Active
- 2012-12-14 JP JP2012273922A (JP5968774B2): Expired - Fee Related (not active)
2013
- 2013-03-06 US US13/786,973 (US8666745B2): Active

Also Published As

Publication number	Publication date
US20120136662A1 (en)	2012-05-31
EP1960997B1 (de)	2010-02-10
RU2008127509A (ru)	2010-01-20
JP2009518677A (ja)	2009-05-07
EP1960997A1 (de)	2008-08-27
CN101326572B (zh)	2011-07-06
JP2013068970A (ja)	2013-04-18
US8417528B2 (en)	2013-04-09
CN101326572A (zh)	2008-12-17
JP5968774B2 (ja)	2016-08-10
CN102176310B (zh)	2013-08-21
WO2007066297A1 (en)	2007-06-14
US8140336B2 (en)	2012-03-20
US20080294441A1 (en)	2008-11-27
US20130185073A1 (en)	2013-07-18
DE602006012218D1 (de)	2010-03-25
JP5322655B2 (ja)	2013-10-23
CN102176310A (zh)	2011-09-07
US8666745B2 (en)	2014-03-04

Legal Events

Date	Code	Title	Description
2010-08-15	RER	Ceased as to paragraph 5 lit. 3 law introducing patent treaties

Publication	Publication Date	Title
ATE457510T1 (de)	2010-02-15	Speech recognition system with huge vocabulary
ATE419616T1 (de)	2009-01-15	Method, device and computer program for speech recognition
TW200638337A (en)	2006-11-01	Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
Stolcke et al.	2014	Highly accurate phonetic segmentation using boundary correction models and system fusion
ES2086345T3 (es)	1996-07-01	Method for user-adaptive speech recognition.
WO2008073850A3 (en)	2008-08-28	Method and apparatus for reading education
TW200601263A (en)	2006-01-01	Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition
Chen et al.	2011	Applying rhythm features to automatically assess non-native speech
WO2007015869A3 (en)	2007-04-19	Spoken language proficiency assessment by computer
US20110166861A1 (en)	2011-07-07	Method and apparatus for synthesizing a speech with information
WO2007117814A3 (en)	2008-05-22	Voice signal perturbation for speech recognition
EP4235648A3 (de)	2023-10-25	Language model biasing
WO2009008055A1 (ja)	2009-01-15	Speech recognition device, speech recognition method, and speech recognition program
WO2007034478A3 (en)	2009-04-30	System and method for correcting speech
Cardinal et al.	2015	Speaker adaptation using the i-vector technique for bottleneck features
Ghai et al.	2013	Phone based acoustic modeling for automatic speech recognition for punjabi language
Tong et al.	2015	Goodness of tone (GOT) for non-native Mandarin tone recognition.
Prudnikov et al.	2015	Improving acoustic models for Russian spontaneous speech recognition
Srikanth et al.	2012	Automatic pronunciation scoring and mispronunciation detection using CMUSphinx
Elaraby et al.	2016	A deep neural networks (DNN) based models for a computer aided pronunciation learning system
Luong et al.	2015	Tonal phoneme based model for Vietnamese LVCSR
Modipa et al.	2015	Predicting vowel substitution in code-switched speech
Kilgour et al.	2013	The 2013 KIT IWSLT Speech-to-Text Systems for German and English
Barnard et al.	2011	Phone recognition for spoken web search
Lazaridis et al.	2014	Syllable-based regional Swiss French accent identification using prosodic features