WO2001048737A3 - Speech recognizer with a lexical tree based n-gram language model - Google Patents
Speech recognizer with a lexical tree based n-gram language model Download PDFInfo
- Publication number
- WO2001048737A3 WO2001048737A3 PCT/CN1999/000217 CN9900217W WO0148737A3 WO 2001048737 A3 WO2001048737 A3 WO 2001048737A3 CN 9900217 W CN9900217 W CN 9900217W WO 0148737 A3 WO0148737 A3 WO 0148737A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- probabilities
- lexical tree
- estimated probabilities
- stored
- phonemes
- Prior art date
Links
- 238000000034 method Methods 0.000 abstract 5
- 238000013138 pruning Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Probability & Statistics with Applications (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
Abstract
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU17676/00A AU1767600A (en) | 1999-12-23 | 1999-12-23 | Speech recognizer with a lexical tree based n-gram language model |
PCT/CN1999/000217 WO2001048737A2 (en) | 1999-12-23 | 1999-12-23 | Speech recognizer with a lexical tree based n-gram language model |
CN99817058.5A CN1201286C (en) | 1999-12-23 | 1999-12-23 | Speech recognizer with a lexial tree based N-gram language model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN1999/000217 WO2001048737A2 (en) | 1999-12-23 | 1999-12-23 | Speech recognizer with a lexical tree based n-gram language model |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2001048737A2 WO2001048737A2 (en) | 2001-07-05 |
WO2001048737A3 true WO2001048737A3 (en) | 2002-11-14 |
Family
ID=4575158
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN1999/000217 WO2001048737A2 (en) | 1999-12-23 | 1999-12-23 | Speech recognizer with a lexical tree based n-gram language model |
Country Status (3)
Country | Link |
---|---|
CN (1) | CN1201286C (en) |
AU (1) | AU1767600A (en) |
WO (1) | WO2001048737A2 (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB0420464D0 (en) | 2004-09-14 | 2004-10-20 | Zentian Ltd | A speech recognition circuit and method |
CN101271450B (en) * | 2007-03-19 | 2010-09-29 | 株式会社东芝 | Method and device for cutting language model |
GB2453366B (en) * | 2007-10-04 | 2011-04-06 | Toshiba Res Europ Ltd | Automatic speech recognition method and apparatus |
WO2010105428A1 (en) * | 2009-03-19 | 2010-09-23 | Google Inc. | Input method editor |
WO2010105427A1 (en) | 2009-03-19 | 2010-09-23 | Google Inc. | Input method editor |
US8655647B2 (en) | 2010-03-11 | 2014-02-18 | Microsoft Corporation | N-gram selection for practical-sized language models |
US8589164B1 (en) * | 2012-10-18 | 2013-11-19 | Google Inc. | Methods and systems for speech recognition processing using search query information |
CN111128172B (en) * | 2019-12-31 | 2022-12-16 | 达闼机器人股份有限公司 | Voice recognition method, electronic equipment and storage medium |
Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0473694A (en) * | 1990-07-13 | 1992-03-09 | Nippon Telegr & Teleph Corp <Ntt> | Japanese language speech recognizing method |
EP0533260A2 (en) * | 1991-09-14 | 1993-03-24 | Philips Patentverwaltung GmbH | Method and apparatus for recognizing the uttered words in a speech signal |
US5502791A (en) * | 1992-09-29 | 1996-03-26 | International Business Machines Corporation | Speech recognition by concatenating fenonic allophone hidden Markov models in parallel among subwords |
JPH08123479A (en) * | 1994-10-26 | 1996-05-17 | Atr Onsei Honyaku Tsushin Kenkyusho:Kk | Continuous speech recognition device |
JPH08221091A (en) * | 1995-02-17 | 1996-08-30 | Matsushita Electric Ind Co Ltd | Voice recognition device |
WO1996027872A1 (en) * | 1995-03-07 | 1996-09-12 | British Telecommunications Public Limited Company | Speech recognition |
US5621859A (en) * | 1994-01-19 | 1997-04-15 | Bbn Corporation | Single tree method for grammar directed, very large vocabulary speech recognizer |
EP0825586A2 (en) * | 1996-08-22 | 1998-02-25 | Dragon Systems Inc. | Lexical tree pre-filtering in speech recognition |
US5758024A (en) * | 1996-06-25 | 1998-05-26 | Microsoft Corporation | Method and system for encoding pronunciation prefix trees |
US5832428A (en) * | 1995-10-04 | 1998-11-03 | Apple Computer, Inc. | Search engine for phrase recognition based on prefix/body/suffix architecture |
CN1233803A (en) * | 1998-04-29 | 1999-11-03 | 松下电器产业株式会社 | Method and apparatus using decision trees to generate and score multiple pronunciations for spelled word |
WO1999059141A1 (en) * | 1998-05-11 | 1999-11-18 | Siemens Aktiengesellschaft | Method and array for introducing temporal correlation in hidden markov models for speech recognition |
JPH11344991A (en) * | 1998-05-30 | 1999-12-14 | Brother Ind Ltd | Voice recognition device and storage medium |
-
1999
- 1999-12-23 CN CN99817058.5A patent/CN1201286C/en not_active Expired - Fee Related
- 1999-12-23 WO PCT/CN1999/000217 patent/WO2001048737A2/en active Application Filing
- 1999-12-23 AU AU17676/00A patent/AU1767600A/en not_active Abandoned
Patent Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0473694A (en) * | 1990-07-13 | 1992-03-09 | Nippon Telegr & Teleph Corp <Ntt> | Japanese language speech recognizing method |
EP0533260A2 (en) * | 1991-09-14 | 1993-03-24 | Philips Patentverwaltung GmbH | Method and apparatus for recognizing the uttered words in a speech signal |
US5502791A (en) * | 1992-09-29 | 1996-03-26 | International Business Machines Corporation | Speech recognition by concatenating fenonic allophone hidden Markov models in parallel among subwords |
US5621859A (en) * | 1994-01-19 | 1997-04-15 | Bbn Corporation | Single tree method for grammar directed, very large vocabulary speech recognizer |
JPH08123479A (en) * | 1994-10-26 | 1996-05-17 | Atr Onsei Honyaku Tsushin Kenkyusho:Kk | Continuous speech recognition device |
JPH08221091A (en) * | 1995-02-17 | 1996-08-30 | Matsushita Electric Ind Co Ltd | Voice recognition device |
WO1996027872A1 (en) * | 1995-03-07 | 1996-09-12 | British Telecommunications Public Limited Company | Speech recognition |
US5832428A (en) * | 1995-10-04 | 1998-11-03 | Apple Computer, Inc. | Search engine for phrase recognition based on prefix/body/suffix architecture |
US5758024A (en) * | 1996-06-25 | 1998-05-26 | Microsoft Corporation | Method and system for encoding pronunciation prefix trees |
EP0825586A2 (en) * | 1996-08-22 | 1998-02-25 | Dragon Systems Inc. | Lexical tree pre-filtering in speech recognition |
CN1233803A (en) * | 1998-04-29 | 1999-11-03 | 松下电器产业株式会社 | Method and apparatus using decision trees to generate and score multiple pronunciations for spelled word |
WO1999059141A1 (en) * | 1998-05-11 | 1999-11-18 | Siemens Aktiengesellschaft | Method and array for introducing temporal correlation in hidden markov models for speech recognition |
JPH11344991A (en) * | 1998-05-30 | 1999-12-14 | Brother Ind Ltd | Voice recognition device and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN1406374A (en) | 2003-03-26 |
AU1767600A (en) | 2001-07-09 |
CN1201286C (en) | 2005-05-11 |
WO2001048737A2 (en) | 2001-07-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2001274936A1 (en) | Creating a unified task dependent language models with information retrieval techniques | |
EP1128361A3 (en) | Language models for speech recognition | |
WO2004110030A3 (en) | Assistive call center interface | |
CA2508946A1 (en) | Method and apparatus for natural language call routing using confidence scores | |
WO2008115285A3 (en) | Content selection using speech recognition | |
EP1220197A3 (en) | Speech recognition method and system | |
CA2321112A1 (en) | Information retrieval and speech recognition based on language models | |
EP1538535A3 (en) | Determination of meaning for text input in natural language understanding systems | |
EP1083545A3 (en) | Voice recognition of proper names in a navigation apparatus | |
CA2493640A1 (en) | Improvements in or relating to information provision for call centres | |
WO2002046719A3 (en) | Cryostorage method and device | |
WO2007035186A3 (en) | A method and system for the automatic recognition of deceptive language | |
WO2003065253A3 (en) | Method and system for storage and fast retrieval of digital terrain model elevations for use in positioning systems | |
EP1653444A3 (en) | System and method for converting text to speech | |
US20070033025A1 (en) | Algorithm for n-best ASR result processing to improve accuracy | |
WO2001048737A3 (en) | Speech recognizer with a lexical tree based n-gram language model | |
ATE223610T1 (en) | DEVICE FOR DETECTING CONTINUOUSLY SPOKEN LANGUAGE | |
WO2001084357A3 (en) | Cluster and pruning-based language model compression | |
EP0949606A3 (en) | Method and system for speech recognition based on phonetic transcriptions | |
Nocera et al. | Phoneme lattice based A* search algorithm for speech recognition | |
Wang et al. | A multi-pass linear fold algorithm for sentence boundary detection using prosodic cues | |
WO2004072947A3 (en) | Speech recognition with soft pruning | |
EP1321862A3 (en) | Hash function based transcription database | |
EP1406161A3 (en) | Information processing device and setting method therefor | |
US20040138884A1 (en) | Compression of language model structures and word identifiers for automated speech recognition systems |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
WWE | Wipo information: entry into national phase |
Ref document number: 998170585 Country of ref document: CN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 09979628 Country of ref document: US |
|
AK | Designated states |
Kind code of ref document: A3 Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A3 Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
122 | Ep: pct application non-entry in european phase |