JP4724377B2 - 自然言語理解(NLU)システムにおける規則ベース文法に関するスロットおよび前終端記号(preterminal)に関する統計モデル - Google Patents
自然言語理解(NLU)システムにおける規則ベース文法に関するスロットおよび前終端記号(preterminal)に関する統計モデル Download PDFInfo
- Publication number
- JP4724377B2 JP4724377B2 JP2004130332A JP2004130332A JP4724377B2 JP 4724377 B2 JP4724377 B2 JP 4724377B2 JP 2004130332 A JP2004130332 A JP 2004130332A JP 2004130332 A JP2004130332 A JP 2004130332A JP 4724377 B2 JP4724377 B2 JP 4724377B2
- Authority
- JP
- Japan
- Prior art keywords
- model
- rule
- segmentation
- front terminal
- schema
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000013179 statistical model Methods 0.000 title claims abstract description 37
- 238000012549 training Methods 0.000 claims abstract description 51
- 230000011218 segmentation Effects 0.000 claims description 79
- 230000007704 transition Effects 0.000 claims description 16
- 238000013507 mapping Methods 0.000 claims 1
- 238000010586 diagram Methods 0.000 description 19
- 238000000034 method Methods 0.000 description 12
- 238000013138 pruning Methods 0.000 description 10
- 238000004891 communication Methods 0.000 description 6
- 239000002131 composite material Substances 0.000 description 6
- 238000012545 processing Methods 0.000 description 6
- 238000009499 grossing Methods 0.000 description 5
- 238000010606 normalization Methods 0.000 description 5
- 230000003287 optical effect Effects 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 238000007476 Maximum Likelihood Methods 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 230000002093 peripheral effect Effects 0.000 description 4
- 230000010006 flight Effects 0.000 description 3
- 230000006855 networking Effects 0.000 description 3
- 238000009827 uniform distribution Methods 0.000 description 3
- 230000005055 memory storage Effects 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- CDFKCKUONRRKJD-UHFFFAOYSA-N 1-(3-chlorophenoxy)-3-[2-[[3-(3-chlorophenoxy)-2-hydroxypropyl]amino]ethylamino]propan-2-ol;methanesulfonic acid Chemical compound CS(O)(=O)=O.CS(O)(=O)=O.C=1C=CC(Cl)=CC=1OCC(O)CNCCNCC(O)COC1=CC=CC(Cl)=C1 CDFKCKUONRRKJD-UHFFFAOYSA-N 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000000802 evaporation-induced self-assembly Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000007723 transport mechanism Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23P—SHAPING OR WORKING OF FOODSTUFFS, NOT FULLY COVERED BY A SINGLE OTHER SUBCLASS
- A23P30/00—Shaping or working of foodstuffs characterised by the process or apparatus
- A23P30/30—Puffing or expanding
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23L—FOODS, FOODSTUFFS, OR NON-ALCOHOLIC BEVERAGES, NOT COVERED BY SUBCLASSES A21D OR A23B-A23J; THEIR PREPARATION OR TREATMENT, e.g. COOKING, MODIFICATION OF NUTRITIVE QUALITIES, PHYSICAL TREATMENT; PRESERVATION OF FOODS OR FOODSTUFFS, IN GENERAL
- A23L7/00—Cereal-derived products; Malt products; Preparation or treatment thereof
- A23L7/10—Cereal-derived products
- A23L7/161—Puffed cereals, e.g. popcorn or puffed rice
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/216—Parsing using statistical methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Polymers & Plastics (AREA)
- Probability & Statistics with Applications (AREA)
- Life Sciences & Earth Sciences (AREA)
- Food Science & Technology (AREA)
- Chemical & Material Sciences (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Nutrition Science (AREA)
- Machine Translation (AREA)
- Electrically Operated Instructional Devices (AREA)
Description
=P(ε|ShowFlightCmd)*P(from|FPDCity)
+P(from|ShowFlightCmd)*P(ε|FPDCity)
=[(7/18)×(5/12)]+[(3/18)×(5/12)]=50/216
この数量から、空の文字列をShowFlightCmdと位置合わせし、「from」をFPDCityと位置合わせするセグメント化の比率は、以下のような新しく予測されるカウント
FlightPreArrivalCity→to
ShowFlightCmd→Show me the flight
さらに、ランタイム時に、センテンス入力が「Show flight to Boston」であると想定してみる。「Show flight」がShowFlightCmdであるとする規則はないため、入力されたセンテンスは理解されないことになる。
ChangeLoginPictureCmd→Please change my login icon
「ChangeLoginPicture」はコマンドであるため、規則に対するプロパティ部分がない。したがって、文法学習者は、獲得した規則中のセンテンス全体を単に「覚える」だけである。ユーザが発行したコマンドを認識して呼び出すためには、コマンドがトレーニングデータ中のセンテンス全体と一致しなければならない。一般化はまったくない。
Pr(<s>showflight</s>|ShowFlightCmd)=
Pr(show|<s>; ShowFlightCmd)*
Pr(flight|show;ShowFlightCmd)*
Pr(</s>|flight; ShowFlightCmd)
Pr(flight|show;ShowFlightCmd)=
backoff_weight*Pr(flight|ShowFlightCmd)
ShowFlightCmd→Show me the flight
は、それに関連付けられた極小(atomic)確率を有する。ただし、複合モデル351では、規則に関する確率は以下のように計算することができる。
Pr(ShowFlightCmd -> show me the flight) =
Pr(show|<s>; ShowFlightCmd) *
Pr(me|show; ShowFlightCmd) *
Pr(the|me; ShowFlightCmd) *
Pr(flight|the; ShowFlightCmd) *
Pr(</s>|flight; ShowFlightCmd)
1. log P(NewAppt) // 以前のクラス
2. log b(New | <s>; NewApptCmd) // 単語bigram
3. log b(meeting | new; NewApptCmd) // 単語bigram
4. log b(</s> | meeting; NewApptCmd) + // 単語bigram
log a( Attendee | <s>; NewAppt) // スロットbigram
5. log b(with | <s>; PreAttendee) // 単語bigram
6. log b(</s> | with; PreAttendee) // 単語bigram
7. log Pcfg(Peter | <Person>) // PCFG
8. 0
9. log b(</s> | <s>; PostAttendee) + // 単語bigram
log a( StartTime | Attendee; NewAppt) // スロットbigram
いずれの所望のプルーニングメカニズムも使用可能である。たとえば、1つのプルーニングメカニズムでは、トレリスの各列で、スコアが、同じ列の最大スコアよりも低いしきい値(5.0など)よりも小さい場合は、ノードの移行を行わないものとする。言い換えれば、同じ列内のノードに至る他のパスよりも105倍少ない見込みの場合、パスは拡張されない。復号器は、プルーニング後の堅固な解析プログラムよりもかなり速く実行される。
202 モデルオーサリング構成要素
204 ユーザインターフェース
206 スキーマ
208 トレーニング例テキスト文字列および注釈
209 文法ライブラリ
210 規則ベース文法
Claims (8)
- 自然言語理解(NLU)システムにおいて、スキーマから導出されたスロットおよび前終端記号に音声による自然言語入力をマッピングする際に使用するための構成要素を生成するように構成されたオーサリング構成要素であって、前記オーサリング構成要素は、
コンピュータのプロセッサを用いて実施されるモデルトレーナを備え、
前記モデルトレーナは、タスクが完了することを示すスキーマを得て、前記スキーマは、自然言語入力の一部で充填されるように構成されている、複数のスロットと複数の前終端記号とを含み、前記前終端記号は1つ以上の前記スロットに関連付けられたプリアンブルとポストアンブルの少なくとも1つを含み、
前記モデルトレーナは、トレーニングデータに基づいて規則ベース文法をトレーニングし、前記スキーマから導出された前記スロットに前記自然言語入力からの用語をマッピングし、前記自然言語入力からの用語を前記スキーマから導出された前記前終端記号にマッピングするように、複数の統計モデルをトレーニングするように構成され、前記モデルトレーナは複数の異なる前終端記号のそれぞれに対応する統計モデルをトレーニングするように構成され、前記モデルトレーナは前記トレーニングデータを受け取り、前記スロットおよび前終端記号を前記トレーニングデータに関連付ける、前記トレーニングデータのセグメント化を列挙し、前記モデルトレーナは、前記スキーマから導出された前終端記号のそれぞれに対する統計モデルを、前終端記号のそれぞれに関連付けられた前記テキストを前記統計モデルの前記前終端記号に対するトレーニングデータとして用いてトレーニングするように構成されていることを特徴とするオーサリング構成要素。 - 前記モデルトレーナは、スロット間の移行をモデル化する統計スロット移行モデルをトレーニングするように構成されることを特徴とする請求項1に記載のオーサリング構成要素。
- 前記スキーマはタスクを示し、前記モデルトレーナは、タスクの前の確率をモデル化する統計タスクモデルをトレーニングするように構成されることを特徴とする請求項1に記載のオーサリング構成要素。
- 前記モデルトレーナは、列挙された各セグメント化に予測カウントを割り当てるように構成されることを特徴とする請求項1に記載のオーサリング構成要素。
- 前記モデルトレーナは、前終端記号を選択し、前記選択された前終端記号に対応するセグメント化に割り当てられた前記予測カウントを使用して、前記選択された前終端記号に関する前記統計モデルをトレーニングするように構成されることを特徴とする請求項4に記載のオーサリング構成要素。
- 前記モデルトレーナは、期待値最大化(EM)アルゴリズムの適用に基づいて生成されたそれぞれのセグメント化に予測カウントを割り当てるように構成されることを特徴とする請求項4に記載のオーサリング構成要素。
- 前記モデルトレーナによってアクセス可能な確率ライブラリ文法をさらに含むことを特徴とする請求項1に記載のオーサリング構成要素。
- 前記トレーニングデータは意味的な注釈が付けられたトレーニングデータであり、前記モデルトレーナは、前記意味的な注釈が付けられたトレーニングデータに基づいて、前記確率ライブラリ文法において確率を適応するように構成されることを特徴とする請求項7に記載のオーサリング構成要素。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/427,604 US7603267B2 (en) | 2003-05-01 | 2003-05-01 | Rules-based grammar for slots and statistical model for preterminals in natural language understanding system |
US10/427,604 | 2003-05-01 |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2005115328A JP2005115328A (ja) | 2005-04-28 |
JP4724377B2 true JP4724377B2 (ja) | 2011-07-13 |
Family
ID=32990449
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2004130332A Expired - Lifetime JP4724377B2 (ja) | 2003-05-01 | 2004-04-26 | 自然言語理解(NLU)システムにおける規則ベース文法に関するスロットおよび前終端記号(preterminal)に関する統計モデル |
Country Status (7)
Country | Link |
---|---|
US (2) | US7603267B2 (ja) |
EP (1) | EP1475778B1 (ja) |
JP (1) | JP4724377B2 (ja) |
KR (1) | KR101120858B1 (ja) |
CN (1) | CN1542736B (ja) |
AT (1) | ATE492876T1 (ja) |
DE (1) | DE602004030635D1 (ja) |
Families Citing this family (137)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2544102A1 (en) | 2002-11-28 | 2013-01-09 | Nuance Communications Austria GmbH | Method to assign word class information |
US20060009966A1 (en) * | 2004-07-12 | 2006-01-12 | International Business Machines Corporation | Method and system for extracting information from unstructured text using symbolic machine learning |
JP4081056B2 (ja) * | 2004-08-30 | 2008-04-23 | 株式会社東芝 | 情報処理装置、情報処理方法及びプログラム |
US20060155530A1 (en) * | 2004-12-14 | 2006-07-13 | International Business Machines Corporation | Method and apparatus for generation of text documents |
WO2006084144A2 (en) * | 2005-02-03 | 2006-08-10 | Voice Signal Technologies, Inc. | Methods and apparatus for automatically extending the voice-recognizer vocabulary of mobile communications devices |
DE602005007939D1 (de) * | 2005-02-17 | 2008-08-14 | Loquendo Societa Per Azioni | Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb ekennungssystems liegen |
US7617093B2 (en) * | 2005-06-02 | 2009-11-10 | Microsoft Corporation | Authoring speech grammars |
US20060287846A1 (en) * | 2005-06-21 | 2006-12-21 | Microsoft Corporation | Generating grammar rules from prompt text |
US8700404B1 (en) * | 2005-08-27 | 2014-04-15 | At&T Intellectual Property Ii, L.P. | System and method for using semantic and syntactic graphs for utterance classification |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
KR100732167B1 (ko) * | 2005-09-08 | 2007-06-27 | 엘지전자 주식회사 | 스피커 어셈블리 및 영상기기 |
US8442828B2 (en) * | 2005-12-02 | 2013-05-14 | Microsoft Corporation | Conditional model for natural language understanding |
US7957968B2 (en) * | 2005-12-22 | 2011-06-07 | Honda Motor Co., Ltd. | Automatic grammar generation using distributedly collected knowledge |
US7865357B2 (en) * | 2006-03-14 | 2011-01-04 | Microsoft Corporation | Shareable filler model for grammar authoring |
US8244545B2 (en) * | 2006-03-30 | 2012-08-14 | Microsoft Corporation | Dialog repair based on discrepancies between user model predictions and speech recognition results |
US20070239453A1 (en) * | 2006-04-06 | 2007-10-11 | Microsoft Corporation | Augmenting context-free grammars with back-off grammars for processing out-of-grammar utterances |
US7689420B2 (en) * | 2006-04-06 | 2010-03-30 | Microsoft Corporation | Personalizing a context-free grammar using a dictation language model |
US7707027B2 (en) * | 2006-04-13 | 2010-04-27 | Nuance Communications, Inc. | Identification and rejection of meaningless input during natural language classification |
US8831943B2 (en) * | 2006-05-31 | 2014-09-09 | Nec Corporation | Language model learning system, language model learning method, and language model learning program |
WO2007138875A1 (ja) * | 2006-05-31 | 2007-12-06 | Nec Corporation | 音声認識用単語辞書・言語モデル作成システム、方法、プログラムおよび音声認識システム |
US8209175B2 (en) * | 2006-06-08 | 2012-06-26 | Microsoft Corporation | Uncertainty interval content sensing within communications |
DE102006029755A1 (de) * | 2006-06-27 | 2008-01-03 | Deutsche Telekom Ag | Verfahren und Vorrichtung zur natürlichsprachlichen Erkennung einer Sprachäußerung |
US10796390B2 (en) * | 2006-07-03 | 2020-10-06 | 3M Innovative Properties Company | System and method for medical coding of vascular interventional radiology procedures |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US20080071520A1 (en) * | 2006-09-14 | 2008-03-20 | David Lee Sanford | Method and system for improving the word-recognition rate of speech recognition software |
US8433576B2 (en) * | 2007-01-19 | 2013-04-30 | Microsoft Corporation | Automatic reading tutoring with parallel polarized language modeling |
US7856351B2 (en) | 2007-01-19 | 2010-12-21 | Microsoft Corporation | Integrated speech recognition and semantic classification |
US8332207B2 (en) * | 2007-03-26 | 2012-12-11 | Google Inc. | Large language models in machine translation |
US8306822B2 (en) * | 2007-09-11 | 2012-11-06 | Microsoft Corporation | Automatic reading tutoring using dynamically built language model |
JP4640407B2 (ja) | 2007-12-07 | 2011-03-02 | ソニー株式会社 | 信号処理装置、信号処理方法及びプログラム |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US8046222B2 (en) * | 2008-04-16 | 2011-10-25 | Google Inc. | Segmenting words using scaled probabilities |
US9460708B2 (en) * | 2008-09-19 | 2016-10-04 | Microsoft Technology Licensing, Llc | Automated data cleanup by substitution of words of the same pronunciation and different spelling in speech recognition |
KR101149521B1 (ko) * | 2008-12-10 | 2012-05-25 | 한국전자통신연구원 | 도메인 온톨로지를 이용한 음성 인식 방법 및 그 장치 |
US8990088B2 (en) * | 2009-01-28 | 2015-03-24 | Microsoft Corporation | Tool and framework for creating consistent normalization maps and grammars |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
KR20110006004A (ko) * | 2009-07-13 | 2011-01-20 | 삼성전자주식회사 | 결합인식단위 최적화 장치 및 그 방법 |
US20110238407A1 (en) * | 2009-08-31 | 2011-09-29 | O3 Technologies, Llc | Systems and methods for speech-to-speech translation |
WO2011083528A1 (ja) * | 2010-01-06 | 2011-07-14 | 日本電気株式会社 | データ処理装置、そのコンピュータプログラムおよびデータ処理方法 |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US8655647B2 (en) * | 2010-03-11 | 2014-02-18 | Microsoft Corporation | N-gram selection for practical-sized language models |
JP5710317B2 (ja) * | 2011-03-03 | 2015-04-30 | インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Machines Corporation | 情報処理装置、自然言語解析方法、プログラムおよび記録媒体 |
US9760566B2 (en) | 2011-03-31 | 2017-09-12 | Microsoft Technology Licensing, Llc | Augmented conversational understanding agent to identify conversation context between two humans and taking an agent action thereof |
US9298287B2 (en) | 2011-03-31 | 2016-03-29 | Microsoft Technology Licensing, Llc | Combined activation for natural user interface systems |
US9244984B2 (en) | 2011-03-31 | 2016-01-26 | Microsoft Technology Licensing, Llc | Location based conversational understanding |
US9858343B2 (en) | 2011-03-31 | 2018-01-02 | Microsoft Technology Licensing Llc | Personalization of queries, conversations, and searches |
US10642934B2 (en) | 2011-03-31 | 2020-05-05 | Microsoft Technology Licensing, Llc | Augmented conversational understanding architecture |
US9842168B2 (en) | 2011-03-31 | 2017-12-12 | Microsoft Technology Licensing, Llc | Task driven user intents |
CN102147731A (zh) * | 2011-04-20 | 2011-08-10 | 上海交通大学 | 基于扩展功能需求描述框架的功能需求自动抽取*** |
US9064006B2 (en) | 2012-08-23 | 2015-06-23 | Microsoft Technology Licensing, Llc | Translating natural language utterances to keyword search queries |
US9454962B2 (en) | 2011-05-12 | 2016-09-27 | Microsoft Technology Licensing, Llc | Sentence simplification for spoken language understanding |
US8886533B2 (en) * | 2011-10-25 | 2014-11-11 | At&T Intellectual Property I, L.P. | System and method for combining frame and segment level processing, via temporal pooling, for phonetic classification |
US9214157B2 (en) * | 2011-12-06 | 2015-12-15 | At&T Intellectual Property I, L.P. | System and method for machine-mediated human-human conversation |
US9280610B2 (en) | 2012-05-14 | 2016-03-08 | Apple Inc. | Crowd sourcing information to fulfill user requests |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
JP5819261B2 (ja) * | 2012-06-19 | 2015-11-18 | 株式会社Nttドコモ | 機能実行指示システム、機能実行指示方法及び機能実行指示プログラム |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
US10373615B2 (en) | 2012-10-30 | 2019-08-06 | Google Technology Holdings LLC | Voice control user interface during low power mode |
US10304465B2 (en) | 2012-10-30 | 2019-05-28 | Google Technology Holdings LLC | Voice control user interface for low power mode |
US9584642B2 (en) | 2013-03-12 | 2017-02-28 | Google Technology Holdings LLC | Apparatus with adaptive acoustic echo control for speakerphone mode |
US10381001B2 (en) | 2012-10-30 | 2019-08-13 | Google Technology Holdings LLC | Voice control user interface during low-power mode |
MX345622B (es) * | 2013-01-29 | 2017-02-08 | Fraunhofer Ges Forschung | Decodificador para generar una señal de audio mejorada en frecuencia, método de decodificación, codificador para generar una señal codificada y metodo de codificación utilizando informacion secundaria de selección compacta. |
US9330659B2 (en) * | 2013-02-25 | 2016-05-03 | Microsoft Technology Licensing, Llc | Facilitating development of a spoken natural language interface |
US10354677B2 (en) * | 2013-02-28 | 2019-07-16 | Nuance Communications, Inc. | System and method for identification of intent segment(s) in caller-agent conversations |
US9460088B1 (en) * | 2013-05-31 | 2016-10-04 | Google Inc. | Written-domain language modeling with decomposition |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
US10235359B2 (en) * | 2013-07-15 | 2019-03-19 | Nuance Communications, Inc. | Ontology and annotation driven grammar inference |
EP2851896A1 (en) | 2013-09-19 | 2015-03-25 | Maluuba Inc. | Speech recognition using phoneme matching |
US9449598B1 (en) * | 2013-09-26 | 2016-09-20 | Amazon Technologies, Inc. | Speech recognition with combined grammar and statistical language models |
US8768712B1 (en) | 2013-12-04 | 2014-07-01 | Google Inc. | Initiating actions based on partial hotwords |
US9601108B2 (en) * | 2014-01-17 | 2017-03-21 | Microsoft Technology Licensing, Llc | Incorporating an exogenous large-vocabulary model into rule-based speech recognition |
US10749989B2 (en) | 2014-04-01 | 2020-08-18 | Microsoft Technology Licensing Llc | Hybrid client/server architecture for parallel processing |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US9785630B2 (en) * | 2014-05-30 | 2017-10-10 | Apple Inc. | Text prediction using combined word N-gram and unigram language models |
US9767093B2 (en) * | 2014-06-19 | 2017-09-19 | Nuance Communications, Inc. | Syntactic parser assisted semantic rule inference |
US20150379166A1 (en) * | 2014-06-25 | 2015-12-31 | Linkedin Corporation | Model compilation for feature selection in statistical models |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
WO2017059500A1 (en) * | 2015-10-09 | 2017-04-13 | Sayity Pty Ltd | Frameworks and methodologies configured to enable streamlined integration of natural language processing functionality with one or more user interface environments, including assisted learning process |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
WO2017094967A1 (ko) * | 2015-12-03 | 2017-06-08 | 한국과학기술원 | 자연 언어 처리 스키마 및 그 지식 데이터베이스 구축 방법 및 시스템 |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179309B1 (en) | 2016-06-09 | 2018-04-23 | Apple Inc | Intelligent automated assistant in a home environment |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
US10360301B2 (en) * | 2016-10-10 | 2019-07-23 | International Business Machines Corporation | Personalized approach to handling hypotheticals in text |
CN106557461B (zh) * | 2016-10-31 | 2019-03-12 | 百度在线网络技术(北京)有限公司 | 基于人工智能的语义解析处理方法和装置 |
CN110352423B (zh) * | 2016-11-04 | 2021-04-20 | 渊慧科技有限公司 | 使用噪声信道模型生成目标序列的方法、存储介质和*** |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
DK179560B1 (en) | 2017-05-16 | 2019-02-18 | Apple Inc. | FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES |
CN117112761A (zh) * | 2017-09-05 | 2023-11-24 | 声音猎手公司 | 域间通过语法槽的分类 |
US10896297B1 (en) | 2017-12-13 | 2021-01-19 | Tableau Software, Inc. | Identifying intent in visual analytical conversations |
CN108615526B (zh) | 2018-05-08 | 2020-07-07 | 腾讯科技(深圳)有限公司 | 语音信号中关键词的检测方法、装置、终端及存储介质 |
US11055489B2 (en) * | 2018-10-08 | 2021-07-06 | Tableau Software, Inc. | Determining levels of detail for data visualizations using natural language constructs |
US11537276B2 (en) | 2018-10-22 | 2022-12-27 | Tableau Software, Inc. | Generating data visualizations according to an object model of selected data sources |
US11138374B1 (en) * | 2018-11-08 | 2021-10-05 | Amazon Technologies, Inc. | Slot type authoring |
US11308281B1 (en) * | 2018-11-08 | 2022-04-19 | Amazon Technologies, Inc. | Slot type resolution process |
US11281857B1 (en) * | 2018-11-08 | 2022-03-22 | Amazon Technologies, Inc. | Composite slot type resolution |
CN111292751B (zh) * | 2018-11-21 | 2023-02-28 | 北京嘀嘀无限科技发展有限公司 | 语义解析方法及装置、语音交互方法及装置、电子设备 |
US11314817B1 (en) | 2019-04-01 | 2022-04-26 | Tableau Software, LLC | Methods and systems for inferring intent and utilizing context for natural language expressions to modify data visualizations in a data visualization interface |
JP7393438B2 (ja) * | 2019-05-01 | 2023-12-06 | ボーズ・コーポレーション | コヒーレンスを使用した信号コンポーネント推定 |
US11455339B1 (en) | 2019-09-06 | 2022-09-27 | Tableau Software, LLC | Incremental updates to natural language expressions in a data visualization user interface |
US11163954B2 (en) * | 2019-09-18 | 2021-11-02 | International Business Machines Corporation | Propagation of annotation metadata to overlapping annotations of synonymous type |
US10997217B1 (en) | 2019-11-10 | 2021-05-04 | Tableau Software, Inc. | Systems and methods for visualizing object models of database tables |
CN112466291B (zh) * | 2020-10-27 | 2023-05-05 | 北京百度网讯科技有限公司 | 语言模型的训练方法、装置和电子设备 |
CN112987940B (zh) * | 2021-04-27 | 2021-08-27 | 广州智品网络科技有限公司 | 一种基于样本概率量化的输入方法、装置和电子设备 |
US20230162055A1 (en) * | 2021-11-22 | 2023-05-25 | Tencent America LLC | Hierarchical context tagging for utterance rewriting |
US20240086637A1 (en) * | 2022-09-08 | 2024-03-14 | Tencent America LLC | Efficient hybrid text normalization |
US11934794B1 (en) * | 2022-09-30 | 2024-03-19 | Knowbl Inc. | Systems and methods for algorithmically orchestrating conversational dialogue transitions within an automated conversational system |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2867695B2 (ja) * | 1990-11-28 | 1999-03-08 | 日本電気株式会社 | 連続音声認識装置 |
JP3265864B2 (ja) * | 1994-10-28 | 2002-03-18 | 三菱電機株式会社 | 音声認識装置 |
US6052483A (en) | 1994-11-04 | 2000-04-18 | Lucent Technologies Inc. | Methods and apparatus for classification of images using distribution maps |
US6292767B1 (en) * | 1995-07-18 | 2001-09-18 | Nuance Communications | Method and system for building and running natural language understanding systems |
JP3009636B2 (ja) * | 1996-05-16 | 2000-02-14 | 株式会社エイ・ティ・アール音声翻訳通信研究所 | 音声言語解析装置 |
US5797123A (en) * | 1996-10-01 | 1998-08-18 | Lucent Technologies Inc. | Method of key-phase detection and verification for flexible speech understanding |
US6317708B1 (en) * | 1999-01-07 | 2001-11-13 | Justsystem Corporation | Method for producing summaries of text document |
US7031908B1 (en) | 2000-06-01 | 2006-04-18 | Microsoft Corporation | Creating a language model for a language processing system |
US6865528B1 (en) | 2000-06-01 | 2005-03-08 | Microsoft Corporation | Use of a unified language model |
AU2001275845A1 (en) | 2000-06-26 | 2002-01-08 | Onerealm Inc. | Method and apparatus for normalizing and converting structured content |
US6230138B1 (en) * | 2000-06-28 | 2001-05-08 | Visteon Global Technologies, Inc. | Method and apparatus for controlling multiple speech engines in an in-vehicle speech recognition system |
US6952666B1 (en) | 2000-07-20 | 2005-10-04 | Microsoft Corporation | Ranking parser for a natural language processing system |
US6419431B1 (en) * | 2000-12-06 | 2002-07-16 | E-Z Trail, Inc. | Adjustable support for transport |
US7003444B2 (en) | 2001-07-12 | 2006-02-21 | Microsoft Corporation | Method and apparatus for improved grammar checking using a stochastic parser |
US7039579B2 (en) * | 2001-09-14 | 2006-05-02 | International Business Machines Corporation | Monte Carlo method for natural language understanding and speech recognition language models |
US7805302B2 (en) * | 2002-05-20 | 2010-09-28 | Microsoft Corporation | Applying a structured language model to information extraction |
-
2003
- 2003-05-01 US US10/427,604 patent/US7603267B2/en not_active Expired - Fee Related
- 2003-11-20 US US10/718,138 patent/US20040220809A1/en not_active Abandoned
-
2004
- 2004-04-16 AT AT04009124T patent/ATE492876T1/de not_active IP Right Cessation
- 2004-04-16 EP EP04009124A patent/EP1475778B1/en not_active Expired - Lifetime
- 2004-04-16 DE DE602004030635T patent/DE602004030635D1/de not_active Expired - Lifetime
- 2004-04-26 JP JP2004130332A patent/JP4724377B2/ja not_active Expired - Lifetime
- 2004-04-30 KR KR1020040030614A patent/KR101120858B1/ko active IP Right Grant
- 2004-05-08 CN CN2004100435917A patent/CN1542736B/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
KR101120858B1 (ko) | 2012-06-27 |
US20040220809A1 (en) | 2004-11-04 |
DE602004030635D1 (de) | 2011-02-03 |
ATE492876T1 (de) | 2011-01-15 |
EP1475778A1 (en) | 2004-11-10 |
CN1542736B (zh) | 2011-08-03 |
US7603267B2 (en) | 2009-10-13 |
CN1542736A (zh) | 2004-11-03 |
EP1475778B1 (en) | 2010-12-22 |
JP2005115328A (ja) | 2005-04-28 |
US20040220797A1 (en) | 2004-11-04 |
KR20040094645A (ko) | 2004-11-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4724377B2 (ja) | 自然言語理解(NLU)システムにおける規則ベース文法に関するスロットおよび前終端記号(preterminal)に関する統計モデル | |
US11238845B2 (en) | Multi-dialect and multilingual speech recognition | |
US7529657B2 (en) | Configurable parameters for grammar authoring for speech recognition and natural language understanding | |
US7478038B2 (en) | Language model adaptation using semantic supervision | |
US7617093B2 (en) | Authoring speech grammars | |
US7451125B2 (en) | System and method for compiling rules created by machine learning program | |
US7031908B1 (en) | Creating a language model for a language processing system | |
US7634406B2 (en) | System and method for identifying semantic intent from acoustic information | |
US20070129936A1 (en) | Conditional model for natural language understanding | |
EP1475779B1 (en) | System with composite statistical and rules-based grammar model for speech recognition and natural language understanding | |
JP2004246368A (ja) | テキストから単語誤り率を予測するための方法および装置 | |
JP2005293580A (ja) | Arpa標準フォーマットによる、削除補間nグラム言語モデルの表現 | |
JP4738753B2 (ja) | 文法オーサリングにおけるセグメント化あいまい性(segmentationambiguity)の自動的な解決 | |
KR102026967B1 (ko) | n-gram 데이터 및 언어 분석에 기반한 문법 오류 교정장치 및 방법 | |
Wang et al. | Combination of CFG and n-gram modeling in semantic grammar learning. | |
Švec et al. | Semantic entity detection from multiple ASR hypotheses within the WFST framework | |
Acero et al. | A semantically structured language model | |
MXPA97002521A (es) | Metodo y aparato para un sistema de reconocimientode lenguaje mejorado |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20070330 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20100326 |
|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20100628 |
|
TRDD | Decision of grant or rejection written | ||
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20110401 |
|
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 |
|
A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20110411 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20140415 Year of fee payment: 3 |
|
R150 | Certificate of patent or registration of utility model |
Ref document number: 4724377 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
S111 | Request for change of ownership or part of ownership |
Free format text: JAPANESE INTERMEDIATE CODE: R313113 |
|
R350 | Written notification of registration of transfer |
Free format text: JAPANESE INTERMEDIATE CODE: R350 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |