JP2004053979A5

JP2004053979A5 -

Info

Publication number: JP2004053979A5
Application number: JP2002212058A
Authority: JP
Filing date: 2002-07-22
Publication date: 2005-09-22

Claims

コンピュータシステムを用いて、人間が発声した音声を認識するために用いられる音声認識辞書を作成する音声認識辞書作成方法であって、
コンピュータシステムにおいて、前記音声認識辞書によって認識対象とするテキストを、当該テキストに含まれる所定の記号文字をスペース文字に置き換えたテキストに変換する変換ステップと、
コンピュータシステムにおいて、前記変換ステップで変換されたテキストの発音を表す発音データを生成する発音データ生成ステップと、
コンピュータシステムにおいて、前記発音データ生成ステップで生成された発音データを、前記認識対象とするテキストを認識するための発音データとして前記音声認識辞書に格納するステップとを有することを特徴とする音声認識辞書作成方法。A speech recognition dictionary creation method for creating a speech recognition dictionary used for recognizing speech uttered by a human using a computer system,
In the computer system, a conversion step of converting a text to be recognized by the speech recognition dictionary into a text in which a predetermined symbol character included in the text is replaced with a space character;
In a computer system, a pronunciation data generation step for generating pronunciation data representing the pronunciation of the text converted in the conversion step;
In the computer system, the speech recognition dictionary comprising the step of storing the pronunciation data generated in the pronunciation data generation step in the speech recognition dictionary as pronunciation data for recognizing the text to be recognized How to make.

請求項１記載の音声認識辞書作成方法であって、
コンピュータシステムを用いて、人間が発声した音声を認識するために用いられる音声認識辞書を作成する音声認識辞書作成方法であって、
コンピュータシステムにおいて、前記音声認識辞書によって認識対象とするテキストを、当該テキストに含まれる記号文字"#"の文字列"number"への置き換えと、当該テキストに含まれる記号文字"&"の文字列"and"への置き換えと、当該テキストに含まれる記号文字"@"の文字列"at"への置き換えとのうちの少なくとも一つの置き換えを行ったテキストに変換する変換ステップと、
コンピュータシステムにおいて、前記変換ステップで変換されたテキストの発音を表す発音データを生成する発音データ生成ステップと、
コンピュータシステムにおいて、前記発音データ生成ステップで生成された発音データを、前記認識対象とするテキストを認識するための発音データとして前記音声認識辞書に格納するステップとを有することを特徴とする音声認識辞書作成方法。The speech recognition dictionary creation method according to claim 1,
A speech recognition dictionary creation method for creating a speech recognition dictionary used for recognizing speech uttered by a human using a computer system,
In the computer system, the text to be recognized by the speech recognition dictionary is replaced with the character string “number” of the symbol character “#” included in the text, and the character string of the symbol character “&” included in the text a conversion step for converting the text into at least one of the replacement of "and" and the replacement of the symbol character "@" included in the text with the character string "at";
In a computer system, a pronunciation data generation step for generating pronunciation data representing the pronunciation of the text converted in the conversion step;
In the computer system, the speech recognition dictionary comprising the step of storing the pronunciation data generated in the pronunciation data generation step in the speech recognition dictionary as pronunciation data for recognizing the text to be recognized How to make.

コンピュータシステムを用いて、人間が発声した音声を認識するために用いられる音声認識辞書を作成する音声認識辞書作成方法であって、
コンピュータシステムにおいて、前記音声認識辞書によって認識対象とするテキストを、当該テキストに含まれる第１の言語に含まれ第２の言語に含まれない文字を、当該第１の言語の文字の発音に相当または近似する発音を有する前記第２の言語の文字に置き換えたテキストに変換する変換ステップと、
コンピュータシステムにおいて、前記変換ステップで変換されたテキストの前記第２の言語の発音ルールに従った発音を表す発音データを生成する発音データ生成ステップと、
コンピュータシステムにおいて、前記発音データ生成ステップで生成された発音データを、前記認識対象とするテキストを認識するための発音データとして前記音声認識辞書に格納するステップとを有することを特徴とする音声認識辞書作成方法。A speech recognition dictionary creation method for creating a speech recognition dictionary used for recognizing speech uttered by a human using a computer system,
In the computer system, the text to be recognized by the speech recognition dictionary is a character included in the first language included in the text but not included in the second language, and corresponds to the pronunciation of the character in the first language. Or a conversion step of converting to text replaced with characters of the second language having approximate pronunciation;
In the computer system, a pronunciation data generation step for generating pronunciation data representing pronunciation according to the pronunciation rules of the second language of the text converted in the conversion step;
In the computer system, the speech recognition dictionary comprising the step of storing the pronunciation data generated in the pronunciation data generation step in the speech recognition dictionary as pronunciation data for recognizing the text to be recognized How to make.

コンピュータシステムを用いて、人間が発声した音声を認識するために用いられる音声認識辞書を作成する音声認識辞書作成方法であって、
コンピュータシステムにおいて、前記音声認識辞書によって認識対象とするテキストが、第１の言語によって対象を略記したテキストであった場合に、当該テキストが表す対象を略記せずに第１の言語によって表したテキストに含まれる第１の言語に含まれ第２の言語に含まれない文字を、当該第１の言語による文字の発音に相当または近似する発音を有する第２の言語の文字に置き換えたテキストに、前記認識対象とするテキストを変換する変換ステップと、
コンピュータシステムにおいて、前記変換ステップで変換されたテキストの前記第２の言語の発音ルールに従った発音を表す発音データを生成する発音データ生成ステップと、
コンピュータシステムにおいて、前記発音データ生成ステップで生成された発音データを、前記認識対象とするテキストを認識するための発音データとして前記音声認識辞書に格納するステップとを有することを特徴とする音声認識辞書作成方法。A speech recognition dictionary creation method for creating a speech recognition dictionary used for recognizing speech uttered by a human using a computer system,
In the computer system, when the text to be recognized by the speech recognition dictionary is a text in which the object is abbreviated in the first language, the text expressed in the first language without abbreviating the object to be represented by the text A character that is included in the first language and is not included in the second language is replaced with a character in the second language having a pronunciation equivalent to or similar to the pronunciation of the character in the first language, A conversion step of converting the text to be recognized;
In the computer system, a pronunciation data generation step for generating pronunciation data representing pronunciation according to the pronunciation rules of the second language of the text converted in the conversion step;
In the computer system, the speech recognition dictionary comprising the step of storing the pronunciation data generated in the pronunciation data generation step in the speech recognition dictionary as pronunciation data for recognizing the text to be recognized How to make.

人間が発声した音声を認識するために用いられる音声認識辞書を作成する音声認識辞書作成システムであって、
テキストの変換ルールを格納した変換ルールテーブルと、
前記音声認識辞書によって認識対象とするテキストを、前記変換ルールテーブルの変換ルールに従って変換する変換手段と、
前記変換手段で変換されたテキストの発音を表す発音データを生成する発音データ生成手段と、
前記発音データ生成ステップで生成された発音データを、前記認識対象とするテキストを認識するための発音データとして前記音声認識辞書に格納する格納手段とを有し、
前記変換ルールテーブルに格納された変換ルールは、テキストを、当該テキストに含まれる所定の記号文字をスペース文字に置き換えたテキストに変換するものであることを特徴とする音声認識辞書作成システム。A speech recognition dictionary creation system that creates a speech recognition dictionary used to recognize speech uttered by a human,
A conversion rule table storing text conversion rules;
Conversion means for converting the text to be recognized by the speech recognition dictionary according to the conversion rule of the conversion rule table;
Pronunciation data generation means for generating pronunciation data representing the pronunciation of the text converted by the conversion means;
Storage means for storing the pronunciation data generated in the pronunciation data generation step in the speech recognition dictionary as pronunciation data for recognizing the text to be recognized;
The speech recognition dictionary creation system, wherein the conversion rule stored in the conversion rule table converts text into text obtained by replacing a predetermined symbol character included in the text with a space character.

ユーザからの音声入力を受け付けるナビゲーション装置であって、
テキストと当該テキストに対応する発音データとの対応が登録された音声認識辞書を記憶した記憶手段と、
マイクと、
前記マイクから入力した音声に整合する発音データに対応して前記音声認識辞書に登録されているテキストをユーザが音声入力した内容を表すテキストとして認識する音声認識手段とを有し、
前記音声認識辞書において、所定の記号文字を含むテキストについては、当該テキストに対応する発音データとして、当該テキストに含まれる所定の記号文字をスペース文字に置き換えたテキストを音声データ化して得られた発音データが登録されていることを特徴とするナビゲーション装置。A navigation device that accepts voice input from a user,
Storage means for storing a speech recognition dictionary in which correspondence between text and pronunciation data corresponding to the text is registered;
With a microphone,
Speech recognition means for recognizing text registered in the speech recognition dictionary corresponding to pronunciation data matched with speech input from the microphone as text representing the content input by the user;
In the speech recognition dictionary, for a text including a predetermined symbol character, as a pronunciation data corresponding to the text, a pronunciation obtained by converting the text obtained by replacing the predetermined symbol character included in the text with a space character into speech data A navigation apparatus characterized in that data is registered.

ユーザからの音声入力を受け付けるナビゲーション装置であって、
テキストと当該テキストに対応する発音データとの対応が登録された音声認識辞書を記憶した記憶手段と、
マイクと、
前記マイクから入力した音声に整合する発音データに対応して前記音声認識辞書に登録されているテキストをユーザが音声入力した内容を表すテキストとして認識する音声認識手段とを有し、
前記音声認識辞書において、記号文字"#"を含むテキストについては、当該テキストに対応する発音データとして、当該テキストに含まれる記号文字"#"を文字列"number"に置き換えたテキストを音声データ化して得られた発音データが登録されており、記号文字"&"を含むテキストについては、当該テキストに対応する発音データとして、当該テキストに含まれる記号文字"&"を文字列"and"に置き換えたテキストを音声データ化して得られた発音データが登録されており、記号文字"@"を含むテキストについては、当該テキストに対応する発音データとして、当該テキストに含まれる記号文字"@"を文字列"at"に置き換えたテキストを音声データ化して得られた発音データが登録されていることを特徴とするナビゲーション装置。A navigation device that accepts voice input from a user,
Storage means for storing a speech recognition dictionary in which correspondence between text and pronunciation data corresponding to the text is registered;
With a microphone,
Speech recognition means for recognizing text registered in the speech recognition dictionary corresponding to pronunciation data matched with speech input from the microphone as text representing the content input by the user;
In the speech recognition dictionary, for text that includes the symbol character “#”, as pronunciation data corresponding to the text, the text in which the symbol character “#” included in the text is replaced with the character string “number” is converted into speech data. The phonetic data obtained in this way is registered, and for text that includes the symbol character "&", the symbol character "&" contained in the text is replaced with the string "and" as the pronunciation data corresponding to the text. The pronunciation data obtained by converting the text to speech data is registered. For text containing the symbol character "@", the symbol character "@" contained in the text is used as the pronunciation data corresponding to the text. A navigation device, wherein pronunciation data obtained by converting the text replaced with the column "at" into speech data is registered.

ユーザからの音声入力を受け付けるナビゲーション装置であって、
テキストと当該テキストに対応する発音データとの対応が登録された音声認識辞書を記憶した記憶手段と、
マイクと、
前記マイクから入力した音声に整合する発音データに対応して前記音声認識辞書に登録されているテキストをユーザが音声入力した内容を表すテキストとして認識する音声認識手段とを有し、
前記音声認識辞書において、第１の言語に含まれ第２の言語に含まれない文字を含むテキストについては、当該テキストに対応する発音データとして、当該テキストに含まれる第１の言語に含まれ第２の言語に含まれない文字を、当該第１の言語の文字の発音に相当または近似する発音を有する前記第２の言語の文字に置き換えたテキストを音声データ化して得られた発音データが登録されていることを特徴とするナビゲーション装置。A navigation device that accepts voice input from a user,
Storage means for storing a speech recognition dictionary in which correspondence between text and pronunciation data corresponding to the text is registered;
With a microphone,
Speech recognition means for recognizing text registered in the speech recognition dictionary corresponding to pronunciation data matched with speech input from the microphone as text representing the content input by the user;
In the speech recognition dictionary, text that includes characters that are included in the first language and not included in the second language is included in the first language included in the text as pronunciation data corresponding to the text. Phonetic data obtained by converting text that is not included in the second language to text in the second language having a pronunciation equivalent to or similar to the pronunciation of the first language is registered as speech data The navigation apparatus characterized by being made.

ユーザからの音声入力を受け付けるナビゲーション装置であって、
テキストと当該テキストに対応する発音データとの対応が登録された音声認識辞書を記憶した記憶手段と、
マイクと、
前記マイクから入力した音声に整合する発音データに対応して前記音声認識辞書に登録されているテキストをユーザが音声入力した内容を表すテキストとして認識する音声認識手段とを有し、
前記音声認識辞書において、第１の言語によって対象を略記したテキストについては、当該テキストに対応する発音データとして、当該テキストが表す対象を略記せずに第１の言語によって表したテキストに含まれる第１の言語に含まれ第２の言語に含まれない文字を、当該第１の言語による文字の発音に相当または近似する発音を有する第２の言語の文字に置き換えたテキストを音声データ化して得られた発音データが登録されていることを特徴とするナビゲーション装置。A navigation device that accepts voice input from a user,
Storage means for storing a speech recognition dictionary in which correspondence between text and pronunciation data corresponding to the text is registered;
With a microphone,
Speech recognition means for recognizing text registered in the speech recognition dictionary corresponding to pronunciation data matched with speech input from the microphone as text representing the content input by the user;
In the speech recognition dictionary, for a text in which an object is abbreviated in a first language, the phonetic data corresponding to the text is included in the text expressed in the first language without abbreviating the object represented by the text. A text obtained by replacing a character included in one language and not included in the second language with a character in the second language having a pronunciation equivalent to or similar to the pronunciation of the character in the first language is obtained as voice data. Navigation device characterized in that recorded pronunciation data is registered.