JP2000112845A

JP2000112845A - Electronic mail system with voice information

Info

Publication number: JP2000112845A
Application number: JP10281273A
Authority: JP
Inventors: Tetsuya Takeji; 徹也武次; Keizo Nakatani; 敬三中谷
Original assignee: NEC Software Kobe Ltd
Current assignee: NEC Software Kobe Ltd
Priority date: 1998-10-02
Filing date: 1998-10-02
Publication date: 2000-04-21

Abstract

PROBLEM TO BE SOLVED: To provide an electronic mail (e-mail) system with voice information capable of reading an e-mail aloud in a voice similar to the data sender's voice. SOLUTION: A mail transmission part 11 transmits a mail document 112 consisting of an inputted mail text 113 and speaking data 114 to be used for voice synthesis by rule. On receiving the mail document 121, a mail receiving part 12 executes voice synthesis using the speaking data 123 included in the document 121, drives a voice output device 14 based on the voice synthesis result, and displays the mail text 122 in the document 121 on a display device 13.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Field of the Invention] The present invention relates to an electronic mail system, and more particularly to an e-mail system with voice notification that can read an e-mail received on the receiving side aloud in the voice of the data sender.

[0002]

[Description of the Related Art] In a conventional e-mail system, the sending side generally transmits character information, and the receiving side displays the received character information on a display so that it can be checked visually. However, because this makes it hard to tell who the sender is, e-mail systems with voice notification have been proposed that display the received character information on a display and also announce it by voice using a speech synthesis system.

[0003] For example, in the method described in Japanese Patent Application Laid-Open No. 10-133981, the receiving side uses an identification number that identifies the sender, received from the sending side, to retrieve pre-registered parameters such as gender information, age information, speech rate information, and voice tone information. A speech synthesis system then combines these parameters to synthesize speech from the received character information.
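The prior-art lookup described above can be pictured with a minimal sketch. The registry contents, field names, and the `lookup_profile` helper are all hypothetical; the cited publication is only summarized here, so this is an illustration of the idea, not its actual implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceProfile:
    """Coarse, pre-registered voice parameters (field names are illustrative)."""
    gender: str
    age: int
    speech_rate: float  # relative speaking rate, 1.0 = normal (assumed scale)
    tone: str


# Hypothetical registry held on the receiving side, keyed by sender ID.
REGISTERED_PROFILES = {
    "0001": VoiceProfile(gender="female", age=30, speech_rate=1.1, tone="bright"),
    "0002": VoiceProfile(gender="male", age=55, speech_rate=0.9, tone="low"),
}


def lookup_profile(sender_id: str) -> VoiceProfile:
    """Return the profile registered for the sender ID received with the mail."""
    return REGISTERED_PROFILES[sender_id]
```

Two senders who share the same gender, age, rate, and tone values would sound identical under such a scheme, which is the limitation the following paragraphs address.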

[0004]

[Problems to Be Solved by the Invention] Because the e-mail system with voice notification in the above publication synthesizes speech from a limited set of parameters, the synthesized speech can differ at the level of gender, age, speech rate, voice tone, and the like, but it has the drawback that it cannot be made distinctive enough to identify the data sender.

[0005] An object of the present invention is to provide an e-mail system with voice notification that can read mail aloud in a voice close to that of the data sender.

[0006]

[Means for Solving the Problems] The first invention of the present application is an e-mail system with voice notification characterized by comprising: mail transmission means for transmitting a mail document consisting of an input mail text and utterance data for synthesizing speech from the mail text; and mail reception means that, on receiving the mail document, performs speech synthesis using the utterance data in the mail document, drives a voice output device provided in advance with the speech synthesis result, and displays the text in the mail document on a display device provided in advance.

[0007] The second invention of the present application is characterized in that, in the first invention, the mail transmission means includes a voice conversion unit that outputs, as the utterance data, parameters for synthesizing speech from the mail text by rule-based synthesis, and the mail reception means includes a speech synthesis unit that takes the parameters as input and synthesizes speech.

[0008] The third invention of the present application is characterized in that, in the second invention, the parameters include LSP parameters input to an LSP speech synthesis digital filter of the speech synthesis unit and a pitch pattern input to a sound source generation unit of the speech synthesis unit.

[0009] The fourth invention of the present application is an e-mail system with voice notification characterized by comprising: mail transmission means for transmitting utterance data for synthesizing speech from an input mail text; and mail reception means that performs speech synthesis using the received utterance data, drives a voice output device provided in advance with the speech synthesis result, and also performs speech recognition on the speech synthesis result and outputs it as a document to a display device provided in advance.

[0010] The fifth invention of the present application is characterized in that, in the fourth invention, the mail transmission means includes a voice conversion unit that outputs, as the utterance data, parameters for synthesizing speech from the mail text by rule-based synthesis, and the mail reception means includes a speech synthesis unit that takes the parameters as input and synthesizes speech.

[0011] The sixth invention of the present application is characterized in that, in the fifth invention, the parameters include LSP parameters input to an LSP speech synthesis digital filter of the speech synthesis unit and a pitch pattern input to a sound source generation unit of the speech synthesis unit.

[0012] The seventh invention of the present application is an e-mail system with voice notification characterized by comprising: HTML document creation means for transmitting an HTML document consisting of an input mail text and utterance data for synthesizing speech from the mail text; a WEB server that stores the HTML document; and a browser that performs speech synthesis using the utterance data in the HTML document received from the WEB server, drives a voice output device provided in advance with the speech synthesis result, and displays the mail text in the HTML document on a display device provided in advance.

[0013] The eighth invention of the present application is characterized in that, in the seventh invention, the HTML document creation means includes a voice conversion unit that outputs, as the utterance data, parameters for synthesizing speech from the mail text by rule-based synthesis, and the browser includes a speech synthesis unit that takes the parameters as input and synthesizes speech.

[0014] The ninth invention of the present application is characterized in that, in the eighth invention, the parameters include LSP parameters input to an LSP speech synthesis digital filter of the speech synthesis unit and a pitch pattern input to a sound source generation unit of the speech synthesis unit.

[0015] The tenth invention of the present application is an e-mail system with voice notification characterized by comprising: HTML document creation means for transmitting, as an HTML document, utterance data for synthesizing speech from an input mail text; a WEB server that stores the HTML document; and a browser that performs speech synthesis using the utterance data in the HTML document received from the WEB server, drives a voice output device provided in advance with the speech synthesis result, and also performs speech recognition on the speech synthesis result and outputs it as a document to a display device provided in advance.

[0016] The eleventh invention of the present application is characterized in that, in the tenth invention, the HTML document creation means includes a voice conversion unit that outputs, as the utterance data, parameters for synthesizing speech from the mail text by rule-based synthesis, and the browser includes a speech synthesis unit that takes the parameters as input and synthesizes speech.

[0017] The twelfth invention of the present application is characterized in that, in the eleventh invention, the parameters include LSP parameters input to an LSP speech synthesis digital filter of the speech synthesis unit and a pitch pattern input to a sound source generation unit of the speech synthesis unit.

[0018] [Operation] To provide an e-mail system with voice notification that can read mail aloud in a voice close to that of the data sender, the mail transmission unit transmits a mail document consisting of the input mail text and utterance data for synthesizing speech from the mail text by rule-based synthesis. On receiving the mail document, the mail reception unit performs speech synthesis using the utterance data in the mail document, drives the voice output device with the speech synthesis result, and displays the mail text in the mail document on the display device.

[0019]

[Embodiments of the Invention] Next, embodiments of the present invention will be described in detail with reference to the drawings.

[0020] FIG. 1 is a block diagram showing a first embodiment of the e-mail system with voice notification according to the present invention.

[0021] Referring to FIG. 1, the first embodiment of the present invention comprises a mail transmission unit 11, a mail reception unit 12, a display device 13, and a voice output device 14. The mail transmission unit 11 includes a mail document 112, which consists of the input mail text 113 and utterance data 114 for synthesizing speech from the mail text 113, and a voice conversion unit 111 that extracts the utterance data 114 for synthesizing speech from the mail text 113. The mail reception unit 12 includes a mail document 121, which consists of the received text 122 and utterance data 123, and a speech synthesis unit 124 that performs speech synthesis using the utterance data 123. The display device 13 displays the text 122, and the voice output device 14 outputs the result synthesized by the speech synthesis unit 124 as voice.
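As a rough illustration of the mail document (112 on the sending side, 121 on the receiving side), the sketch below pairs the mail text with an opaque block of utterance data and round-trips it through a byte string. The length-prefixed layout and the `MailDocument` name are assumptions made for the example; the patent does not specify a wire format.

```python
import struct
from dataclasses import dataclass


@dataclass(frozen=True)
class MailDocument:
    """Mail text plus utterance data, mirroring mail document 112 / 121."""
    body: str              # mail text (113 when sent, 122 when received)
    utterance_data: bytes  # utterance data (114 when sent, 123 when received)

    def encode(self) -> bytes:
        """Serialize as: 4-byte big-endian text length, UTF-8 text, utterance data."""
        text = self.body.encode("utf-8")
        return struct.pack(">I", len(text)) + text + self.utterance_data

    @classmethod
    def decode(cls, raw: bytes) -> "MailDocument":
        """Inverse of encode(): split the byte string back into text and data."""
        (text_len,) = struct.unpack_from(">I", raw, 0)
        text = raw[4:4 + text_len].decode("utf-8")
        return cls(body=text, utterance_data=raw[4 + text_len:])
```

Keeping the two parts separate in the structure matches the description: the display device only needs `body`, and the speech synthesis unit only needs `utterance_data`.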

[0022] The voice conversion unit 111 outputs, as the utterance data described above, the parameters used when synthesizing speech from a text document by rule-based synthesis. These parameters are the LSP parameters input to the LSP speech synthesis digital filter of the speech synthesis unit 124 and the pitch pattern input to the sound source generation unit. For details, see "Speech Synthesis", Journal of the Institute of Electronics, Information and Communication Engineers (IEICE), 4/87, Vol. 70, No. 4.
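One way to picture the utterance data is as a sequence of frames, each holding a set of LSP parameters and one pitch value. The sketch below is an assumption-laden illustration: the frame layout, the order of 10, and the names `UtteranceFrame` and `is_valid_frame` are not from the patent. The validity check relies only on the well-known property that LSP frequencies of a stable filter are strictly increasing between 0 and π.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class UtteranceFrame:
    """One frame of utterance data (layout is an assumption for illustration)."""
    lsp: List[float]  # LSP frequencies in radians, for the LSP synthesis filter
    pitch_hz: float   # pitch pattern value for the sound source; 0.0 = unvoiced


def is_valid_frame(frame: UtteranceFrame, order: int = 10) -> bool:
    """Check the frame has `order` LSPs, strictly increasing within (0, pi)."""
    if len(frame.lsp) != order or frame.pitch_hz < 0:
        return False
    bounded = [0.0, *frame.lsp, math.pi]
    return all(a < b for a, b in zip(bounded, bounded[1:]))
```

A receiver could run such a check before handing frames to its synthesis filter, since out-of-order LSP values would not describe a usable filter.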

[0023] The speech synthesis unit 124 synthesizes speech using the LSP parameters and the pitch pattern generated by the voice conversion unit 111 and outputs it to the voice output device 14 (for example, a speaker). For details, see "Speech Coding", Journal of the IEICE, 4/87, Vol. 70, No. 4.
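The sound source generation step can be sketched in isolation. The example below assumes the classic vocoder excitation model (an impulse train at the pitch period for voiced frames, low-level noise for unvoiced frames), which the patent does not confirm; the sample rate, frame length, and function name are likewise hypothetical. The LSP synthesis filter that would shape this excitation is deliberately not implemented here.

```python
import random
from typing import List


def generate_excitation(
    pitch_hz_per_frame: List[float],
    sample_rate: int = 8000,  # assumed sampling rate
    frame_len: int = 160,     # assumed frame length in samples (20 ms at 8 kHz)
    seed: int = 0,
) -> List[float]:
    """Build an excitation signal from a per-frame pitch pattern."""
    rng = random.Random(seed)
    out: List[float] = []
    next_pulse = 0  # absolute sample index of the next pitch pulse
    for frame_index, pitch in enumerate(pitch_hz_per_frame):
        start = frame_index * frame_len
        if pitch <= 0:
            # Unvoiced frame: low-level noise, and restart pulse timing afterwards.
            out.extend(rng.uniform(-0.1, 0.1) for _ in range(frame_len))
            next_pulse = start + frame_len
            continue
        period = max(1, round(sample_rate / pitch))
        next_pulse = max(next_pulse, start)
        for n in range(start, start + frame_len):
            if n == next_pulse:
                out.append(1.0)  # pitch pulse
                next_pulse += period
            else:
                out.append(0.0)
    return out
```

Carrying `next_pulse` across voiced frames keeps the pulse spacing continuous at frame boundaries, so a pitch pattern that changes slowly produces a smoothly varying pulse train.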

[0024] If the mail sender's own phonological rules and prosodic rules are used in rule-based synthesis, the synthesized speech can be made close to the sender's own utterance. In addition, speech synthesis using LSP parameters and a pitch pattern is one of the lowest-bit-rate speech coding methods, which makes it possible to compress the data transmitted to the mail reception unit 12.
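To get a feel for the compression argument, the following back-of-the-envelope sketch compares a parametric frame stream with uncompressed PCM audio. Every number in the usage lines (10 LSP values of 4 bits each, a 7-bit pitch value, 20 ms frames, 8 kHz 16-bit PCM) is an illustrative assumption, not a figure from the patent.

```python
def parametric_bitrate(
    lsp_order: int, bits_per_lsp: int, bits_per_pitch: int, frame_ms: float
) -> float:
    """Bits per second when each frame carries LSP parameters and one pitch value."""
    bits_per_frame = lsp_order * bits_per_lsp + bits_per_pitch
    return bits_per_frame * 1000 / frame_ms


def pcm_bitrate(sample_rate: int, bits_per_sample: int) -> int:
    """Bits per second for uncompressed single-channel PCM audio."""
    return sample_rate * bits_per_sample


# Illustrative numbers only (all assumed):
print(parametric_bitrate(10, 4, 7, 20))  # 2350.0
print(pcm_bitrate(8000, 16))             # 128000
```

Under these assumed settings the parametric stream is on the order of fifty times smaller than the raw waveform, which is the kind of saving the paragraph above refers to.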

[0025] Next, the operation of this embodiment will be described in detail with reference to FIG. 1.

[0026] First, the text 113 of the mail document 112 is created by a conventional method and input to the mail transmission unit 11. From the input text 113, the voice conversion unit 111 creates the parameters for voice conversion and outputs them as the utterance data 114. The created text 113 and utterance data 114 are transmitted as the mail document 112.

[0027] Next, on receiving the mail document 112 (identical to the mail document 121), the mail reception unit 12 displays the mail text 122 on the display device 13, performs speech synthesis in the speech synthesis unit 124 using the utterance data 123, and outputs the synthesis result as voice through the voice output device 14.
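The send and receive steps above could be carried over ordinary e-mail by attaching the utterance data to the text. The sketch below uses Python's standard `email` package and assumes, purely for illustration, that the utterance data travels as an `application/octet-stream` attachment named `utterance.bin`; the patent does not say how the two parts are packaged, and the subject line and function names are made up.

```python
from email import message_from_bytes, policy
from email.message import EmailMessage
from typing import Tuple


def build_mail_document(body: str, utterance_data: bytes) -> bytes:
    """Sender side: package the mail text and utterance data as one message."""
    msg = EmailMessage()
    msg["Subject"] = "Mail with utterance data"  # hypothetical subject
    msg.set_content(body)
    msg.add_attachment(
        utterance_data,
        maintype="application",
        subtype="octet-stream",
        filename="utterance.bin",  # hypothetical attachment name
    )
    return msg.as_bytes()


def read_mail_document(raw: bytes) -> Tuple[str, bytes]:
    """Receiver side: recover the text to display and the data to synthesize."""
    msg = message_from_bytes(raw, policy=policy.default)
    body_part = msg.get_body(preferencelist=("plain",))
    body = body_part.get_content() if body_part is not None else ""
    utterance = b""
    for part in msg.iter_attachments():
        if part.get_content_type() == "application/octet-stream":
            utterance = part.get_content()
    # set_content() stores the text with a trailing newline; strip it again.
    return body.rstrip("\n"), utterance
```

On the receiving side, the returned text would go to the display device and the returned bytes to the speech synthesis unit.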

[0028] In the above description of the operation, the mail reception unit 12 displays the mail text 122 on the display device 13 and outputs voice from the voice output device 14 in parallel, but it goes without saying that only one of the two may be output according to the mail recipient's wishes. In that case, the mail transmission unit 11 may unconditionally send both the mail text 113 and the utterance data 114 and let the mail reception unit 12 select which to output, or the mail transmission unit 11 and the mail reception unit 12 may negotiate so that only the form the recipient wishes to output is transmitted.
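The receiver-side selection described above (both parts arrive, the recipient chooses what to output) might look like the following sketch. The `OutputMode` enum and the `display` and `speak` callbacks are hypothetical stand-ins for the display device and for the speech synthesis unit plus voice output device.

```python
from enum import Enum
from typing import Callable


class OutputMode(Enum):
    """Recipient's preference (names are illustrative)."""
    TEXT = "text"
    VOICE = "voice"
    BOTH = "both"


def present_mail(
    body: str,
    utterance_data: bytes,
    mode: OutputMode,
    display: Callable[[str], None],
    speak: Callable[[bytes], None],
) -> None:
    """Output the text, the voice, or both, according to the chosen mode."""
    if mode in (OutputMode.TEXT, OutputMode.BOTH):
        display(body)
    if mode in (OutputMode.VOICE, OutputMode.BOTH):
        speak(utterance_data)
```

The alternative mentioned in the paragraph, where the two sides negotiate and only the wanted part is transmitted, would move this decision to the sender and save bandwidth at the cost of an extra exchange.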

[0029] Next, a second embodiment of the present invention will be described in detail with reference to the drawings.

[0030] Referring to FIG. 2, the second embodiment of the present invention comprises a mail transmission unit 21, a mail reception unit 22, a display device 23, and a voice output device 24. The mail transmission unit 21 includes the input mail text 211, utterance data 213 for synthesizing speech from the mail text 211, and a voice conversion unit 212 that extracts the utterance data 213 for synthesizing speech from the mail text 211. The mail reception unit 22 includes the received utterance data 214, a speech synthesis unit 216 that performs speech synthesis using the utterance data 214, and a speech recognition unit 215 that performs speech recognition on the voice output from the speech synthesis unit 216 and outputs it as a document to the display device 23. The display device 23 displays the text from the speech recognition unit 215, and the voice output device 24 outputs the result synthesized by the speech synthesis unit 216 as voice.

[0031] The voice conversion unit 212 and the speech synthesis unit 216 are the same as those described in the first embodiment. The speech recognition unit 215 outputs recognized text from a speech waveform; for details, see, for example, "Speech Recognition by Markov Models", Journal of the IEICE, 4/87, Vol. 70, No. 4.

[0032] The second embodiment differs from the first as follows. In the first embodiment, as shown in FIG. 1, the mail transmission unit 11 sends both the mail text 113 and the utterance data 114, and the mail reception unit 12 displays the mail text 122 on the display device 13 and outputs voice from the voice output device 14. In the second embodiment, as shown in FIG. 2, the mail transmission unit 21 transmits only the utterance data 213 of the mail text 211 extracted by the voice conversion unit 212. The mail reception unit 22 performs speech synthesis in the speech synthesis unit 216 using the received utterance data 214 and outputs voice from the voice output device 24, while the speech recognition unit 215 performs speech recognition on the voice output from the speech synthesis unit 216 and outputs it as a document to the display device 23. The second embodiment can therefore reduce the amount of transferred data compared with the first embodiment, but it requires a speech recognition unit in every mail reception unit.
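The receiving side of the second embodiment can be summarized as a two-stage pipeline: synthesize a waveform from the utterance data, then recognize text from that same waveform. In the sketch below the synthesizer and recognizer are injected as callables, because the patent only points to the literature for both; the type alias and function name are hypothetical.

```python
from typing import Callable, List, Tuple

Waveform = List[float]  # assumed in-memory representation of synthesized speech


def receive_voice_only(
    utterance_data: bytes,
    synthesize: Callable[[bytes], Waveform],
    recognize: Callable[[Waveform], str],
) -> Tuple[Waveform, str]:
    """Return (waveform for the voice output device, text for the display)."""
    waveform = synthesize(utterance_data)  # role of speech synthesis unit 216
    text = recognize(waveform)             # role of speech recognition unit 215
    return waveform, text
```

Because the displayed text is reconstructed by recognition rather than transmitted, its accuracy depends on the recognizer, which is the price paid for sending less data.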

[0033] In the second embodiment as well, it goes without saying that, according to the mail recipient's wishes, the mail reception unit 22 may output only one of the two: the display on the display device 23 of the document recognized by the speech recognition unit 215, or the voice output from the voice output device 24.

[0034] Next, FIG. 3 is a configuration diagram showing a third embodiment of the present invention, in which the mail transfer network of the first embodiment is replaced by the Internet.

[0035] Referring to FIG. 3, the third embodiment of the present invention comprises an HTML document creation device 31, a WEB server 32, a browser 33, a display device 34, and a voice output device 35. The HTML document creation device 31 includes an HTML document 312, which consists of a mail text 313 and utterance data 314 for synthesizing speech from the mail text 313, and a voice conversion unit 311 that extracts the utterance data 314 for synthesizing speech from the mail text 313. The WEB server 32 holds an HTML document 321, which consists of the text 322 and utterance data 323 saved from the HTML document creation device 31. The browser 33 includes an HTML document 331, which consists of the text 332 and utterance data 333 received from the WEB server 32, and a speech synthesis unit 334 that performs speech synthesis using the utterance data 333. The display device 34 displays the text 332, and the voice output device 35 outputs the result synthesized by the speech synthesis unit 334 as voice.

[0036] The main difference between the third embodiment and the first embodiment is that the mail text 313 and the utterance data 314 in the HTML document creation device 31 are created as an HTML document and loaded into the browser 33 via the WEB server 32. The other operations are the same as in the first embodiment.
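How the utterance data sits inside the HTML document is not specified in the patent. As one possible arrangement, the sketch below stores it Base64-encoded in a `data-utterance` attribute on the paragraph that holds the mail text; the attribute name, the `mail-body` id, and both function names are assumptions made for this example.

```python
import base64
import html
from html.parser import HTMLParser
from typing import List, Tuple


def build_html_document(body: str, utterance_data: bytes) -> str:
    """Creation side: embed the mail text and Base64 utterance data in HTML."""
    encoded = base64.b64encode(utterance_data).decode("ascii")
    return (
        "<!DOCTYPE html>\n<html><body>\n"
        f'<p id="mail-body" data-utterance="{encoded}">{html.escape(body)}</p>\n'
        "</body></html>\n"
    )


class _MailExtractor(HTMLParser):
    """Collects the mail text and utterance data from the assumed markup."""

    def __init__(self) -> None:
        super().__init__(convert_charrefs=True)
        self.body_parts: List[str] = []
        self.utterance_data = b""
        self._in_body = False

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "p" and attr.get("id") == "mail-body":
            self._in_body = True
            self.utterance_data = base64.b64decode(attr.get("data-utterance") or "")

    def handle_endtag(self, tag):
        if tag == "p":
            self._in_body = False

    def handle_data(self, data):
        if self._in_body:
            self.body_parts.append(data)


def read_html_document(document: str) -> Tuple[str, bytes]:
    """Browser side: recover the text to display and the data to synthesize."""
    parser = _MailExtractor()
    parser.feed(document)
    parser.close()
    return "".join(parser.body_parts), parser.utterance_data
```

A browser that does not understand the extra attribute would still show the mail text, so a document built this way degrades to a plain page.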

[0037] As with the third embodiment of the present invention, it goes without saying that the mail transfer network in the second embodiment can likewise be replaced by the Internet.

[0038]

[Effects of the Invention] As described above, in the present invention the mail transmission unit creates utterance data from the mail text based on the data sender's phonological and prosodic rules and transmits it, and the mail reception unit uses the received utterance data to synthesize speech by rule-based synthesis and output it as voice. This makes it possible to read the mail text aloud in a voice close to that of the data sender, with the effect that the data sender is easier to identify.

[Brief Description of the Drawings]

FIG. 1 is a block diagram showing a first embodiment of the present invention.

FIG. 2 is a block diagram showing a second embodiment of the present invention.

FIG. 3 is a block diagram showing a third embodiment of the present invention.

[Explanation of Symbols]

11, 21 Mail transmission unit
12, 22 Mail reception unit
13, 23 Display device
14, 24 Voice output device
111, 212 Voice conversion unit
112 Mail document
113, 122, 211 Text
114, 123 Utterance data
124, 216 Speech synthesis unit
213, 214 Utterance data
215 Speech recognition unit
31 HTML document creation device
32 WEB server
33 Browser
34 Display device
35 Voice output device
311 Voice conversion unit
312, 321, 331 HTML document
313, 322, 332 Text
314, 323, 333 Utterance data
334 Speech synthesis unit

Continuation of front page
(51) Int. Cl.⁷: H04L 12/58
F-terms (reference): 5B089 GA11 GA21 GB03 GB04 HA10 HB02 HB05 JA31 JB02 JB22 LA13 LB13; 5D045 AA07 AB26 CB03; 5K030 GA18 HA06 HB01 HB02 JT01 KA20

Claims

[Claims]

[Claim 1] An e-mail system with voice notification, characterized by comprising: mail transmission means for transmitting a mail document consisting of an input mail text and utterance data for synthesizing speech from the mail text; and mail reception means that, on receiving the mail document, performs speech synthesis using the utterance data in the mail document, drives a voice output device provided in advance with the speech synthesis result, and displays the text in the mail document on a display device provided in advance.

[Claim 2] The e-mail system with voice notification according to claim 1, characterized in that the mail transmission means includes a voice conversion unit that outputs, as the utterance data, parameters for synthesizing speech from the mail text by rule-based synthesis, and the mail reception means includes a speech synthesis unit that takes the parameters as input and synthesizes speech.

[Claim 3] The e-mail system with voice notification according to claim 2, characterized in that the parameters include LSP parameters input to an LSP speech synthesis digital filter of the speech synthesis unit and a pitch pattern input to a sound source generation unit of the speech synthesis unit.

[Claim 4] An e-mail system with voice notification, characterized by comprising: mail transmission means for transmitting utterance data for synthesizing speech from an input mail text; and mail reception means that performs speech synthesis using the received utterance data, drives a voice output device provided in advance with the speech synthesis result, and also performs speech recognition on the speech synthesis result and outputs it as a document to a display device provided in advance.

[Claim 5] The e-mail system with voice notification according to claim 4, characterized in that the mail transmission means includes a voice conversion unit that outputs, as the utterance data, parameters for synthesizing speech from the mail text by rule-based synthesis, and the mail reception means includes a speech synthesis unit that takes the parameters as input and synthesizes speech.

[Claim 6] The e-mail system with voice notification according to claim 5, characterized in that the parameters include LSP parameters input to an LSP speech synthesis digital filter of the speech synthesis unit and a pitch pattern input to a sound source generation unit of the speech synthesis unit.

[Claim 7] An e-mail system with voice notification, characterized by comprising: HTML document creation means for transmitting an HTML document consisting of an input mail text and utterance data for synthesizing speech from the mail text; a WEB server that stores the HTML document; and a browser that performs speech synthesis using the utterance data in the HTML document received from the WEB server, drives a voice output device provided in advance with the speech synthesis result, and displays the mail text in the HTML document on a display device provided in advance.

[Claim 8] The e-mail system with voice notification according to claim 7, characterized in that the HTML document creation means includes a voice conversion unit that outputs, as the utterance data, parameters for synthesizing speech from the mail text by rule-based synthesis, and the browser includes a speech synthesis unit that takes the parameters as input and synthesizes speech.

[Claim 9] The e-mail system with voice notification according to claim 8, characterized in that the parameters include LSP parameters input to an LSP speech synthesis digital filter of the speech synthesis unit and a pitch pattern input to a sound source generation unit of the speech synthesis unit.

[Claim 10] An e-mail system with voice notification, characterized by comprising: HTML document creation means for transmitting, as an HTML document, utterance data for synthesizing speech from an input mail text; a WEB server that stores the HTML document; and a browser that performs speech synthesis using the utterance data in the HTML document received from the WEB server, drives a voice output device provided in advance with the speech synthesis result, and also performs speech recognition on the speech synthesis result and outputs it as a document to a display device provided in advance.

[Claim 11] The e-mail system with voice notification according to claim 10, characterized in that the HTML document creation means includes a voice conversion unit that outputs, as the utterance data, parameters for synthesizing speech from the mail text by rule-based synthesis, and the browser includes a speech synthesis unit that takes the parameters as input and synthesizes speech.

[Claim 12] The e-mail system with voice notification according to claim 11, characterized in that the parameters include LSP parameters input to an LSP speech synthesis digital filter of the speech synthesis unit and a pitch pattern input to a sound source generation unit of the speech synthesis unit.