JP4589910B2

JP4589910B2 - Conversation recording blogging device

Info

Publication number: JP4589910B2
Application number: JP2006334557A
Authority: JP
Inventors: 祐宮崎
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2006-12-12
Filing date: 2006-12-12
Publication date: 2010-12-01
Anticipated expiration: 2026-12-12
Also published as: JP2008146461A

Description

The present invention relates to a conversation-record blogging apparatus. More specifically, it relates to a support apparatus and method for creating a blog using a mobile phone, a portable recorder, or the like.

Recently, simple home pages called blogs (short for "weblog") have been spreading rapidly on the Internet. A blog is not merely a personal diary-style home page. Many blogs express analysis and opinions on current news and specialized topics, grounded in the author's own expertise and standpoint, or debate with the authors of other sites. Blogs are playing a growing role as communication tools, knowledge-management tools, and advertising media.

Blogs have the following features: (1) each article has a timestamp; (2) articles are arranged in chronological order, so the latest article always appears at the top; (3) each article can be linked to individually; and (4) mutual communication is possible through links and comments. On the basis of these features, the number of blog authors and readers is increasing rapidly.

To create a blog, the usual method is to type the text of a blog article on a PC, mobile phone, or the like, and upload it together with images such as photographs. However, typing and uploading text every day, each time something happens, is quite tedious for a blog author. For this reason, tools for creating and updating a daily blog as easily as possible have appeared. For example, Patent Document 1 discloses a mobile communication device with which a user can easily create a blog: when the user performs a specific action, the device acquires its own position information, stores event information for the user based on the date and time of the action and the position information, and later transmits the event information selected by the user to a server.
JP 2006-301884 A

However, with a method such as that of Patent Document 1, the user must designate specific actions as events in advance (for example, using electronic money, an electronic commuter pass, or an electronic ticket, or taking a picture with the camera), and events other than the designated actions are not recorded. In addition, a large number of these events must be stored and accumulated, and the user has the bother of specifying ON/OFF, for each item of each event, whether a comment to be uploaded to the blog should be generated automatically.

In view of the above problems, an object of the present invention is to provide a new apparatus and method for more easily obtaining information that facilitates blog creation, using a mobile terminal device equipped with conversation recording means, such as a mobile phone or a portable recorder (an IC recorder or the like).

The present invention provides the following solutions.

(1) An apparatus for creating the key points of a conversation record using a voice conversation record, comprising:
a conversation record input unit that accepts, as input, a conversation record stored in a user's mobile terminal;
a conversation separation unit that separates the voice data of the conversation record into the utterances of the user and the utterances of another person conversing with the user;
a speech-to-text conversion unit that converts the separated voice data into text data;
a key word extraction unit that extracts, from the text data, key words representing the features of the conversation; and
a display unit that displays the extracted key words to the user.

With this configuration, the apparatus accepts as input a conversation record (a call record or a recorded conversation) stored in the user's mobile terminal (for example, a mobile terminal with a voice recording unit, such as a mobile phone or an IC recorder), and separates out only the voice data (utterance) portions of the record. Speech-to-text conversion is applied to the separated voice data, and "key words" (key point keywords) representing the features of the recorded conversation are extracted from the converted text data. Even if the speech-to-text conversion is imperfect, it is still possible to extract just the key words that appear in the conversation with high frequency (in practice, weighted frequency). As a result, the key words contained in the conversation records stored as audio on the mobile terminal can be extracted from the user's daily records and displayed to the user as text in chronological order. Such key word text can later be used for various purposes.

(2) An apparatus for supporting the creation of a blog using a voice conversation record, comprising:
a conversation record input unit that accepts, as input, a conversation record stored in a user's mobile terminal;
a conversation separation unit that separates the voice data of the conversation record into the utterances of the user and the utterances of another person conversing with the user;
a speech-to-text conversion unit that converts the separated voice data into text data;
a key word extraction unit that extracts, from the text data, key words representing the features of the conversation;
a display unit that displays the extracted key words to the user;
a blog data editing unit that lets the user edit, based on the key words, a comment to be posted on a blog; and
a blog data transmission unit that transmits blog data including the comment to a server.

With this configuration, the apparatus accepts as input a conversation record (a call record or a recorded conversation) stored in the user's mobile terminal (for example, a mobile terminal with a voice recording unit, such as a mobile phone or an IC recorder), and separates out only the voice data (utterance) portions of the record. Speech-to-text conversion is applied to the separated voice data, and "key words" (key point keywords) representing the features of the recorded conversation are extracted from the converted text data. The extracted key words are presented to the user and serve as source data for writing a comment to be posted on the blog. The apparatus provides means for the user to edit this source data, and uploads the result as a blog article (blog comment) to the Web server of the blog site.

As a result, the key words contained in the conversation records stored on the mobile terminal can be extracted from the user's daily records and displayed to the user. In some cases, the user can also listen to the conversation record, recall the events of the day, and reflect them in the blog article. Because the conversation records are stored in chronological order, the timestamp information for each article, which is a characteristic feature of blogs, can also be obtained easily. A further feature is that information from the other party to the conversation (the person called or the person interviewed) can be extracted at the same time, in addition to the user's own thoughts and impressions.

(3) The apparatus according to (1) or (2), wherein the conversation record is a call record made using a mobile phone.

With this configuration, if a mobile phone, which is in widespread use today, is provided with a call recording function, the call record can be used as the source of the conversation record to realize the above functions.

(4) The apparatus according to (1) or (2), wherein the conversation record is conversation recording data recorded on the mobile terminal.

With this configuration, the functions of (1) and (2) can be realized based not on a mobile phone call record but on a recording the user made intentionally, for example a recording of an interview or meeting about a particular event. Of course, if the mobile phone has a recording function in addition to call recording, both the call record and the recording can be obtained with a single mobile phone.

(5) The apparatus according to any one of (1) to (4), wherein the key words are extracted by calculating TFIDF values from the text data.

With this configuration, key words representing the features of the conversation can be extracted from the speech-to-text converted call record or recording using the TFIDF method. TFIDF is described later.

(6) The apparatus according to (5), wherein the TFIDF values are calculated for the text data of the user's utterances and the text data of the other party's utterances separately, or for both sets of data as a whole.

With this configuration, the fact that the conversation record contains both the user's own utterances and those of the other party (the person called or interviewed) makes it possible to extract key words from two different perspectives. For example, key words can be extracted from the user's own utterances only, from the other party's utterances only, or from both together, which widens the choice of information to use.
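The choice of source text described above can be sketched as follows. This is only an illustration: it assumes the separated utterances are available as `(speaker, tokens)` pairs, and the function name `tokens_for` and the labels `"user"`, `"other"`, and `"both"` are hypothetical, not taken from the patent.

```python
def tokens_for(utterances, mode):
    """Collect tokens from the user's speech, the other party's speech, or both.

    utterances: list of (speaker, tokens) pairs, where speaker is the
    hypothetical label "user" or "other".
    mode: "user", "other", or "both".
    """
    if mode not in ("user", "other", "both"):
        raise ValueError("mode must be 'user', 'other', or 'both'")
    selected = []
    for speaker, tokens in utterances:
        # Keep the utterance if every speaker is wanted or the label matches.
        if mode == "both" or speaker == mode:
            selected.extend(tokens)
    return selected
```

The returned token list can then be passed to whichever key word extraction method is in use, so the same record yields different key words depending on the selected perspective.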

(7) The apparatus according to (2), wherein the blog data editing unit has a function of editing the display format of the blog, and transmits the blog data to the server in response to an instruction from the user.

With this configuration, the apparatus can edit not only the blog content but also the display format of the blog itself (for example, HTML or XML), so that the content to be uploaded can be previewed on the apparatus and uploaded in the form in which it will appear on the blog.

(8) The apparatus according to any one of (1) to (7), wherein the mobile terminal comprises means for capturing an image of an object.

With this configuration, for example by providing the mobile terminal with a digital camera, images to be uploaded together with the blog article can be obtained easily.

(9) The apparatus according to any one of (1) to (8), wherein the mobile terminal comprises means for acquiring its own position information, and records the position information of the place where the conversation record was made in association with the conversation record.

With this configuration, the mobile terminal is provided with position information acquisition means such as GPS, so that the location where a photograph was taken, together with the date and time it was taken, can easily be used in the blog.

(10) An apparatus for supporting the creation of a blog using a voice utterance record, comprising:
an utterance record input unit that accepts, as input, an utterance record stored in a user's mobile terminal;
an utterance separation unit that separates the voice data of the utterance record into the utterances of the user and the remaining portions;
a speech-to-text conversion unit that converts the separated voice data into text data;
a key word extraction unit that extracts, from the text data, key words representing the features of the utterances;
a display unit that displays the extracted key words to the user;
a blog data editing unit that lets the user edit, based on the key words, a comment to be posted on a blog; and
a blog data transmission unit that transmits blog data including the comment to a server.

Even when there is no call partner or interview partner, the same functions as in (2) can be realized using the user's own solo dictation (self-recording). This can be thought of as a voice memo for creating a blog.

(11) A method for creating the key points of a conversation record using a voice conversation record, the method comprising, in a computer system, the steps of:
accepting, as input, a conversation record stored in a user's mobile terminal;
separating the voice data of the conversation record into the utterances of the user and the utterances of another person conversing with the user;
converting the separated voice data into text data;
extracting, from the text data, key words representing the features of the conversation; and
displaying the extracted key words to the user.

With this configuration, the functions of the apparatus of (1) are treated as a method invention in a computer system, and the same effects as in (1) can be obtained.

Thus, according to the present invention, key words and other information that facilitate blog creation can be obtained easily using a mobile terminal device equipped with a conversation recording unit, such as a mobile phone or a portable recording device.

Embodiments of the present invention are described below with reference to the drawings.

FIG. 1 shows the functional blocks of a conversation recording apparatus 10 for blog creation according to one embodiment of the present invention. For convenience of explanation, the figure shows not only the conversation recording apparatus 10 but also the mobile phone 30, the IC recorder 31, and the Web server 20 connected to it. The conversation recording apparatus 10 mainly comprises: a conversation record input unit 11 that accepts as input a conversation record recorded on a recording device such as the mobile phone 30 or the IC recorder 31; a conversation separation unit 12 that separates the input conversation record into the user's own speech, the other party's speech, and silent portions; a speech-to-text conversion unit 13 that converts the separated speech into text; a key word extraction unit 14 that extracts key words representing the features of the converted text; a blog data editing unit 15 that lets the user edit blog data based on the key words; and a blog data transmission unit 16 that transmits the edited blog data to the Web server 20, which is the blog site. The conversation recording apparatus 10 also has an ordinary audio speaker 17, a display unit 18 for the user (a liquid crystal display or the like), an operation unit 19 (keyboard, mouse, etc.), and, if necessary, a microphone for voice input (not shown).

The connection to the mobile phone 30 or the IC recorder 31 may use a general-purpose interface such as USB, or data may be exchanged by inserting and removing portable memory such as an SD card or a SMART card.

The conversation recording apparatus 10 is connected to the Web server 20 via a network 40 (typically the Internet, although it may be a LAN (Local Area Network) or a WAN (Wide Area Network)). The Web server 20 is a server that stores the contents of the blog, and comprises a blog data receiving unit 21 that receives data from the conversation recording apparatus 10, a blog recording unit 22 that edits the data into Web page format, and a Web page DB 23 that stores the data.

These configurations are only one example. Other components may be included, or may replace the original components, as long as they are functionally equivalent.

FIG. 2 shows the processing flow of the conversation recording apparatus 10 according to one embodiment of the present invention. The description here takes the mobile phone 30 as the input source, but the basic flow is the same for other recording devices such as the IC recorder 31.

First, in step S1, a call record is acquired from the mobile phone 30. As described above, the data may be acquired by connecting the conversation recording apparatus 10 and the mobile phone 30 with USB or a dedicated cable, or by using an SD memory card or the like.

Next, in step S2, the record is separated into the utterances of the user (the blog author), the utterances of the other party to the call, and the silent portions. Any known technique may be used for the separation itself. The conversation is separated into the user's part and the other party's part for two reasons: to reduce the misrecognition rate of speech recognition (the next step, S3), which rises when speakers with different pronunciation are mixed, and to make it easier to analyze differences in word usage that stem from differences between the user's and the other party's patterns of thought. Silent portions are removed to shorten the time needed when the recording is played back later.
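The silence-removal part of this step could look roughly like the following sketch. It is not the method of the patent, which leaves the technique open. It assumes the audio is already available as a list of numeric samples and marks a frame as silent when its mean absolute amplitude falls below a threshold. The name `voiced_ranges` and both parameters are hypothetical, and separating the two speakers is outside the scope of the sketch.

```python
def voiced_ranges(samples, frame_size, threshold):
    """Return (start, end) sample-index ranges of the non-silent runs.

    A frame counts as voiced when its mean absolute amplitude is at
    least `threshold`; consecutive voiced frames are merged into one run.
    """
    ranges = []
    run_start = None
    for start in range(0, len(samples), frame_size):
        frame = samples[start:start + frame_size]
        level = sum(abs(s) for s in frame) / len(frame)
        if level >= threshold:
            if run_start is None:
                run_start = start          # a voiced run begins here
        elif run_start is not None:
            ranges.append((run_start, start))  # the run ended at this frame
            run_start = None
    if run_start is not None:
        ranges.append((run_start, len(samples)))
    return ranges
```

Keeping only the returned ranges shortens later playback, which is the purpose the text gives for removing silence.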

Next, in step S3, the separated voice data is converted into text. This conversion, that is, speech recognition, may use a known technique (see, for example, Japanese Patent Laid-Open No. Hei-230295). Considering that there are multiple speakers, a technique that can switch among multiple voice patterns and dictionaries is preferable.

Next, in step S4, the converted text is segmented into morphemes. A morpheme is the smallest meaningful unit of language, and is the unit used when segmenting sentences written in natural language. For example, the sentence 今日はいい天気です ("The weather is nice today") is segmented into 今日／は／いい／天気／です.
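To make the segmentation step concrete, here is a toy sketch that splits the example sentence by greedy longest match against a tiny word list. This is an assumption made for illustration only: a practical system would use a full morphological analyzer with a real dictionary, and the names `segment` and `LEXICON` are hypothetical.

```python
def segment(text, lexicon):
    """Split text into tokens by greedy longest match against a word list."""
    max_len = max(len(word) for word in lexicon)
    tokens = []
    pos = 0
    while pos < len(text):
        # Try the longest candidate first, then progressively shorter ones.
        for length in range(min(max_len, len(text) - pos), 0, -1):
            candidate = text[pos:pos + length]
            if candidate in lexicon:
                tokens.append(candidate)
                pos += length
                break
        else:
            # Unknown character: emit it as a single-character token.
            tokens.append(text[pos])
            pos += 1
    return tokens


# Toy lexicon covering only the example sentence from the text.
LEXICON = {"今日", "は", "いい", "天気", "です"}
print("／".join(segment("今日はいい天気です", LEXICON)))
```

Greedy matching is easy to follow but picks wrong boundaries on ambiguous input, which is one reason dictionary-based analyzers score whole segmentations instead.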

Then, in step S5, the morpheme-segmented text as a whole (the entire call) is treated as a document set, a suitable unit of the call (for example, a fixed period of time) is treated as a document, and the key words (keywords) that characterize each document are extracted. Key words are selected not simply by frequency of occurrence in the text as a whole, but as words that appear disproportionately in a particular document. Specifically, for example, TFIDF values are calculated, and words whose value exceeds a predetermined value are extracted as key words. The TFIDF method weights keywords based on the frequency with which words occur. TFIDF consists of TF and IDF: TF is Term Frequency, and IDF is Inverse Document Frequency (the reciprocal of document frequency). TFIDF is the product of TF and IDF. The following examples are described mainly in terms of TFIDF, but many other methods of extracting key words are known in text mining (for example, document clustering), and the use of such known techniques is not excluded.
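The calculation in step S5 can be sketched as below. The text only states that TFIDF is the product of TF and IDF, so this sketch assumes one common variant: TF is the raw count of a word in a document, and IDF is log(N / df), where N is the number of documents and df is the number of documents containing the word. The function names `tfidf_scores` and `key_words` and the threshold value are hypothetical.

```python
import math
from collections import Counter


def tfidf_scores(documents):
    """Return one {word: score} dict per document.

    documents: list of token lists (already segmented into morphemes).
    Assumed variant: score = raw term count * log(N / document frequency).
    """
    n_docs = len(documents)
    doc_freq = Counter()
    for tokens in documents:
        doc_freq.update(set(tokens))  # count each word once per document
    scores = []
    for tokens in documents:
        term_freq = Counter(tokens)
        scores.append({
            word: count * math.log(n_docs / doc_freq[word])
            for word, count in term_freq.items()
        })
    return scores


def key_words(documents, threshold):
    """For each document, list the words scoring above threshold, best first."""
    result = []
    for doc_scores in tfidf_scores(documents):
        # Sort by descending score; ties are broken alphabetically.
        ranked = sorted(doc_scores.items(), key=lambda item: (-item[1], item[0]))
        result.append([word for word, score in ranked if score > threshold])
    return result
```

With this variant, a word that occurs in every document gets an IDF of zero and so never becomes a key word, which matches the stated aim of picking words that appear disproportionately in a particular document.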

Then, in step S6, the processing of steps S2 to S5 is repeated for all recorded call records. In this way, the recorded call records are converted into text. If the accuracy of the speech-to-text conversion is low, the user may be asked to check the recognized text, and correct it if necessary, before the processing of step S4 is performed.

Next, in step S7, the key words extracted for each call record are displayed to the user. At this time, the key words may be displayed as candidates for the title of the blog article, and the time of the event may be displayed from the timestamp of the call record. Photographs taken with the camera and position information obtained from GPS may also be displayed. A blog editing screen is displayed on which the user creates a blog article based on these various kinds of information. The text-converted call record may be displayed at the same time and used as a draft of the blog article. In this way, the conversation recording apparatus 10 of the present invention provides various kinds of information for creating a blog article.

Finally, in step S8, the edited blog data is transmitted (uploaded) to the Web server in response to the user's instruction. The uploaded blog data is processed into a format that is easy to browse on the blog site (HTML conversion or the like) and stored in the Web page DB 23.
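The HTML conversion mentioned in this step might be sketched as follows. The patent does not specify the markup, so the element layout and the function name `render_entry` are assumptions made only for illustration. The one point the sketch is meant to show is that user-edited text should be escaped before it is embedded in a page.

```python
from html import escape


def render_entry(title, timestamp, comment, key_words):
    """Render one blog entry as an HTML fragment (hypothetical layout)."""
    # Escape every user-supplied string so that characters such as
    # '<' and '&' are shown literally instead of being parsed as markup.
    items = "".join(f"<li>{escape(word)}</li>" for word in key_words)
    return (
        "<article>"
        f"<h2>{escape(title)}</h2>"
        f"<time>{escape(timestamp)}</time>"
        f"<p>{escape(comment)}</p>"
        f"<ul>{items}</ul>"
        "</article>"
    )
```

The resulting string is what a server-side component could store for later display.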

FIG. 3 shows an example of a call record on the mobile phone 30 as a conversation record. The call record in the figure is the content of a mobile phone call between the user (blog author) A and A's friend B. Here, the conversation becomes lively over a cake shop that A happened to drop into, and it is common to upload such a memorable event later as a blog article. Because the call record includes a timestamp, the records can be edited in chronological order, which is very convenient for blog creation. Many of today's mobile phones also have a camera and GPS (Global Positioning System), and images taken with the camera on the spot, along with the time and position, are useful information to upload to a blog. For example, if the user posts impressions and a photograph of a cake he or she happened to eat, along with the shop's location (a map), it is quite possible that many people who see the post will visit the shop.

FIG. 4 shows a table of key point data (a collection of key words) extracted from the call record shown in FIG. 3. The table stores the words (keywords) extracted from the call record together with their frequency of occurrence and TFIDF value, with words of higher TFIDF value ranked higher as key words that better represent the features of the call. The TFIDF values here are calculated with the entire call as the document set, but the call may instead be divided into the user's (A's) utterances and the friend's (B's) utterances, with TFIDF values shown for each. For example, if A and B use different terms for the same meaning, the key words obtained from A's utterances will naturally differ from those obtained from B's. Analyzing this difference shows how the terms used by A and B vary, and gives the blog author more choice when writing an article, such as whether or not to incorporate the other party's views. Also, instead of treating the entire call as a single document, suitable breaks (for example, where silence continues for a long time) may be detected, and each section between breaks treated as a separate document. Alternatively, the call may be divided at fixed intervals, for example every 3 minutes, with each interval treated as one document. Since various document units can be defined in this way even for the same call record, TFIDF values can be calculated for each unit and used in analyzing the key words.
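The two ways of defining document units can be sketched as follows, assuming the recognized utterances carry start and end times in seconds. The tuple layouts, the function names `split_by_window` and `split_by_gap`, and the gap length are hypothetical. The 180-second default corresponds to the 3-minute example in the text.

```python
def split_by_window(utterances, window_seconds=180):
    """Group tokens into documents by fixed time windows.

    utterances: list of (start_seconds, tokens) pairs.
    Windows that contain no utterance produce no document.
    """
    documents = {}
    for start, tokens in utterances:
        index = int(start // window_seconds)
        documents.setdefault(index, []).extend(tokens)
    return [documents[i] for i in sorted(documents)]


def split_by_gap(utterances, min_gap_seconds):
    """Start a new document whenever the silence between utterances is long.

    utterances: list of (start_seconds, end_seconds, tokens), sorted by start.
    """
    documents = []
    previous_end = None
    for start, end, tokens in utterances:
        if previous_end is None or start - previous_end >= min_gap_seconds:
            documents.append([])  # a long pause opens a new document
        documents[-1].extend(tokens)
        previous_end = end
    return documents
```

Either function returns a list of token lists, so the same call record can be scored under several document definitions and the resulting key words compared.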

FIG. 5 shows a recording (an interview) made on a recorder such as the IC recorder 31, as a conversation record. This example records the conversation when, after the call on the mobile phone 30 in FIG. 3, A and B actually visited the shop and interviewed its manager. Unlike a call on the mobile phone 30, an interview is likely to be conducted on the assumption that it will be posted on the blog, so there is a good chance of obtaining a record that readily becomes a blog article. The conversation record obtained here is input to the conversation recording apparatus 10 in the same way as the call record of the mobile phone 30, and is converted from speech to text to become the source data for the blog article. Such an interview need not be recorded on a dedicated recorder such as the IC recorder 31; the mobile phone 30 can be used instead if it has a recording function. Even without another party, the user can dictate an article and use the dictation record as input to the conversation recording apparatus 10.

FIG. 6 shows a table of key point data (a collection of key words) extracted from the recording of FIG. 5. As in the table of FIG. 4, the table stores the words (keywords) extracted from the recording together with their frequency of occurrence and TFIDF value, with words of higher TFIDF value ranked higher as key words that better represent the features of the recording. The TFIDF values are calculated in the same way. However, when the content of the call and the content of the recording are related, as in this example, the call record can be treated as document 1 and the recording as document 2, TFIDF values calculated over both documents as a whole, and key words extracted based on those values. Of course, TFIDF values may also be calculated separately for document 1 and document 2, and key words extracted based on those values. This makes it possible to extract key words from different angles. For example, when the other party to the conversation says nothing useful and merely gives back-channel responses, the other party's utterances can be excluded from extraction.

FIG. 7 shows an example of the blog article editing screen of the conversation recording device 10 according to one embodiment of the present invention. The case of a mobile phone call record is described here. The target call record is selected from the call record selection bar indicated by reference numeral 50. More than one call record may be selected. Area 51 displays information related to the selected call, such as the call start time, the other party, and the call duration. Area 52 displays the key words (key-point keywords) extracted by the method described above. They may be displayed in descending order of TFIDF value. Area 53 displays the speech recognition result of the call record converted to text by the speech-to-text conversion unit 13. Underlined portions of this text indicate parts that were misrecognized or could not be recognized. The user does not have to correct these incorrectly recognized words. Area 54 (blog article heading section) displays, for example, the title of the blog article created from the call-related information in area 51 and the key-point keywords in area 52. Of course, the user may edit this title. Area 55 displays the extracted key words as the main points of the blog article's content (comment). Of course, the user may edit this part based on these key words or, in some cases, create it by directly editing the call record in area 53.
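As one way to picture how the title in area 54 could be derived from the call-related information in area 51 and the key words in area 52, the following Python sketch builds an editable default title. The description does not specify the title format, so the `CallInfo` fields, the `draft_title` function, and the output layout are all hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class CallInfo:
    """Call-related information as shown in area 51 (field names are assumed)."""
    start: datetime   # call start time
    partner: str      # other party to the call
    minutes: int      # call duration in minutes


def draft_title(info, keywords, max_keywords=3):
    """Build a default title that the user can still edit afterwards."""
    # Use the highest-ranked key words; fall back to a placeholder if none exist.
    head = " / ".join(keywords[:max_keywords]) or "Untitled"
    return f"{info.start:%Y-%m-%d} call with {info.partner}: {head}"


info = CallInfo(start=datetime(2006, 12, 12, 10, 30), partner="B", minutes=12)
title = draft_title(info, ["ramen", "broth", "pork", "shop"])
```

Because the key words are assumed to arrive already sorted by TFIDF value, truncating the list keeps only the most characteristic ones in the title.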

Areas 56 to 59 are for editing images, such as photos, to be uploaded to the blog at the same time. A photo file is selected from the photo selection bar in area 56, and area 57 (photo accompanying information) displays the time and place of shooting. When a GPS-equipped mobile phone is used, the location information of the place may be acquired. Area 58 contains tool buttons and the like for editing the photo, and the photo to be uploaded is displayed in area 59. Although not shown, the photo before editing and the photo after editing may be displayed side by side.

The text of the blog article created in this way and images such as photos are posted to the chosen blog site when the user selects the upload destination blog in area 60 (blog selection menu) and presses the upload button 62. In addition, by providing a function for editing the display format of the blog (for example, HTML or XML), a preview of how the article will appear may be displayed before uploading by pressing the preview button 61.
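The preview step can be pictured as rendering the edited title, comment, and optional photo into a display format such as HTML before anything is uploaded. The Python sketch below is only an illustration under that assumption: the markup, the `render_preview` function, and the file name are hypothetical, and the actual display format is left open by the description.

```python
from html import escape


def render_preview(title, comment, image_path=None):
    """Return a minimal HTML fragment for previewing a blog entry."""
    # Escape user-edited text so that characters such as < and & display literally.
    parts = [f"<h2>{escape(title)}</h2>", f"<p>{escape(comment)}</p>"]
    if image_path:
        parts.append(f'<img src="{escape(image_path, quote=True)}" alt="">')
    return "\n".join(parts)


preview = render_preview("Ramen & broth", "The manager explained the broth.", "shop.jpg")
```

Escaping matters here because the comment may come from speech recognition output or free-form user edits rather than from trusted markup.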

[Hardware configuration of the conversation recording device]
FIG. 8 shows an example of the hardware configuration of the conversation recording device 10 according to one embodiment of the present invention.

The conversation recording device 10 includes a CPU (Central Processing Unit) 110 constituting the control unit 101 (in a multiprocessor configuration, additional CPUs such as a CPU 120 may be added), a bus line 105, a communication I/F 140, a main memory 150, a BIOS (Basic Input Output System) 160, a USB port 190, an I/O controller 170, input means such as a keyboard and mouse 180, and a display device 122. A microphone and speaker 173 may also be provided as necessary. The mobile phone 30, the IC recorder 31, or the like may be connected to the USB port 190 so that call record data can be imported into the conversation recording device 10.

Storage means such as a tape drive 172, a hard disk 174, an optical disk drive 176, and a semiconductor memory 178 can be connected to the I/O controller 170.

The BIOS 160 stores a boot program executed by the CPU 110 when the conversation recording device 10 starts up, programs that depend on the hardware of the conversation recording device 10, and the like.

The hard disk 174 constituting the storage unit 102 stores the various programs that allow the conversation recording device 10 to function as a device and the programs that execute the functions of the present invention, and can also hold various databases as necessary.

As the optical disk drive 176, for example, a DVD-ROM drive, a CD-ROM drive, a DVD-RAM drive, or a CD-RAM drive can be used. In this case, an optical disk 177 compatible with the respective drive is used. A program or data can also be read from the optical disk 177 by the optical disk drive 176 and provided to the main memory 150 or the hard disk 174 via the I/O controller 170. Similarly, a tape medium 171 compatible with the tape drive 172 can be used, mainly for backup.

The program provided to the conversation recording device 10 is supplied stored on a recording medium such as the hard disk 174, the optical disk 177, or a memory card. The program may be installed on the conversation recording device 10 and executed after being read from the recording medium via the I/O controller 170 or downloaded via the communication I/F 140.

The program described above may be stored on an internal or external storage medium. In addition to the hard disk 174, the optical disk 177, or a memory card, a magneto-optical recording medium such as an MD or a tape medium can be used as the storage medium constituting the storage unit 102. A storage device such as a hard disk 174 or an optical disk library provided in a server system connected to a dedicated communication line or the Internet may also be used as the recording medium, with the program provided to the conversation recording device 10 via the communication line.

The display device 122 displays screens that accept data input from the user and screens showing the results of processing by the conversation recording device 10, and includes display devices such as a cathode ray tube (CRT) display and a liquid crystal display (LCD).

The input means accepts input from the user and may consist of the keyboard and mouse 180 or the like.

The communication I/F 140 is a network adapter that allows the conversation recording device 10 to connect to terminals via a dedicated network or a public network. The communication I/F 140 may include a modem, a cable modem, and an Ethernet (registered trademark) adapter.

The above example has mainly described the conversation recording device 10, but the functions described above can also be realized by installing a program on a computer and operating that computer as a server device. Accordingly, the functions realized by the server described as one embodiment of the present invention can also be realized by having the computer execute the above-described method, or by installing the above-described program on the computer and executing it.

The embodiments of the present invention have been described above, but the present invention is not limited to these embodiments. The effects described in the embodiments merely list the most preferable effects arising from the present invention, and the effects of the present invention are not limited to those described in the embodiments.

FIG. 1 shows the functional blocks of the conversation recording device 10 according to one embodiment of the present invention.
FIG. 2 shows the processing flow of the conversation recording device 10 according to one embodiment of the present invention.
FIG. 3 shows, as a conversation record, an example of a call record on a mobile phone.
FIG. 4 shows a table of key-point data (a collection of key words) extracted from the call record of FIG. 3.
FIG. 5 shows, as a conversation record, a recording (an interview) made on a recorder such as an IC recorder.
FIG. 6 shows a table of key-point data (a collection of key words) extracted from the recording of FIG. 5.
FIG. 7 shows an example of the blog article editing screen of the conversation recording device 10 according to one embodiment of the present invention.
FIG. 8 shows an example of the hardware configuration of the conversation recording device 10 according to one embodiment of the present invention.

Explanation of symbols

10 Conversation recording device
11 Conversation record input unit
12 Conversation separation unit
13 Speech-to-text conversion unit
14 Key word extraction unit
15 Blog data editing unit
16 Blog data transmission unit
17 Speaker
18 Display unit
19 Operation unit
20 Web server
21 Blog data receiving unit
22 Blog recording unit
23 Web page DB
30 Mobile phone
31 IC recorder
40 Network
50 Call record selection bar
51 Call-related information
52 Extracted key words
53 Call record (after text conversion)
54 Blog article heading section
55 Blog article content
56 Photo selection bar
57 Photo accompanying information
58 Photo editing tool buttons
59 Photo
60 Blog selection menu
61 Preview button
62 Upload button

Claims

A device for supporting the creation of a blog using a voice conversation record, comprising:
a speech-to-text conversion unit that converts voice conversation record data stored in a mobile terminal into text data by speech recognition;
a keyword extraction unit that divides the text data into morphemes, treats the entire conversation in a predetermined range as a document set and a predetermined conversation unit as a document, calculates TFIDF values, and extracts words as keywords such that a word with a higher calculated TFIDF value better represents the features of the conversation;
a display unit that displays the extracted keywords on the user's terminal together with a blog data editing screen for editing a comment to be posted on the blog; and
a blog data transmission unit that transmits blog data including the comment to a server.

The device according to claim 1, wherein the voice conversation record is a record of a call made using a mobile phone.

The device according to claim 2, further comprising a conversation separation unit that separates the voice data of the voice conversation record into the user's utterances and the utterances of another person conversing with the user, based on differences in pronunciation,
wherein the keyword extraction unit calculates the TFIDF values by treating, as a document set, the entire conversation in the predetermined range for each of the text data of the user's utterances and the text data of the other party's utterances.

The device according to claim 1, wherein the blog data editing screen has a function for editing the display format of the blog and performs transmission to the server in response to an instruction from the user.

The device according to any one of claims 1 to 4, wherein the mobile terminal includes means for capturing an image of an object.

The device according to any one of claims 1 to 5, wherein the mobile terminal includes means for acquiring its own location information and records the location information of the place where the voice conversation record was created in association with the voice conversation record.

A device for supporting the creation of a blog using a voice utterance record, comprising:
a speech-to-text conversion unit that converts voice utterance record data stored in a mobile terminal into text data by speech recognition;
a keyword extraction unit that divides the text data into morphemes, treats the entire utterance in a predetermined range as a document set and a predetermined utterance unit as a document, calculates TFIDF values, and extracts words as keywords such that a word with a higher calculated TFIDF value better represents the features of the utterance;
a display unit that displays the extracted keywords on the user's terminal together with a blog data editing screen for editing a comment to be posted on the blog; and
a blog data transmission unit that transmits blog data including the comment to a server.

A method for supporting the creation of a blog using a voice conversation record, the method comprising, in a computer system:
a step of converting voice conversation record data stored in a mobile terminal into text data by speech recognition;
a step of dividing the text data into morphemes, treating the entire conversation in a predetermined range as a document set and a predetermined conversation unit as a document, calculating TFIDF values, and extracting words as keywords such that a word with a higher calculated TFIDF value better represents the features of the conversation;
a display step of displaying the extracted keywords on the user's terminal together with a blog data editing screen for editing a comment to be posted on the blog; and
a blog data transmission step of transmitting blog data including the comment to a server.