JP2014048888A

JP2014048888A - Word importance degree calculation device, word importance degree calculation method, and computer program

Info

Publication number: JP2014048888A
Application number: JP2012191170A
Authority: JP
Inventors: Yoshimi Moriwaki; 香美森脇; Satoshi Izushi; 聡史出石; Yoichi Kawabuchi; 洋一河渕
Original assignee: Konica Minolta Inc
Current assignee: Konica Minolta Inc
Priority date: 2012-08-31
Filing date: 2012-08-31
Publication date: 2014-03-17
Anticipated expiration: 2032-08-31
Also published as: JP5949340B2

Abstract

PROBLEM TO BE SOLVED: To calculate, more accurately than before, the degree of importance to a user of each word included in a document. SOLUTION: A profile management server 1 includes: an interest degree data acquisition part 102 for acquiring profile data 6PD showing interest degrees, which are the degrees of a user's interest in a document at respective times while the document is being viewed; a change pattern determination part 106 for detecting a pattern of change in the interest degrees; and an importance degree adjustment part 107 for calculating, on the basis of the detected pattern, importance degrees, which are the degrees of importance to the user of the respective words included in the document.

Description

The present invention relates to an apparatus, a method, and the like for calculating the degree of importance of a word to a user.

Conventionally, words representing matters in which a user is interested are sometimes handled as the user's profile. In addition, for each word, an importance level indicating the degree of importance to the user is sometimes calculated and registered in a database or the like.

Entering words one by one is cumbersome. It is therefore conceivable to extract words from the documents that the user has viewed.

Furthermore, a technique has been proposed that detects biometric information while a user is browsing information and determines, based on the biometric information, how useful the information is to the user (Patent Document 1).

Patent Document 1: JP 2009-199469 A

The technique described in Patent Document 1 is no more than a concept for determining the usefulness of information to a user based on psychological information. To create a highly accurate profile, the degree of importance of each word included in a document must be obtained as accurately as possible.

The present invention has been made in view of this problem, and its object is to obtain the degree of importance to a user of each word included in a document more accurately than before.

A word importance calculation device according to one aspect of the present invention includes: interest level input means for inputting an interest level, which is the degree of a user's interest in a document at each time while the user is browsing the document; change pattern detection means for detecting a pattern of change in the input interest level; and importance calculation means for calculating, based on the detected pattern, an importance level, which is the degree of importance to the user, for each word included in the document.

Preferably, the change pattern detection means detects, as the pattern, at least one of a first pattern, in which the rate of change of the interest level over a first length of time is equal to or greater than a first rate, and a second pattern, in which the rate of change of the interest level over a second length of time longer than the first length is equal to or greater than a second rate smaller than the first rate. When the first pattern is detected, the importance calculation means calculates a higher value as the importance level than when only the second pattern is detected.
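The two patterns described above might be sketched as follows. This is only an illustration under assumptions that the text does not state: interest levels are sampled at a fixed interval, the rate of change is taken as the rise in interest divided by the elapsed time, and the window lengths, rate thresholds, and weights are hypothetical values.

```python
from typing import List

# Hypothetical weights: the first pattern yields a higher importance than the second.
PATTERN_WEIGHTS = {"first": 2.0, "second": 1.5, "none": 1.0}


def max_change_rate(values: List[float], step: float, window: float) -> float:
    """Largest rise in interest per second over any span of `window` seconds.

    `values` are interest levels sampled every `step` seconds (an assumption).
    """
    n = max(1, round(window / step))  # number of sampling steps in the window
    rates = [(values[i + n] - values[i]) / (n * step)
             for i in range(len(values) - n)]
    return max(rates, default=0.0)


def detect_pattern(values: List[float], step: float,
                   first_length: float = 10.0, first_rate: float = 0.5,
                   second_length: float = 60.0, second_rate: float = 0.1) -> str:
    """Return "first", "second", or "none".

    The first pattern (steep change over a short time) takes precedence
    over the second pattern (gentler change over a longer time).
    """
    if max_change_rate(values, step, first_length) >= first_rate:
        return "first"
    if max_change_rate(values, step, second_length) >= second_rate:
        return "second"
    return "none"


sudden = [1, 1, 1, 8, 8, 8]               # jumps within 10 seconds
gradual = [1 + i for i in range(13)]      # rises slowly over 60 seconds
flat = [3] * 13
```

With a sampling interval of 5 seconds, `detect_pattern(sudden, 5.0)` reports the first pattern and `detect_pattern(gradual, 5.0)` reports the second, so a word seen during the sudden rise would receive the larger hypothetical weight.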

Alternatively, the importance calculation means calculates the importance level of each word according to the frequency with which the word appears in the document.
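As a rough illustration of frequency-based importance, the sketch below uses relative term frequency. The text only says the importance depends on frequency, so the exact formula is an assumption, as is the simple tokenizer (Japanese text would need morphological analysis instead).

```python
import re
from collections import Counter
from typing import Dict


def frequency_importance(text: str) -> Dict[str, float]:
    """Importance of each word as its share of all word occurrences.

    Relative frequency is an assumed formula, not one given in the text.
    """
    words = re.findall(r"[a-z0-9]+", text.lower())  # naive tokenizer (assumption)
    counts = Counter(words)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {word: count / total for word, count in counts.items()}


scores = frequency_importance("Printer toner printer")
```

Here `scores["printer"]` is twice `scores["toner"]`, because "printer" appears twice as often.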

Alternatively, the importance calculation means calculates the importance level of each word when the intermediate value of the interest levels at the respective times is equal to or greater than a predetermined value, and otherwise does not calculate it.

Alternatively, the importance calculation means calculates, as the importance level of a word included in a portion of the document to which the user paid attention, a higher value than the importance levels of the other words.

Alternatively, the importance calculation means calculates, as the importance level of a word expressed in a predetermined format, a higher value than the importance levels of the other words. Alternatively, it calculates, as the importance level of a word appearing in a predetermined field of the document, a higher value than the importance levels of the other words.

Alternatively, the device includes pair importance calculation means for calculating, based on the detected pattern, a pair importance level, which is the degree of importance to the user, for each pair of words included in the document.

Alternatively, when the first pattern is detected, the pair importance calculation means calculates a higher value as the pair importance level than when only the second pattern is detected.

Alternatively, the pair importance calculation means calculates, as the pair importance level of a pair included in a portion of the document to which the user paid attention, a higher value than the pair importance levels of the other pairs.

Alternatively, the device includes: profile data storage means for storing profile data indicating the importance level of each word and the pair importance level of each pair; and search processing means for correcting a search query specified by the user based on the profile data and searching for information based on the corrected search query.

According to the present invention, the degree of importance to a user of each word included in a document can be obtained more accurately than before.

A diagram showing an example of the overall configuration of a document management system.
A diagram showing an example of the hardware configuration of a profile management server.
A diagram showing an example of the functional configuration of the profile management server.
A diagram showing an example of the hardware configuration of a terminal device.
A diagram showing an example of the functional configuration of the terminal device.
A diagram showing an example of interest level data at each time.
A flowchart explaining an example of the flow of necessary part extraction processing.
A diagram showing an example of the relationship between change patterns and importance characteristics.
A diagram showing an example of a weighting method for each change pattern.
A diagram showing an example of profile data.
A flowchart explaining an example of the overall processing flow of the terminal device.
A flowchart explaining an example of the overall processing flow of the profile management server.
A flowchart explaining an example of the flow of importance adjustment processing.
A diagram showing a modification of the functional configuration of the terminal device.
A flowchart explaining a modification of the flow of the importance adjustment processing.
A flowchart explaining a modification of the flow of the necessary part extraction processing.
A diagram showing an example of a table defining the relationship between a rate of change and a predetermined value.

FIG. 1 is a diagram illustrating an example of the overall configuration of the document management system 1SY. FIG. 2 is a diagram illustrating an example of the hardware configuration of the profile management server 1. FIG. 3 is a diagram illustrating an example of the functional configuration of the profile management server 1. FIG. 4 is a diagram illustrating an example of the hardware configuration of the terminal device 2. FIG. 5 is a diagram illustrating an example of the functional configuration of the terminal device 2.

As shown in FIG. 1, the document management system 1SY includes a profile management server 1, a terminal device 2, a document server 3, an image forming apparatus 4, a communication line 5, and the like.

The document management system 1SY is a system for managing various kinds of information, particularly documents, and providing them to users. The document management system 1SY is installed in an organization such as a company, a government office, or a school, and is used by the members of the organization. The following description takes as an example the case where the document management system 1SY is used in a certain company. The employees of this company are therefore the users of the document management system 1SY. Each user is given one user code, which is a unique ID (identification).

The profile management server 1, the terminal device 2, the document server 3, and the image forming apparatus 4 can communicate with one another via the communication line 5. A fixed telephone line, a mobile phone network, a PHS (Personal Handy-phone System) line, a dedicated line, the Internet, a so-called LAN (Local Area Network) line, or the like is used as the communication line 5. A wireless LAN line can be used as all or part of the LAN line.

The profile management server 1 manages a profile for each user. In particular, it manages words related to the user as the profile. It also supports document searches by using the words related to the user as keywords. These functions are described in detail later.

As shown in FIG. 2, the profile management server 1 includes a CPU (Central Processing Unit) 10a, a RAM (Random Access Memory) 10b, a ROM (Read Only Memory) 10c, a hard disk 10d, a NIC (Network Interface Card) 10e, and the like.

The NIC 10e is a device for communicating with the terminal device 2, the document server 3, and the image forming apparatus 4, as well as with web servers on the Internet and the like.

The ROM 10c or the hard disk 10d stores a program for implementing the functions shown in FIG. 3, such as a document data collection unit 101, an interest level data acquisition unit 102, a necessary part extraction unit 103, a word importance calculation unit 104, a pair importance calculation unit 105, a change pattern determination unit 106, an importance adjustment unit 107, a profile data generation unit 108, a search processing unit 109, a document data storage unit 121, an interest level data storage unit 122, and a profile data storage unit 123. This program is used for managing profiles and searching for information. It is loaded into the RAM 10b as necessary and executed by the CPU 10a.

Returning to FIG. 1, the terminal device 2 is a device for creating and editing documents and for receiving the various services provided by the profile management server 1, the document server 3, the image forming apparatus 4, and the like. A personal computer, a tablet computer (tablet PC), a mobile phone terminal, a smartphone, or the like is used as the terminal device 2. The following description takes as an example the case where a tablet computer is used as the terminal device 2.

As shown in FIG. 4, the terminal device 2 includes a CPU 20a, a RAM 20b, a flash memory 20c, a touch panel display 20d, an operation button group 20e, a wireless LAN communication device 20f, a short-range wireless communication device 20g, a mobile phone communication device 20h, a microphone 20i, a speaker 20j, a digital camera 20k, a gyro sensor 20m, and the like.

The touch panel display 20d displays screens for giving messages to the user, screens showing the results of processing, screens on which the user enters instructions, and the like. The touch panel display 20d also detects a touched position and notifies the CPU 20a of that position.

The operation button group 20e includes a button for returning to the so-called home screen, buttons for adjusting the volume, a button for switching the power on and off, and the like.

The wireless LAN communication device 20f, the short-range wireless communication device 20g, and the mobile phone communication device 20h are all devices for communicating wirelessly with other devices. They differ from one another, however, in communication method and the like.

The wireless LAN communication device 20f communicates in accordance with a wireless LAN standard, that is, the IEEE (Institute of Electrical and Electronics Engineers) 802.11 standard.

The short-range wireless communication device 20g communicates in accordance with a short-range wireless communication standard, for example Bluetooth (registered trademark) or ZigBee. A wireless keyboard, a wireless mouse, an air pen, a gaze-tracking head-mounted display, an infrared camera, a gesture controller, and the like can also be connected to the terminal device 2 via the short-range wireless communication device 20g. A gesture controller is an input device that has a depth sensor, a gyro sensor, or the like and allows the user to control the main unit (in this example, the terminal device 2) by gestures. Microsoft's Kinect, Leap Motion's LEAP, or the like is used as the gesture controller.

The mobile phone communication device 20h communicates in accordance with a standard such as CDMA (Code Division Multiple Access) 2000 or W-CDMA (Wideband-CDMA).

The microphone 20i picks up sound and generates audio data. The speaker 20j outputs sound based on audio data. The digital camera 20k captures an image and generates image data. It can also capture a moving image and generate moving image data. The gyro sensor 20m detects the attitude of the terminal device 2.

An application for creating documents (for example, word processing software) is installed in the flash memory 20c. Other applications, such as a mailer, a web browser, a scheduler, a messenger, an IP phone application, and a voiceprint identification application, are also installed.

The flash memory 20c further stores programs for implementing the functions shown in FIG. 5, such as a browsing start operation detection unit 201, a document data providing unit 202, an interest level detection unit 203, an interest level change data storage unit 204, a browsing end operation detection unit 205, an overall interest level calculation unit 206, an interest level data providing unit 207, and a search client 208.

These programs are loaded into the RAM 20b as necessary and executed by the CPU 20a.

Returning to FIG. 1, the document server 3 manages document data. A plurality of servers are provided as the document server 3. In particular, the document server 31 is a file server that manages, file by file, the data of documents created by applications such as word processing software. The document server 32 is a web server that manages document data web page by web page.

The image forming apparatus 4 is an apparatus that combines functions such as copying, PC printing, faxing, and scanning. It is generally called a "multifunction machine" or an "MFP (Multi Function Peripherals)".

"PC printing" is a function for printing an image on paper based on image data received from the terminal device 2. It is also called "network printing" or "network print".

Next, the units of the profile management server 1 shown in FIG. 3 and the units of the terminal device 2 shown in FIG. 5 are described under three broad headings: the process of collecting data for profile generation, the process of generating the profile, and the process of searching for information.

[Process of collecting data for profile generation]
FIG. 6 is a diagram illustrating an example of the interest level data 6KA at each time.

The process of collecting the data needed to generate the profile starts when the user begins using a document on the terminal device 2.

In the terminal device 2, the browsing start operation detection unit 201 detects that the user has performed an operation to start browsing a document.

Detection of the operation to start browsing can be implemented with a key logger. A key logger is an agent that continuously monitors input from the keyboard, the mouse, or the touch panel display. Specifically, the browsing start operation detection unit 201 uses the key logger to detect the sequence of operations performed by the user and matches the detected sequence against a predetermined sequence of operations. If the two match, it determines that an operation to start browsing has been performed. The predetermined sequence is, for example, launching a predetermined application, entering a file-open command, and selecting document data. In this example, the browsing start operation detection unit 201 can also detect which document data the user selected.
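The matching step might look like the sketch below. The event names, the tuple layout, and the rule that the most recent events must equal the predetermined sequence are all assumptions made for illustration; the text does not specify how the key logger represents operations.

```python
from typing import Optional, Sequence, Tuple

# Each logged event is (operation kind, argument); this layout is hypothetical.
Event = Tuple[str, str]

# Hypothetical predetermined sequence: launch app, open command, select document.
OPEN_PROCEDURE = ("launch_app", "open_command", "select_document")


def detect_browsing_start(events: Sequence[Event],
                          procedure: Sequence[str] = OPEN_PROCEDURE) -> Optional[str]:
    """Return the selected document if the latest events match `procedure`.

    Returns None when the most recent operations do not match.
    """
    n = len(procedure)
    if len(events) < n:
        return None
    tail = events[-n:]
    if tuple(kind for kind, _ in tail) != tuple(procedure):
        return None
    # The argument of the final "select_document" event names the document.
    return tail[-1][1]


log = [
    ("tap", "home"),
    ("launch_app", "writer"),
    ("open_command", ""),
    ("select_document", "report.docx"),
]
```

For the log above, `detect_browsing_start(log)` yields `"report.docx"`, which corresponds to detecting both the start of browsing and the selected document.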

As a result of these operations, the application starts on the terminal device 2 in the usual way, the data of the document selected by the user is downloaded from the document server 3 or read from a database in the terminal device 2, and the document is displayed. The user can then browse the document.

The document data providing unit 202 transmits the data of the document detected by the browsing start operation detection unit 201 to the profile management server 1 in association with the user code of the user. Hereinafter, the document selected by the user is referred to as the "current document", and the data of the current document is referred to as "document data 6DC".

The document data collection unit 101 (see FIG. 3) of the profile management server 1 acquires the document data 6DC transmitted from the terminal device 2 and stores it in the document data storage unit 121.

The interest level detection unit 203 of the terminal device 2 detects an interest level RI, which represents how strongly the user is interested in the current document. The interest level RI can be detected, for example, by the following methods.

The first method detects the interest level based on the user's line of sight. Specifically, the user's face is captured by the digital camera 20k or a gaze-tracking head-mounted display, and the line of sight and its movement are detected. A function or table is prepared in advance that yields a higher interest level the longer the line of sight stays fixed on the screen of the current document (that is, the longer the user gazes at it). The time for which the line of sight stays on the screen of the current document is then calculated from the user's line of sight, and the interest level RI is obtained from the function or table. A known technique for detecting an interest level from the line of sight is described in JP-A-11-85762.
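A table of this kind might be sketched as follows. The threshold values and the five-step interest scale are hypothetical; the text only requires that a longer gaze time yield a higher interest level.

```python
import bisect

# Hypothetical table: gaze time boundaries in seconds and the interest
# level assigned to each resulting band (one more level than boundaries).
DWELL_THRESHOLDS = [1.0, 3.0, 6.0, 10.0]
INTEREST_LEVELS = [1, 2, 3, 4, 5]


def interest_from_dwell(dwell_seconds: float) -> int:
    """Look up the interest level RI for a measured gaze time.

    A longer gaze time never yields a lower interest level.
    """
    band = bisect.bisect_right(DWELL_THRESHOLDS, dwell_seconds)
    return INTEREST_LEVELS[band]
```

For instance, a gaze of 4 seconds falls in the third band and maps to level 3, while a gaze of 12 seconds exceeds every boundary and maps to level 5.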

The second method detects the interest level based on the user's posture. Specifically, the distance between the user and the target object (in this example, the screen of the current document) is measured by the digital camera 20k. A function or table representing the relationship between this distance and the interest level is prepared in advance. The interest level RI is then obtained from the measured distance and the function or table. Alternatively, the interest level may be detected from the amount of change in the distance.

The third method detects the interest level based on biometric information. Specifically, the movement of the user's eyeballs, blinking, and pupil size are detected by the digital camera 20k, a gaze-tracking head-mounted display, or an infrared camera. A function or table is prepared in advance that represents the relationship between the interest level and eyeball movement, the number or frequency of blinks, or the pupil size or its change. The interest level RI is then obtained from the detected information and the function or table. An example of a known technique for detecting an interest level from information about the human eye is described in WO2008/072739.

With these methods, it is only necessary to capture images of the user browsing the current document, so the interest level RI can be detected in real time without interfering with the user's work.

The interest level RI may also be detected by a method other than the three described above. Alternatively, a plurality of methods may be combined to improve accuracy.

The interest level detection unit 203 detects the interest level RI at predetermined intervals (for example, every 1 to 10 seconds) from the time the user starts browsing the current document until the user finishes.

Each detected interest level RI is associated with the time of detection and stored in the interest level change data storage unit 204 as interest level data 6KA. In other words, the interest level change data storage unit 204 stores the interest level RI at each time, that is, data on how the interest level RI changes, as shown in FIG. 6.
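The periodic sampling and storage might be sketched as follows. The class and function names are hypothetical, and the detector is passed in as a plain function of time so that the example runs without camera hardware.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class InterestSample:
    """One entry of interest level data: a time and the RI detected then."""
    time: float      # seconds since browsing started (assumed representation)
    interest: float  # interest level RI


@dataclass
class InterestLog:
    """Stands in for the interest level change data storage unit."""
    samples: List[InterestSample] = field(default_factory=list)

    def record(self, time: float, interest: float) -> None:
        self.samples.append(InterestSample(time, interest))


def collect(detect: Callable[[float], float],
            start: float, end: float, interval: float) -> InterestLog:
    """Sample `detect` every `interval` seconds from `start` through `end`."""
    log = InterestLog()
    steps = int((end - start) // interval)
    for k in range(steps + 1):
        t = start + k * interval  # computed from k to avoid accumulating error
        log.record(t, detect(t))
    return log


# A stand-in detector whose output rises steadily with time.
demo_log = collect(lambda t: t / 10, start=0.0, end=20.0, interval=5.0)
```

The resulting log holds five samples, at 0, 5, 10, 15, and 20 seconds.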

The browsing end operation detection unit 205 detects that the user has performed an operation to end browsing of the current document.

Like detection of the operation to start browsing, detection of the operation to end browsing can be implemented with a key logger. Specifically, the browsing end operation detection unit 205 uses the key logger to detect an operation performed by the user and matches the detected operation against predetermined operations. If they match, it determines that an operation to end browsing has been performed. The predetermined operations are, for example, entering a command to exit a predetermined application, entering a command to close the document file, entering a command to open the file of another document, or switching to another application.

When the browsing end operation detection unit 205 detects the operation to end browsing of the current document, the overall interest level calculation unit 206 calculates an overall interest level RE, which represents the overall degree of interest during browsing, based on the interest level data 6KA for each time stored in the interest level change data storage unit 204. For example, it calculates the average of the interest levels RI indicated in the interest level data 6KA as the overall interest level RE. Alternatively, it may calculate an intermediate value between the maximum and minimum of these interest levels RI (for example, the value exactly halfway between them) as the overall interest level RE.
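The two ways of computing the overall interest level RE described above can be written down directly. The function names are illustrative only.

```python
from statistics import mean
from typing import Sequence


def overall_interest_mean(levels: Sequence[float]) -> float:
    """Overall interest level RE as the average of the sampled RI values."""
    return mean(levels)


def overall_interest_midrange(levels: Sequence[float]) -> float:
    """Overall interest level RE as the value halfway between max and min."""
    return (max(levels) + min(levels)) / 2


ri_values = [2.0, 4.0, 9.0]
```

For these sample values the average is 5.0 and the halfway value is 5.5, so the two definitions can differ when the samples are unevenly spread.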

The interest level data providing unit 207 generates interest level data 6K and transmits it to the profile management server 1. The interest level data 6K indicates a signal showing that browsing has ended, the overall interest level RE calculated by the overall interest level calculation unit 206, the interest level RI at each time detected by the interest level detection unit 203, and the user code of the viewer (user).
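The four items carried by the interest level data 6K might be represented as in the sketch below. The field names, the timestamp format, and the use of JSON as the transmission format are assumptions; the text does not specify an encoding.

```python
import json
from dataclasses import asdict, dataclass
from typing import List, Tuple


@dataclass
class InterestData:
    """Hypothetical layout for the interest level data 6K."""
    browsing_finished: bool            # signal that browsing has ended
    overall_interest: float            # overall interest level RE
    samples: List[Tuple[str, float]]   # (time, interest level RI) pairs
    user_code: str                     # user code of the viewer

    def to_json(self) -> str:
        """Serialize for transmission (JSON is an assumed format)."""
        return json.dumps(asdict(self))


payload = InterestData(
    browsing_finished=True,
    overall_interest=5.0,
    samples=[("10:00:00", 2.0), ("10:00:05", 4.0), ("10:00:10", 9.0)],
    user_code="U0001",
)
decoded = json.loads(payload.to_json())
```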

In the profile management server 1, the interest level data acquisition unit 102 acquires the interest level data 6K transmitted from the terminal device 2 and stores it in the interest level data storage unit 122.

〔プロファイルの生成の処理〕 [Profile generation processing]
図７は、必要部分抽出処理の流れの例を説明するフローチャートである。図８は、変化パターンと重要性特性との関係の例を示す図である。図９は、変化パターンごとの重み付けの方法の例を示す図である。図１０は、プロファイルデータ６ＰＤの例を示す図である。 FIG. 7 is a flowchart illustrating an example of the flow of necessary part extraction processing. FIG. 8 is a diagram illustrating an example of the relationship between the change pattern and the importance characteristic. FIG. 9 is a diagram illustrating an example of a weighting method for each change pattern. FIG. 10 is a diagram illustrating an example of the profile data 6PD.

必要部分抽出部１０３は、興味度データ取得部１０２によって興味度データ６Ｋが取得されると、カレント文書の中からプロファイルの生成に必要な部分を抽出する処理を、例えば図７に示す手順で実行する。 When the interest level data 6K is acquired by the interest level data acquisition unit 102, the necessary part extraction unit 103 extracts the part needed for generating the profile from the current document, for example, by the procedure shown in FIG. 7.

必要部分抽出部１０３は、興味度データ６Ｋに示される全体興味度ＲＥと閾値αとを比較する（＃７０１）。全体興味度ＲＥが閾値α以上である場合は（＃７０２でＹｅｓ）、文書データ記憶部１２１から文書データ６ＤＣを読み出す（＃７０３）。そして、文書データ６ＤＣに基づいて、プロファイルの生成のために必要でない部分をカレント文書から削除する（＃７０４）。 The necessary part extraction unit 103 compares the overall interest level RE indicated in the interest level data 6K with the threshold value α (# 701). If the overall interest level RE is equal to or greater than the threshold value α (Yes in # 702), the document data 6DC is read from the document data storage unit 121 (# 703). Then, based on the document data 6DC, a part that is not necessary for generating the profile is deleted from the current document (# 704).

例えば、必要部分抽出部１０３は、カレント文書が電子メールである場合は、電子メールのヘッダおよびレターヘッドを削除する。カレント文書がウェブページである場合は、ウェブページのバナー広告を削除する。または、ウェブページの必要な部分と不要な部分とを、領域判別に関する公知の技術によって区別し、不要な部分を削除してもよい。公知の技術として、エバーノート社のＷｅｂクリッパまたはＣｌｅａｒｌｙで採用されている技術が用いられる。 For example, if the current document is an e-mail, the necessary part extraction unit 103 deletes the e-mail header and letterhead. If the current document is a web page, it deletes the banner advertisements of the web page. Alternatively, the necessary and unnecessary parts of the web page may be distinguished by a known region-discrimination technique, and the unnecessary parts deleted. As the known technique, the technique adopted in Evernote's Web Clipper or Clearly is used.

このようにして、カレント文書から抽出された必要な部分のデータを、以下、「必要部分データ６ＨＢ」と記載する。 The necessary portion data extracted from the current document in this way is hereinafter referred to as “required portion data 6HB”.

そして、必要部分抽出部１０３は、必要部分データ６ＨＢを文書データ記憶部１２１に保存する（＃７０５）。 Then, the necessary part extraction unit 103 stores the necessary part data 6HB in the document data storage unit 121 (# 705).

一方、全体興味度ＲＥが閾値α未満である場合は（＃７０２でＮｏ）、必要部分抽出部１０３は、ステップ＃７０３以降の処理を中止する。次に説明する単語重要度算出部１０４などによる処理も中止され、必要部分抽出部１０３は、次の興味度データ６Ｋが受信されるのを待つ。 On the other hand, when the overall interest level RE is less than the threshold value α (No in # 702), the necessary part extraction unit 103 stops the processes after step # 703. Processing by the word importance degree calculation unit 104 and the like described below is also stopped, and the necessary part extraction unit 103 waits for reception of the next interest level data 6K.

単語重要度算出部１０４は、必要部分データ６ＨＢに示される文書（必要な部分）から単語を抽出し、単語ごとの重要性の度合い（以下、「重要度Ｗ」と記載する。）を算出する。 The word importance calculation unit 104 extracts words from the document (necessary part) indicated in the necessary part data 6HB, and calculates the degree of importance for each word (hereinafter referred to as “importance W”). .

抽出の処理は、例えば、次の方法で行うことができる。必要部分データ６ＨＢに示される文書が英語またはフランス語などで記述されている場合は、単語抽出部１０５は、スペースまたは改行のコードに基づいて単語同士を切り分け抽出すればよい。日本語で記述されている場合は、形態素解析の技術を用いて単語を抽出する。この技術を実装しているソフトウェアとして、奈良先端科学技術大学院大学の松本裕治研究室の茶筅（ＣｈａＳｅｎ）および工藤拓氏のＭｅＣａｂが挙げられる。これらのソフトウェアによって形態素解析を行うことによって、単語を抽出すればよい。 Extraction can be performed, for example, as follows. If the document indicated in the necessary part data 6HB is written in a language such as English or French, the word extraction unit 105 may split the text into words at space or line-feed codes. If it is written in Japanese, words are extracted using morphological analysis. Software implementing this technique includes ChaSen, from the Yuji Matsumoto laboratory at the Nara Institute of Science and Technology, and MeCab, by Taku Kudo. Words may be extracted by performing morphological analysis with such software.

一方、重要度Ｗの算出は、例えば、次の方法で行うことができる。１つ目の方法は、パターンマッチングによる方法である。この方法は、「○○は調査すべきである」のような、よく使う重要な言い回しのテンプレートを予め登録しておく。そして、「○○」に該当する単語に、該当する回数に応じて所定の点数を与えることによって、重要度Ｗを算出する。 The importance W, on the other hand, can be calculated, for example, by the following methods. The first is a pattern-matching method. Templates of frequently used important phrasings, such as “XX should be investigated”, are registered in advance. The importance W is then calculated by giving a predetermined score to each word that fills the “XX” slot, according to the number of times it does so.
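The pattern-matching method can be illustrated with a small sketch for English text. The two templates, the constant `POINTS_PER_MATCH`, and the function name are hypothetical choices made for this example; the specification only states that templates are registered in advance and that a predetermined score is given per match.

```python
import re
from collections import Counter

# Hypothetical registered templates; the capture group plays the role of "XX".
TEMPLATES = [
    re.compile(r"\b(\w+) should be investigated\b"),
    re.compile(r"\b(\w+) is important\b"),
]
POINTS_PER_MATCH = 2  # assumed "predetermined score" per template hit


def template_importance(text):
    """Score each word by how often it fills the slot of a registered template."""
    scores = Counter()
    for template in TEMPLATES:
        for match in template.finditer(text):
            scores[match.group(1).lower()] += POINTS_PER_MATCH
    return dict(scores)
```

For Japanese text, the same idea would be applied to the output of a morphological analyzer rather than to `\w+` matches.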

２つ目の方法は、文書に単語が出現する回数に応じて算出する方法である。例えば、ＴＦ−ＩＤＦ（Term Frequency Inverse Document Frequency）法による方法である。この方法によると、次に示す式に基づいて重要度を算出する。 The second method calculates the importance according to the number of times a word appears in the document, for example, the TF-IDF (Term Frequency-Inverse Document Frequency) method. In this method, the importance is calculated based on the following formula.

「ｎ_i,j」は、単語ｉの、文書ｊにおける出現回数である。｜Ｄ｜は、文書の総数である。｜｛ｄ：ｄ∋ｔ_i｝｜は、単語ｉを含む文書の総数である。上述の式の「ｉｄｆ」は、一般語フィルタとして働く。そして、多くの文書に出現する単語（一般語）の重要度を下げる役割を果たす。 “n_{i,j}” is the number of times word i appears in document j. |D| is the total number of documents. |{d : d ∋ t_i}| is the number of documents that contain word i. The “idf” term of the formula acts as a common-word filter: it lowers the importance of words (common words) that appear in many documents.
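The formula itself is not reproduced in this text. The sketch below therefore assumes the commonly used TF-IDF definition, which is consistent with the symbols defined here: tf_{i,j} = n_{i,j} / Σ_k n_{k,j}, idf_i = log(|D| / |{d : d ∋ t_i}|), and importance = tf_{i,j} · idf_i. The function name `tf_idf` and the token-list input format are illustrative assumptions.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Compute TF-IDF scores under the standard definition (an assumption here).

    docs: list of documents, each given as a list of already extracted words.
    Returns one {word: score} dictionary per document.
    """
    n_docs = len(docs)
    # Number of documents containing each word, i.e. |{d : d contains t_i}|.
    doc_freq = Counter()
    for tokens in docs:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in docs:
        counts = Counter(tokens)   # n_{i,j}
        total = len(tokens)        # sum over k of n_{k,j}
        scores.append({
            word: (count / total) * math.log(n_docs / doc_freq[word])
            for word, count in counts.items()
        })
    return scores
```

A word that occurs in every document gets idf = log(1) = 0, which is the common-word filtering effect described above.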

ペア重要度算出部１０５は、必要部分データ６ＨＢに示される文書（必要な部分）から複数の単語からなるペアを抽出し、ペアごとの重要性の度合い（以下、「ペア重要度Ｓ」と記載する。）を算出する。 The pair importance calculation unit 105 extracts a pair of a plurality of words from the document (necessary part) indicated in the necessary part data 6HB, and describes the degree of importance for each pair (hereinafter referred to as “pair importance S”). ) Is calculated.

ペアは、単語重要度算出部１０４によって抽出された単語同士を任意に組み合わせることによって、抽出することができる。ただし、この方法によると、非常に多くのペアが抽出され得るが、単語同士の関連性が低いペアが含まれることがある。そこで、１つの塊、例えば、１つのセンテンスまたは１つの段落などに含まれる単語同士をペアとして抽出してもよい。以下、２つの単語が１つのペアをなす場合を例に説明する。 A pair can be extracted by arbitrarily combining words extracted by the word importance calculation unit 104. However, according to this method, a very large number of pairs can be extracted, but a pair having a low relevance between words may be included. Therefore, words included in one block, for example, one sentence or one paragraph may be extracted as a pair. Hereinafter, a case where two words form one pair will be described as an example.

ペア重要度Ｓの算出は、例えば、次の方法で行うことができる。単語重要度算出部１０４は、抽出したペアを構成する２つの単語同士の、文書内における距離を求める。そして、所定の関数またはテーブルに基づいて、その距離に応じた重要度を算出する。所定の関数またはテーブルには、距離が短いほど重要度が高くなるように、距離と重要度との関係が定義されている。 The pair importance S can be calculated by, for example, the following method. The word importance calculation unit 104 obtains a distance in the document between two words constituting the extracted pair. Then, based on a predetermined function or table, the importance corresponding to the distance is calculated. In the predetermined function or table, the relationship between the distance and the importance is defined so that the importance becomes higher as the distance becomes shorter.

例えば、「南部地方｜で｜は、｜気候｜を｜活かし｜た｜特産品｜で｜村おこし｜の｜協力者｜を｜募集し｜て｜いる。」というセンテンスの中から「南部地方」および「気候」のペアが抽出されたとする。なお、「｜」は、単語と単語との区切りの記号である。句読点などの記号は、無視する。このペアを構成する２つの単語の距離（いわゆるワード距離）は、自らを含め、４ワードである。よって、ペア重要度算出部１０５は、このペアのペア重要度Ｓを、４ワードに対応する重要度を所定の関数またはテーブルから導くことによって、算出する。同様に、「南部地方」と「特産品」との距離は、８ワードである。よって、ペア重要度算出部１０５は、両単語のペアのペア重要度Ｓを、８ワードに対応する重要度を所定の関数またはテーブルから導くことによって、算出する。 For example, suppose that the pair of “南部地方” (southern region) and “気候” (climate) is extracted from the sentence “南部地方｜で｜は、｜気候｜を｜活かし｜た｜特産品｜で｜村おこし｜の｜協力者｜を｜募集し｜て｜いる。” (roughly, “The southern region is recruiting collaborators for village revitalization with specialty products that make use of its climate.”). Here, “｜” marks the boundary between words, and symbols such as punctuation marks are ignored. The distance between the two words of this pair (the so-called word distance) is 4 words, counting the two words themselves. The pair importance calculation unit 105 therefore calculates the pair importance S of this pair by deriving the importance corresponding to 4 words from a predetermined function or table. Similarly, the distance between “南部地方” (southern region) and “特産品” (specialty product) is 8 words, so the pair importance calculation unit 105 calculates the pair importance S of that pair by deriving the importance corresponding to 8 words from the predetermined function or table.
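The word-distance example can be reproduced with a short sketch. The function `importance_from_distance` and its inverse-proportional form are assumptions chosen only to satisfy the stated property that a shorter distance gives a higher importance; the specification leaves the actual function or table unspecified.

```python
def word_distance(tokens, word_a, word_b):
    """Inclusive word distance between the first occurrences of two words."""
    return abs(tokens.index(word_a) - tokens.index(word_b)) + 1


def importance_from_distance(distance, scale=10.0):
    """Hypothetical monotonically decreasing mapping from distance to importance."""
    return scale / distance


# The example sentence, already split into words (punctuation ignored).
sentence = ["南部地方", "で", "は", "気候", "を", "活かし", "た", "特産品",
            "で", "村おこし", "の", "協力者", "を", "募集し", "て", "いる"]

d_climate = word_distance(sentence, "南部地方", "気候")    # 4 words
d_product = word_distance(sentence, "南部地方", "特産品")  # 8 words
```

With this mapping, the pair with “気候” receives a higher pair importance S than the pair with “特産品”, because its word distance is shorter.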

または、ペア重要度算出部１０５は、シソーラス（分類辞典）を参照し、両単語の持つ意味的な繋がりに基づいて重要度を算出してもよい。 Alternatively, the pair importance calculation unit 105 may refer to a thesaurus (classification dictionary) and calculate the importance based on the semantic connection between the two words.

これらの方法は、ペアを構成する両単語の関係性の度合いを、ペアの重要度として算出していると、言える。 It can be said that these methods calculate the degree of relationship between the two words constituting the pair as the importance of the pair.

同一の単語が複数抽出された場合は、同一のペアが複数回、出現する。例えば、ＷＤ１という単語が２つ抽出され、ＷＤ２という単語が３つ抽出された場合は、２×３、つまり、６回、両単語のペアが出現する。このような場合は、２つの単語同士の距離が最も近いときの重要度をペア重要度Ｓとして採用すればよい。または、それぞれの重要度の平均値を取ってもよい。または、出現回数の多さに応じた重要度をペア重要度Ｓとして採用してもよい。つまり、出現回数が多いほどペア重要度Ｓを大きくしてもよい。 When a plurality of the same words are extracted, the same pair appears a plurality of times. For example, when two words WD1 are extracted and three words WD2 are extracted, 2 × 3, that is, pairs of both words appear six times. In such a case, the importance when the distance between the two words is the shortest may be adopted as the pair importance S. Or you may take the average value of each importance. Alternatively, importance corresponding to the number of appearances may be adopted as the pair importance S. That is, the pair importance S may be increased as the number of appearances increases.
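The handling of repeated words can be sketched as follows. The function names and the `mode` values are hypothetical; the mapping from distance to importance is passed in as `importance_of` because the specification does not fix it.

```python
from itertools import product


def occurrence_distances(tokens, word_a, word_b):
    """Inclusive word distances for every combination of occurrences of two words."""
    pos_a = [i for i, t in enumerate(tokens) if t == word_a]
    pos_b = [i for i, t in enumerate(tokens) if t == word_b]
    return [abs(i - j) + 1 for i, j in product(pos_a, pos_b)]


def aggregate_pair_importance(distances, importance_of, mode="nearest"):
    """Combine several occurrences of the same pair into one pair importance S.

    mode="nearest": importance at the shortest distance.
    mode="average": mean of the importance of every occurrence.
    """
    if not distances:
        raise ValueError("the pair does not occur")
    if mode == "nearest":
        return importance_of(min(distances))
    if mode == "average":
        return sum(importance_of(d) for d in distances) / len(distances)
    raise ValueError(f"unknown mode: {mode}")
```

Two occurrences of WD1 and three of WD2 yield 2 × 3 = 6 distances, matching the count given above. A count-based variant would instead make S grow with `len(distances)`.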

または、ペア重要度算出部１０５は、上記の演算を行わず、すべてのペアのペア重要度Ｓを一律に所定の値（ただし、「０」以外の値。例えば、「１」）に決めてもよい。 Alternatively, the pair importance calculation unit 105 does not perform the above calculation, and uniformly determines the pair importance S for all the pairs to a predetermined value (however, a value other than “0”, for example, “1”). Also good.

変化パターン判定部１０６は、ユーザの興味度ＲＩの変化に表れるパターン（以下、「変化パターン」と記載する。）を、次のように判定する。 The change pattern determination unit 106 determines a pattern (hereinafter referred to as “change pattern”) that appears in the change in the user's degree of interest RI as follows.

変化パターン判定部１０６は、例えば１〜１０秒程度の短い時間Ｌａに急激に興味度ＲＩが上昇するパターン（以下、「急上昇パターン」と記載する。）が表れているか否かを、次のように判別する。カレント文書の閲覧中の各時刻から時間Ｌａの経過後までの興味度ＲＩの変化率を算出する。つまり、例えば、ある時刻Ｔ１における変化率として、時刻Ｔ１の時間Ｌａ前における興味度ＲＩから時刻Ｔ１における興味度ＲＩへの増加量を時間Ｌａで割った値を、算出する。そして、変化率が閾値βａ以上である時刻があれば、変化パターンとして急上昇パターンが表れていると、判定する。 The change pattern determination unit 106 determines, as follows, whether a pattern in which the degree of interest RI rises sharply within a short time La of, for example, about 1 to 10 seconds (hereinafter referred to as a “rapid increase pattern”) appears. It calculates the rate of change of the degree of interest RI over the time La for each time during browsing of the current document. For example, as the rate of change at a certain time T1, it calculates the increase from the degree of interest RI at the time La before T1 to the degree of interest RI at T1, divided by La. If there is any time at which the rate of change is equal to or greater than the threshold βa, it determines that the rapid increase pattern appears as the change pattern.

さらに、変化パターン判定部１０６は、例えば１０秒〜１分程度の長い時間Ｌｂに興味度ＲＩが上昇するパターン（以下、「緩上昇パターン」と記載する。）が表れているか否かを、次のように判別する。カレント文書の閲覧中の各時刻から時間Ｌｂの経過後までの興味度ＲＩの変化率を算出する。つまり、例えば、ある時刻Ｔ１における変化率として、時刻Ｔ１の時間Ｌｂ前における興味度ＲＩから時刻Ｔ１における興味度ＲＩへの増加量を時間Ｌｂで割った値を、算出する。そして、変化率が閾値βｂ以上である時刻があれば、変化パターンとして緩上昇パターンが表れていると、判定する。ただし、閾値βａ＞閾値βｂ、である。 Further, the change pattern determination unit 106 determines whether or not a pattern in which the degree of interest RI increases (hereinafter referred to as “slow increase pattern”) appears for a long time Lb of about 10 seconds to 1 minute, for example. It is determined as follows. A change rate of the degree of interest RI from each time during browsing of the current document to after the lapse of time Lb is calculated. That is, for example, as the rate of change at a certain time T1, a value obtained by dividing the amount of increase from the degree of interest RI before the time Lb at the time T1 to the degree of interest RI at the time T1 by the time Lb is calculated. If there is a time when the rate of change is equal to or greater than the threshold value βb, it is determined that a slowly rising pattern appears as the change pattern. However, threshold value βa> threshold value βb.

また、急上昇パターンおよび緩上昇パターンのいずれも表れていない場合は、変化パターン判定部１０６は、変化パターンとして「上昇小パターン」が表れていると、判定する。 Further, when neither the sudden rise pattern nor the slow rise pattern appears, the change pattern determination unit 106 determines that “a small increase pattern” appears as the change pattern.

なお、時間ＬａおよびＬｂの長さは、ユーザの読書のスピードなどに応じて適宜、変更することができる。ただし、時間Ｌａ＜時間Ｌｂ、である。閾値βａおよびβｂも、適宜、変更することができる。 The lengths of the times La and Lb can be changed as appropriate according to the reading speed of the user. However, time La <time Lb. The threshold values βa and βb can also be changed as appropriate.
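The determination of the change pattern can be sketched as below. The sample format (a list of `(time_in_seconds, RI)` tuples), the function names, and the pattern labels `"rapid"`, `"slow"`, and `"small"` are assumptions for illustration. The sketch also assumes that a sample exists exactly one window before a given time; times without such a sample are skipped.

```python
def max_rate(samples, window):
    """Largest rate of change (RI(T1) - RI(T1 - window)) / window over all times T1."""
    by_time = dict(samples)
    rates = [
        (ri - by_time[t - window]) / window
        for t, ri in samples
        if (t - window) in by_time
    ]
    return max(rates, default=float("-inf"))


def classify_change_pattern(samples, la, lb, beta_a, beta_b):
    """Return "rapid", "slow", or "small" for a series of interest samples."""
    if not (la < lb and beta_a > beta_b):
        raise ValueError("requires La < Lb and beta_a > beta_b")
    if max_rate(samples, la) >= beta_a:
        return "rapid"   # sharp rise within the short time La
    if max_rate(samples, lb) >= beta_b:
        return "slow"    # gradual rise over the long time Lb
    return "small"       # neither pattern appears
```

Checking the rapid pattern first mirrors the later adjustment step, where the slow pattern is applied only when the rapid pattern does not appear.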

重要度調整部１０７は、単語重要度算出部１０４によって算出された各単語の重要度Ｗおよびペア重要度算出部１０５によって算出された各ペアのペア重要度Ｓを、変化パターン判定部１０６によって判定された変化パターンに基づいて調整する。 The importance level adjustment unit 107 determines the importance level W of each word calculated by the word importance level calculation unit 104 and the pair importance level S of each pair calculated by the pair importance level calculation unit 105 by the change pattern determination unit 106. Adjust based on the changed pattern.

ところで、一般に、文書の中のユーザにとっての重要な部分は、変化パターンに応じて、重要性について図８に示すような特性を有することが多い。 By the way, in general, an important part for a user in a document often has a characteristic as shown in FIG. 8 according to the change pattern.

つまり、変化パターンとして急上昇パターンが表れる場合は、ユーザの目に留まった単語自体が重要であることが多い。一方、緩上昇パターンが表れる場合は、単語自体よりも全体的な内容が重要であることが多い。また、どちらの場合も、単語同士のペアが重要であることが多い。 That is, when a sudden rise pattern appears as a change pattern, the word itself that is noticeable to the user is often important. On the other hand, when a slowly rising pattern appears, the overall content is often more important than the word itself. In both cases, word-to-word pairs are often important.

そこで、重要度調整部１０７は、このような傾向に鑑み、図９に示すように、各単語の重要度Ｗおよび各ペアのペア重要度Ｓを調整する。 Therefore, in view of such a tendency, the importance level adjustment unit 107 adjusts the importance level W of each word and the pair importance level S of each pair as shown in FIG.

つまり、変化パターンが急上昇パターンであると変化パターン判定部１０６によって判定された場合は、重要度調整部１０７は、各単語の重要度Ｗに所定の値γａ（γａ＞０）を加える。さらに、各ペアのペア重要度Ｓに所定の値γｂ（γｂ＞０）を加える。または、各ペアのペア重要度Ｓに所定の値γｃ（γｃ＞１）を掛けてもよい。 That is, when the change pattern determination unit 106 determines that the change pattern is a sudden increase pattern, the importance level adjustment unit 107 adds a predetermined value γa (γa> 0) to the importance level W of each word. Further, a predetermined value γb (γb> 0) is added to the pair importance S of each pair. Alternatively, the pair importance S of each pair may be multiplied by a predetermined value γc (γc> 1).

または、変化パターンが急上昇パターンではなく緩上昇パターンであると変化パターン判定部１０６によって判定された場合は、重要度調整部１０７は、各ペアのペア重要度Ｓに所定の値γｄ（γｄ＞０）を加える。または、各ペアのペア重要度Ｓに所定の値γｅ（γｅ＞１）を掛けてもよい。 Alternatively, when the change pattern determination unit 106 determines that the change pattern is not a sudden increase pattern but a slow increase pattern, the importance level adjustment unit 107 sets the pair importance level S of each pair to a predetermined value γd (γd> 0). ). Alternatively, the pair importance S of each pair may be multiplied by a predetermined value γe (γe> 1).

なお、所定の値γｂ＞所定の値γｄ、である。また、所定の値γｃ＞所定の値γｅ、である。つまり、急上昇パターンが出現するときのほうが、急上昇パターンが出現せず緩上昇パターンが出現するときよりも、増加量を多くする。 Note that the predetermined value γb> the predetermined value γd. Further, the predetermined value γc> the predetermined value γe. That is, the amount of increase is greater when the sudden rise pattern appears than when the slow rise pattern appears without the sudden rise pattern.

また、ほぼ同時に目に留まった単語同士は、全体的な内容を表わす単語同士よりも、直接的な結び付きが大きい。そこで、ほぼ同時に目に留まった単語同士のペアに対する増加量をより大きくしてもよい。 In addition, words that are noticed almost simultaneously have a greater direct connection than words that represent the overall content. Therefore, the amount of increase with respect to pairs of words that are noticed almost simultaneously may be increased.

変化パターンが上昇小パターンであると変化パターン判定部１０６によって判定された場合は、重要度調整部１０７は、重要度Ｗの調整もペア重要度Ｓの調整も、行わない。 When the change pattern determination unit 106 determines that the change pattern is a small increase pattern, the importance level adjustment unit 107 does not adjust the importance level W nor the pair importance level S.
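The additive variant of the adjustment rules can be summarized in a short sketch. The default values for γa, γb, and γd and the pattern labels are hypothetical; only the constraints γa > 0 and γb > γd > 0 come from the description.

```python
def adjust_importance(word_w, pair_s, pattern, gamma_a=1.0, gamma_b=2.0, gamma_d=1.0):
    """Return adjusted copies of the word importance W and pair importance S.

    word_w: {word: W}, pair_s: {(word_a, word_b): S}
    pattern: "rapid", "slow", or "small" (labels are illustrative).
    """
    if not (gamma_a > 0 and gamma_b > gamma_d > 0):
        raise ValueError("requires gamma_a > 0 and gamma_b > gamma_d > 0")
    if pattern == "rapid":
        # Words that caught the user's eye and their pairs both gain weight.
        return ({w: v + gamma_a for w, v in word_w.items()},
                {p: s + gamma_b for p, s in pair_s.items()})
    if pattern == "slow":
        # Overall content matters more than single words: only pairs gain weight.
        return (dict(word_w), {p: s + gamma_d for p, s in pair_s.items()})
    # Small increase pattern: no adjustment.
    return dict(word_w), dict(pair_s)
```

The multiplicative variant would replace the additions to S with multiplication by γc or γe, under the analogous constraint γc > γe > 1.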

プロファイルデータ生成部１０８は、ユーザのプロファイルデータ６ＰＤを生成する。プロファイルデータ６ＰＤには、図１０のように、単語ごとの重要度Ｗおよび識別子（単語コード）と、ペアごとのペア重要度Ｓおよび識別子（ペアコード）などが示される。これらの単語およびペアは、そのユーザの特徴を表わす重要なキーワードであると、言える。なお、ペアを構成する単語を「南部地方−特産品」のように文字で表わす代わりに「ＷＤ０００１−ＷＤ０００２」のように単語コードで表わしてもよい。 The profile data generation unit 108 generates user profile data 6PD. As shown in FIG. 10, the profile data 6PD shows the importance W and identifier (word code) for each word, pair importance S and identifier (pair code) for each pair, and the like. It can be said that these words and pairs are important keywords representing the characteristics of the user. The words constituting the pair may be represented by word codes such as “WD0001-WD0002” instead of characters such as “Southern region—specialty product”.

生成されたプロファイルデータ６ＰＤは、そのユーザのユーザコードと対応付けられてプロファイルデータ記憶部１２３に記憶される。 The generated profile data 6PD is stored in the profile data storage unit 123 in association with the user code of the user.

なお、そのユーザのプロファイルデータ６ＰＤが既にプロファイルデータ記憶部１２３に記憶されている場合は、プロファイルデータ生成部１０８は、そのプロファイルデータ６ＰＤを更新してもよい。更新の処理は、例えば次のように行うことができる。 If the profile data 6PD of the user is already stored in the profile data storage unit 123, the profile data generation unit 108 may update the profile data 6PD. The update process can be performed as follows, for example.

既にプロファイルデータ６ＰＤに示される単語については、プロファイルデータ生成部１０８は、その単語の重要度Ｗを例えば次の式に基づいて算出し直す。 For a word already shown in the profile data 6PD, the profile data generation unit 108 recalculates the importance W of the word, for example, based on the following formula.
Wi = (1 − P) · W0i + P · W1i
ただし、０≦Ｐ≦１、である。Ｗ０ｉは、既存のプロファイルデータ６ＰＤに示される単語ｉの重要度である。Ｗ１ｉは、今回算出された単語ｉの重要度である。 Here, 0 ≤ P ≤ 1. W0i is the importance of the word i shown in the existing profile data 6PD. W1i is the importance of the word i calculated this time.

Ｐの値は、任意に設定することができる。過去の閲覧の際の結果を重要視するのであれば、Ｐの値を小さくすればよいし、今回の閲覧の際の結果を重要視するのであれば、Ｐの値を大きくすればよい。過去の閲覧の際の結果を全く考慮しないのであれば、Ｐを「１」にすればよい。 The value of P can be set arbitrarily. If the result of past browsing is regarded as important, the value of P may be reduced. If the result of current browsing is regarded as important, the value of P may be increased. If the results of past browsing are not considered at all, P may be set to “1”.

既存のペアのペア重要度Ｓも同様の方法で算出し直せばよい。つまり、次の式に基づいて算出し直せばよい。 The pair importance S of an existing pair may be recalculated in the same way, that is, based on the following formula.
Sk = (1 − Q) · S0k + Q · S1k
ただし、０≦Ｑ≦１、である。Ｓ０ｋは、既存のプロファイルデータ６ＰＤに示されるペアｋのペア重要度Ｓである。Ｓ１ｋは、今回算出されたペア重要度Ｓである。 Here, 0 ≤ Q ≤ 1. S0k is the pair importance S of the pair k shown in the existing profile data 6PD. S1k is the pair importance S calculated this time.

そして、プロファイルデータ生成部１０８は、単語ｉの重要度Ｗを、算出し直したＷｉに書き換え、ペアｋのペア重要度Ｓを、算出し直したペア重要度Ｓｋに書き換える。 Then, the profile data generating unit 108 rewrites the importance W of the word i with the recalculated Wi, and rewrites the pair importance S of the pair k with the recalculated pair importance Sk.

上の各式によると、重要度Ｗおよびペア重要度Ｓは、次のように更新される。「南部地方」のこれまでの重要度Ｗが「６」であり、今回算出された重要度Ｗが「５」であり、Ｐが「０．３」である場合は、これらの値に基づいて重要度Ｗを算出し直すと、
Ｗ＝（１−０．３）・６＋０．３・５＝５．７
になる。 According to the formulas above, the importance W and the pair importance S are updated as follows. If the importance W of “southern region” so far is 6, the importance W calculated this time is 5, and P is 0.3, recalculating the importance W from these values gives
W = (1 − 0.3) · 6 + 0.3 · 5 = 5.7.

また、「南部地方」と「特産品」とのペアのペア重要度Ｓが「９」であり、今回算出されたペア重要度Ｓが「１０」であり、Ｑが「０．３」である場合は、これらの値に基づいてペア重要度Ｓを算出し直すと、
Ｓ＝（１−０．３）・９＋０．３・１０＝９．３
になる。 Likewise, if the pair importance S of the pair of “southern region” and “specialty product” so far is 9, the pair importance S calculated this time is 10, and Q is 0.3, recalculating the pair importance S from these values gives
S = (1 − 0.3) · 9 + 0.3 · 10 = 9.3.

なお、初めて抽出された単語については、プロファイルデータ生成部１０８は、今回算出された重要度Ｗおよび新しい単語コードとともにプロファイルデータ６ＰＤに追記する。また、初めて抽出されたペアについても同様に、今回算出されたペア重要度Ｓおよび新しいペアコードとともにプロファイルデータ６ＰＤに追記する。 A word extracted for the first time is added by the profile data generation unit 108 to the profile data 6PD together with the importance W calculated this time and a new word code. Similarly, a pair extracted for the first time is added to the profile data 6PD together with the pair importance S calculated this time and a new pair code.
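The update rule and the handling of newly extracted entries can be sketched together. The function name `update_profile` is hypothetical, and leaving entries that are absent from the current document unchanged is an assumption, since the description does not address that case.

```python
def update_profile(existing, current, weight):
    """Blend newly calculated importances into an existing profile.

    existing: {key: stored importance}, current: {key: importance from this viewing}
    weight:   P (for words) or Q (for pairs), with 0 <= weight <= 1.
    Keys may be words or word pairs; the same formula applies to both.
    """
    if not 0 <= weight <= 1:
        raise ValueError("weight must be between 0 and 1")
    updated = dict(existing)  # entries not seen this time are kept (assumption)
    for key, new_value in current.items():
        if key in updated:
            updated[key] = (1 - weight) * updated[key] + weight * new_value
        else:
            updated[key] = new_value  # first extraction: register as calculated
    return updated


words = update_profile({"南部地方": 6.0}, {"南部地方": 5.0, "気候": 2.0}, 0.3)
pairs = update_profile({("南部地方", "特産品"): 9.0}, {("南部地方", "特産品"): 10.0}, 0.3)
```

Up to floating-point rounding, this reproduces the worked values 5.7 and 9.3. Setting the weight to 1 discards the stored value entirely, as noted for P.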

〔情報の検索の処理〕 [Information search processing]
検索処理部１０９は、情報の検索の処理を、プロファイルデータ６ＰＤを用いて例えば次のように実行する。 The search processing unit 109 executes information search processing using the profile data 6PD, for example, as follows.

ユーザが端末装置２に所定のコマンドを入力すると、検索クライアント２０８（図５参照）が起動する。なお、検索クライアント２０８は、検索処理部１０９による検索のサービスを受けるクライアントのモジュールである。 When the user inputs a predetermined command to the terminal device 2, the search client 208 (see FIG. 5) is activated. The search client 208 is a client module that receives a search service by the search processing unit 109.

検索クライアント２０８は、検索のキーワード（以下、「検索キー」と記載する。）を入力するための入力画面をタッチパネルディスプレイ２０ｄに表示させる。そして、ユーザが入力した検索キーおよびこのユーザのユーザコードを示す検索要求データ６ＱＤをプロファイル管理サーバ１へ送信する。 The search client 208 causes the touch panel display 20d to display an input screen for inputting a search keyword (hereinafter referred to as “search key”). Then, the search request data 6QD indicating the search key input by the user and the user code of the user is transmitted to the profile management server 1.

プロファイル管理サーバ１において、検索処理部１０９は、検索要求データ６ＱＤを取得すると、検索要求データ６ＱＤに示される検索キーを次のように補正する。検索要求データ６ＱＤに示されるユーザコードのプロファイルデータ６ＰＤをプロファイルデータ記憶部１２３から読み出す。そして、検索要求データ６ＱＤに示される検索キー（単語）がプロファイルデータ６ＰＤに示される場合は、この検索キーとのペア重要度Ｓが最も高い単語を選出する。 In the profile management server 1, when the search processing unit 109 acquires the search request data 6QD, it corrects the search key indicated in the search request data 6QD as follows. The profile data 6PD of the user code indicated in the search request data 6QD is read from the profile data storage unit 123. When the search key (word) indicated in the search request data 6QD is indicated in the profile data 6PD, the word having the highest pair importance S with this search key is selected.

例えば、検索キーが「南部地方」であり、「南部地方」と各単語との関連度が図１０に示す通りである場合は、ペア重要度Ｓの最も高い「村おこし」を選出する。 For example, if the search key is “south region” and the degree of association between “south region” and each word is as shown in FIG. 10, “village revitalization” having the highest pair importance S is selected.

検索処理部１０９は、この検索キーと選出した単語との組合せ（例えば、「南部地方」および「村おこし」）を検索クエリとして用いて検索を行う。検索は、プロファイル管理サーバ１に備わっている検索エンジンを用いてもよいし、他の装置（例えば、ドキュメントサーバ３２）の検索エンジンを用いてもよい。また、この検索キーと選出した単語とをＡＮＤで繋いでもよいし、ＯＲで繋いでもよい。つまり、いわゆるＡＮＤ検索を行ってもよいし、いわゆるＯＲ検索を行ってもよい。 The search processing unit 109 performs a search using the combination of the search key and the selected word (for example, “southern region” and “village revitalization”) as the search query. The search may use a search engine provided in the profile management server 1 or a search engine of another device (for example, the document server 32). The search key and the selected word may be joined by AND or by OR. That is, either a so-called AND search or a so-called OR search may be performed.

なお、検索キーがプロファイルデータ６ＰＤに示されない場合は、検索処理部１０９は、従来通り、端末装置２からの検索キーのみを検索クエリとして用いて検索を行う。 When the search key is not indicated in the profile data 6PD, the search processing unit 109 performs a search using only the search key from the terminal device 2 as a search query, as is conventional.

そして、検索処理部１０９は、検索結果を示す検索結果データ６ＡＤを、要求元である端末装置２へ送信する。 Then, the search processing unit 109 transmits search result data 6AD indicating the search result to the terminal device 2 that is the request source.

端末装置２において、検索クライアント２０８は、検索結果データ６ＡＤを受信すると、検索結果をタッチパネルディスプレイ２０ｄに表示させる。 In the terminal device 2, when the search client 208 receives the search result data 6AD, the search client 208 displays the search result on the touch panel display 20d.

または、検索処理部１０９は、検索要求データ６ＱＤに示される検索キー（単語）がプロファイルデータ６ＰＤに示される場合に、この検索キーとのペア重要度Ｓが所定の値以上であり、かつ、重要度Ｗが最も高い単語を選出し補足してもよい。 Alternatively, when the search key (word) indicated in the search request data 6QD appears in the profile data 6PD, the search processing unit 109 may select, as the supplementary word, the word with the highest importance W among the words whose pair importance S with the search key is equal to or greater than a predetermined value.

または、検索処理部１０９は、上述のいずれかの方法で選出した単語を、検索要求データ６ＱＤに示される単語と置き換えて、検索してもよい。 Alternatively, the search processing unit 109 may search by replacing the word selected by any one of the above methods with the word indicated in the search request data 6QD.
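The basic correction of the search key, in which the word with the highest pair importance S is added to the query, can be sketched as follows. The function name, the profile layout (a dictionary keyed by word pairs), the query string format, and the numeric values are all hypothetical, because FIG. 10 is not reproduced in this text.

```python
def expand_query(search_key, pair_s, operator="AND"):
    """Append the word with the highest pair importance S to the search key.

    pair_s: {(word_a, word_b): S} taken from the user's profile data.
    Returns the search key unchanged when it appears in no registered pair.
    """
    partners = {}
    for (word_a, word_b), s in pair_s.items():
        if word_a == search_key:
            partners[word_b] = max(s, partners.get(word_b, s))
        elif word_b == search_key:
            partners[word_a] = max(s, partners.get(word_a, s))
    if not partners:
        return search_key
    best = max(partners, key=partners.get)
    return f"{search_key} {operator} {best}"


# Illustrative profile values only.
profile_pairs = {
    ("南部地方", "特産品"): 9.3,
    ("南部地方", "村おこし"): 12.0,
    ("気候", "特産品"): 4.0,
}
```

The threshold-based variant would first filter `partners` by a minimum S and then pick the remaining word with the highest importance W, and the replacement variant would return `best` alone instead of the combined query.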

図１１は、端末装置２の全体的な処理の流れの例を説明するフローチャートである。図１２は、プロファイル管理サーバ１の全体的な処理の流れの例を説明するフローチャートである。図１３は、重要度調整処理の流れの例を説明するフローチャートである。 FIG. 11 is a flowchart illustrating an example of the overall processing flow of the terminal device 2. FIG. 12 is a flowchart for explaining an example of the overall processing flow of the profile management server 1. FIG. 13 is a flowchart for explaining an example of the flow of importance adjustment processing.

次に、プロファイル管理サーバ１および端末装置２の全体的な処理の流れを、フローチャートを参照しながら説明する。 Next, the overall processing flow of the profile management server 1 and the terminal device 2 will be described with reference to flowcharts.

端末装置２は、カレント文書の閲覧の開始の操作を検知すると（図１１の＃８５１でＹｅｓ）、文書データ６ＤＣをプロファイル管理サーバ１へ送信する（＃８５２）。さらに、所定の時間ごとに興味度ＲＩを検出し記録する（＃８５３）。 When the terminal device 2 detects an operation to start browsing the current document (Yes in # 851 in FIG. 11), the terminal device 2 transmits the document data 6DC to the profile management server 1 (# 852). Further, the degree of interest RI is detected and recorded every predetermined time (# 853).

そして、端末装置２は、カレント文書の閲覧の終了の操作を検知すると（＃８５４でＹｅｓ）、全体興味度ＲＥを算出し（＃８５５）、全体興味度ＲＥおよび各時刻の興味度ＲＩなどを示すデータ、つまり、興味度データ６Ｋを生成し、プロファイル管理サーバ１へ送信する（＃８５６）。 When the terminal device 2 detects an operation for ending the viewing of the current document (Yes in # 854), the terminal device 2 calculates the overall interest level RE (# 855), and displays the overall interest level RE and the interest level RI at each time. Data shown, that is, interest degree data 6K is generated and transmitted to the profile management server 1 (# 856).

または、端末装置２は、検索キーを受け付けると（＃８５７でＹｅｓ）、この検索キーなどを示すデータつまり検索要求データ６ＱＤを生成し、プロファイル管理サーバ１へ送信する（＃８５８）。その後、プロファイル管理サーバ１から検索結果データ６ＡＤを受信すると、検索の結果を表示する（＃８５９）。 Alternatively, when receiving the search key (Yes in # 857), the terminal device 2 generates data indicating the search key or the like, that is, search request data 6QD, and transmits it to the profile management server 1 (# 858). Thereafter, when the search result data 6AD is received from the profile management server 1, the search result is displayed (# 859).

なお、図１１に記載する全体的な処理は、最初から最後まで通しで行われるとは、限らない。端末装置２に対して行われた操作または入力されたデータなどに応じて、一部の処理が適宜、実行される。例えば、コマンド判別手段によって操作またはデータに基づいてコマンドを判別し、コマンドに対応するモジュールに対してディスパッチすればよい。図１２についても、同様である。 Note that the overall processing shown in FIG. 11 is not necessarily performed straight through from beginning to end. Parts of the processing are executed as appropriate, depending on the operation performed on the terminal device 2 or the data input to it. For example, a command determination unit may determine a command from the operation or data and dispatch it to the module corresponding to that command. The same applies to FIG. 12.

一方、プロファイル管理サーバ１は、端末装置２からデータを受信すると（図１２の＃８０１）、そのデータの種類に応じて次の処理を実行する。 On the other hand, when receiving data from the terminal device 2 (# 801 in FIG. 12), the profile management server 1 executes the following processing according to the type of the data.

文書データ６ＤＣを受信した場合は（＃８０２でＹｅｓ）、プロファイル管理サーバ１は、それを記憶しておく（＃８０３）。 When the document data 6DC is received (Yes in # 802), the profile management server 1 stores it (# 803).

または、興味度データ６Ｋを受信した場合は（＃８０４でＹｅｓ）、プロファイル管理サーバ１は、興味度データ６Ｋを記憶し（＃８０５）、プロファイルの生成に必要な部分をカレント文書の中から抽出する処理を実行する（＃８０６）。この処理の手順は、前に図７で説明した通りである。なお、この処理において、全体興味度ＲＥと閾値αとを比較した結果、全体興味度ＲＥが閾値α未満であることが分かった場合は（＃８０７でＮｏ）、プロファイルの生成のための以降の処理（＃８０８〜＃８１２）を中止する。 Alternatively, when the interest level data 6K is received (Yes in # 804), the profile management server 1 stores the interest level data 6K (# 805), and extracts a part necessary for generating the profile from the current document. (# 806). The procedure of this process is as described above with reference to FIG. In this process, if the overall interest level RE is compared with the threshold value α and it is found that the overall interest level RE is less than the threshold value α (No in # 807), the subsequent steps for generating the profile are performed. The processing (# 808 to # 812) is stopped.

プロファイル管理サーバ１は、プロファイルの生成に必要な部分の中から単語を抽出し、それぞれの単語の重み付けを行う（＃８０８）。つまり、それぞれの単語の重要度Ｗを算出する。 The profile management server 1 extracts words from the parts necessary for generating the profile, and weights each word (# 808). That is, the importance W of each word is calculated.

また、プロファイル管理サーバ１は、プロファイルの生成に必要な部分の中から複数の単語のペアを抽出し、それぞれのペアの重み付けを行う（＃８０９）。つまり、それぞれのペアのペア重要度Ｓを算出する。 Further, the profile management server 1 extracts a plurality of word pairs from the portions necessary for generating the profile, and weights each pair (# 809). That is, the pair importance S of each pair is calculated.

重要度Ｗおよびペア重要度Ｓを算出する処理と前後してまたは並行して、プロファイル管理サーバ１は、変化パターンを判定する（＃８１０）。 Before or after or in parallel with the process of calculating the importance W and the pair importance S, the profile management server 1 determines the change pattern (# 810).

判定した変化パターンに応じて、プロファイル管理サーバ１は、ステップ＃８０８および＃８０９でそれぞれ算出した重要度Ｗおよびペア重要度Ｓを調整する処理を行う（＃８１１）。この処理は、図１３に示す手順で行われる。 In accordance with the determined change pattern, the profile management server 1 performs a process of adjusting the importance W and the pair importance S calculated in steps # 808 and # 809, respectively (# 811). This process is performed according to the procedure shown in FIG.

変化パターンが急上昇パターンである場合は（＃７２１でＹｅｓ）、プロファイル管理サーバ１は、各単語の重要度Ｗを、所定の値γａを加えることによって増加させる（＃７２２）。さらに、各ペアのペア重要度Ｓを、所定の値γｂを加えまたは所定の値γｃを掛けることによって増加させる（＃７２３）。 If the change pattern is a sudden increase pattern (Yes in # 721), the profile management server 1 increases the importance W of each word by adding a predetermined value γa (# 722). Further, the pair importance S of each pair is increased by adding a predetermined value γb or multiplying by a predetermined value γc (# 723).

変化パターンが緩上昇パターンである場合は（＃７２４でＹｅｓ）、プロファイル管理サーバ１は、各ペアのペア重要度Ｓを、所定の値γｄを加えまたは所定の値γｅを掛けることによって増加させる（＃７２５）。 When the change pattern is a gradual increase pattern (Yes in # 724), the profile management server 1 increases the pair importance S of each pair by adding a predetermined value γd or multiplying by a predetermined value γe ( # 725).

そして、プロファイル管理サーバ１は、適宜調整した各単語の重要度Ｗおよび各ペアのペア重要度Ｓを用いてプロファイルデータ６ＰＤを生成し記憶する（図１２の＃８１２）。ただし、既にこのユーザのプロファイルデータ６ＰＤがある場合は、登録済の単語の重要度Ｗおよび登録済のペアのペア重要度Ｓを更新する。 Then, the profile management server 1 generates and stores the profile data 6PD using the importance W of each word and the pair importance S of each pair adjusted as appropriate (# 812 in FIG. 12). However, if this user's profile data 6PD already exists, the importance W of the registered word and the pair importance S of the registered pair are updated.

Alternatively, when the search request data 6QD is received (Yes in #813), the profile management server 1 corrects the search key specified by the user based on this user's profile data 6PD (#814) and searches for information using the corrected search key (#815). It then transmits data indicating the search result, that is, the search result data 6AD, to the terminal device 2 (#816).

The profile management server 1 executes the above processing, as appropriate, each time it receives data from the terminal device 2.

According to the present embodiment, the degree of importance to the user of each word included in a document can be obtained more accurately than before.

Several modifications of the present embodiment are described below. Descriptions of points that overlap with the present embodiment are omitted.

[First modification]
FIG. 14 is a diagram illustrating a modification of the functional configuration of the terminal device 2.

According to the embodiment described above, in step #811 the profile management server 1 uniformly determined the importance W of each word and uniformly determined the pair importance S of each pair. However, the user's degree of interest may differ from part to part. Therefore, in order to adjust the importance W and the pair importance S more accurately, the profile management server 1 and the terminal device 2 may be configured as follows.

As shown in FIG. 14, an attention part detection unit 209 is provided in the terminal device 2. From when the user starts viewing the current document until the user finishes, the attention part detection unit 209 detects, at predetermined intervals (for example, every 1 to 10 seconds), the part of the current document to which the user is paying attention (hereinafter referred to as the "attention part"). The detection is desirably performed at the times when the interest level detection unit 203 detects the interest level RI.

The detection can be performed, for example, by the following method. The attention part detection unit 209 detects the attention part based on the direction of the user's pupils (line of sight) detected by the digital camera 20k or by a gaze-tracking head-mounted display. With known techniques, the attention part can be detected in units of words.

<Reference>
http://www.toshiba.co.jp/tech/review/2005/11/60_11pdf/rd01.pdf
http://www.murata.co.jp/zaidan/annual/pdf/k03/2011/a04106.pdf

Alternatively, the attention part detection unit 209 uses a keylogger to detect, as the attention part, a location where the user has drawn an underline or added a handwritten comment. The attention part may also be detected by detecting the location pointed to by an air pen.

Alternatively, when the current document consists of a plurality of pages, the attention part detection unit 209 may detect the currently displayed page as the attention part.

The interest level data providing unit 207 generates interest level data 6L instead of the interest level data 6K and transmits it to the profile management server 1.

In addition to the contents of the interest level data 6K, the interest level data 6L indicates the attention part at each time as detected by the attention part detection unit 209.

As described with reference to FIG. 7, the necessary part extraction unit 103 of the profile management server 1 (see FIG. 3) deletes, in step #704, the parts of the current document that are not necessary for generating the profile. In the embodiment described above, e-mail headers and letterheads were given as examples of unnecessary parts (parts to be deleted). In the first modification, the parts other than the attention parts indicated in the interest level data 6L are deleted as well. This allows the importance W and the pair importance S to be calculated with higher accuracy.
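
The additional deletion described above can be pictured with a minimal sketch. The data layout is an assumption made purely for illustration: the document is represented as a mapping from a hypothetical part identifier to its text, and the attention information in the interest level data 6L is represented as a list of (time, part identifier) records.

```python
def extract_attended_parts(document_parts, attention_log):
    """Keep only the parts of the document that appear in the attention log.

    document_parts: dict mapping a part identifier to its text (assumed layout).
    attention_log:  list of (time, part_id) records, one per detection time
                    (assumed layout of the attention information in 6L).
    """
    attended = {part_id for _time, part_id in attention_log}
    return {
        part_id: text
        for part_id, text in document_parts.items()
        if part_id in attended
    }
```

For example, with parts `p1`, `p2`, and `p3` and a log that mentions only `p1` and `p3`, the part `p2` is dropped before the importance W and the pair importance S are calculated.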

In the embodiment, the change pattern determination unit 106 determined the change pattern for the current document as a whole. Instead, it may determine the change pattern for each attention part based on the interest level data 6L. The importance adjustment unit 107 may then adjust the importance W and the pair importance S for each attention part according to that part's change pattern.

According to the first modification, the words that caught the user's eye can be identified more accurately and given a higher weight. For example, a person who is planning a new confectionery product and is looking for information is likely to notice the words "snack" and "trend" in the phrase "snack trends". According to the first modification, these words can be given a higher weight.

In addition, even when the user becomes interested in the content of a larger chunk of the current document (a sentence, paragraph, section, clause, or the like), the chunk is extracted as the attention part, so the relevance between the words in the attention part, that is, the pair importance S, can be calculated more accurately. For example, consider the attention part "In the southern region, collaborators are being sought for village revitalization using local specialties that take advantage of the climate. The specialties are salt, which takes advantage of the proximity to the sea, and black tea, which takes advantage of the large temperature differences." Even though it contains no single word that arouses interest at a glance, the elements that are important as pairs can be extracted and the pair importance S can be calculated more accurately.

[Second modification]
FIG. 15 is a flowchart illustrating a modification of the flow of the importance adjustment process.

In the embodiment and the first modification described above, the importance adjustment unit 107 of the profile management server 1 uniformly adjusted the importance W of each word in the parts extracted by the necessary part extraction unit 103 and uniformly adjusted the pair importance S of each pair.

However, the words and pairs to be adjusted may instead be selected and adjusted by the method shown in FIG. 15. In this case, the change pattern determination unit 106 determines the change pattern at each time.

The importance adjustment unit 107 selects the attention part at each time when the change pattern is a sudden rise pattern (Yes in #731 of FIG. 15, #732) and increases the importance W of each word belonging to that attention part by adding a predetermined value γa (#733). It further increases the pair importance S of each pair belonging to that attention part by adding a predetermined value γb or multiplying by a predetermined value γc (#734).

The importance adjustment unit 107 also selects the attention part at each time when the change pattern is a gradual rise pattern rather than a sudden rise pattern (Yes in #735, #736) and increases the pair importance S of each pair belonging to that attention part by adding a predetermined value γd or multiplying by a predetermined value γe (#737).
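
The per-time adjustment of steps #731 to #737 might look like the sketch below. Several points are assumptions rather than part of the description: the pattern labels, the default values of γa, γb, and γd, the rule that a pair "belongs" to an attention part when both of its words occur in that part, and the choice to let increases accumulate when the same part is selected at more than one time. Only the additive variant is shown.

```python
def adjust_per_time(samples, word_importance, pair_importance,
                    gamma_a=0.2, gamma_b=0.3, gamma_d=0.1):
    """Adjust W and S only for the attention part at each sampled time.

    samples: list of (pattern, words_in_attention_part), one entry per time.
    pair_importance: dict keyed by (word1, word2) tuples (assumed layout).
    """
    w = dict(word_importance)
    s = dict(pair_importance)
    for pattern, part_words in samples:
        part_words = set(part_words)
        # Assumption: a pair belongs to the part if both words occur in it.
        pairs_in_part = [pair for pair in s if set(pair) <= part_words]
        if pattern == "sudden_rise":            # Yes in #731, #732
            for word in part_words:             # #733
                if word in w:
                    w[word] += gamma_a
            for pair in pairs_in_part:          # #734 (additive variant)
                s[pair] += gamma_b
        elif pattern == "gradual_rise":         # Yes in #735, #736
            for pair in pairs_in_part:          # #737 (additive variant)
                s[pair] += gamma_d
    return w, s
```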

[Third modification]
FIG. 16 is a flowchart illustrating a modification of the flow of the necessary part extraction process.

In the embodiment and the modifications described above, the profile management server 1 calculated the importance W and the pair importance S and generated or updated the profile data 6PD only when the overall interest level RE was equal to or higher than the threshold α (see FIGS. 7 and 12). However, even when the overall interest level RE is below the threshold α, these processes may be performed if a sudden rise pattern or a gradual rise pattern appears as the change pattern.

That is, the necessary part extraction unit 103 performs processing according to the procedure shown in FIG. 16. It compares the overall interest level RE indicated in the interest level data 6K with the threshold α (#741 in FIG. 16). If the overall interest level RE is equal to or higher than the threshold α (Yes in #742), it extracts the necessary parts from the current document (#744 to #746). Even if the overall interest level RE is below the threshold α (No in #742), it extracts the necessary parts from the current document (#744 to #746) when the change pattern determination unit 106 has determined that the change pattern is a sudden rise pattern or a gradual rise pattern (Yes in #743). If the change pattern is determined to be a no-rise pattern (No in #743), the subsequent processing is stopped and the unit waits for the next interest level data 6K to be received. The processing in steps #744 to #746 is the same as that in steps #703 to #705 of FIG. 7.
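
The branch structure of steps #741 to #743 reduces to a small predicate. The sketch below is illustrative only; the function name and the pattern labels are assumptions, and the threshold α is passed in as a parameter because the description does not give its value.

```python
def should_extract(overall_interest, alpha, pattern):
    """Decide whether to run the extraction steps #744 to #746.

    overall_interest: the overall interest level RE from the data 6K.
    alpha:            the threshold α.
    pattern:          assumed label: "sudden_rise", "gradual_rise", or "no_rise".
    """
    if overall_interest >= alpha:                        # Yes in #742
        return True
    return pattern in ("sudden_rise", "gradual_rise")    # #743
```

A document viewed with low overall interest is therefore still processed when the interest rose at some point, and skipped only when neither condition holds.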

[Fourth modification]
FIG. 17 is a diagram illustrating examples of tables that define the relationship between the rate of change and the predetermined values.

In the embodiment and the modifications described above, when the change pattern was a sudden rise pattern, the importance W and the pair importance S were each increased uniformly regardless of the rate of change. However, the amount of the increase may be varied according to the rate of change. For example, a table such as that in FIG. 17(A), representing a relationship in which the predetermined value γa becomes larger as the rate of change becomes larger, is prepared in advance. The predetermined value γa corresponding to the rate of change is then obtained from this table and used to calculate the importance W. A function may be used instead of a table. The same applies to the pair importance S (see FIGS. 17(B) and 17(C)).

Similarly, when the change pattern is a gradual rise pattern, the amount by which the importance W is increased may be varied according to the rate of change using the table shown in FIG. 17(D) or 17(E).
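
A table lookup of this kind can be sketched as a step function over the rate of change. The table contents below are invented for illustration and are not the values of FIG. 17; the only property taken from the description is that γa grows as the rate of change grows.

```python
from bisect import bisect_right

# Hypothetical table: (lower bound of the change rate, gamma_a).
# The actual values of FIG. 17(A) are not reproduced here.
GAMMA_A_TABLE = [(0.0, 0.1), (0.5, 0.2), (1.0, 0.4)]


def lookup_gamma(change_rate, table=GAMMA_A_TABLE):
    """Return the value for the highest row whose lower bound <= change_rate.

    Rates below the first lower bound fall back to the first row.
    """
    bounds = [bound for bound, _value in table]
    index = bisect_right(bounds, change_rate) - 1
    return table[max(index, 0)][1]
```

The same helper could serve γb through γe by passing a different table, or be replaced by a continuous function, which the description also allows.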

[Other modifications]
Words that stand out because a specific format has been applied to them may be given a higher importance than other words. For example, bold words, words whose font size is equal to or larger than a predetermined size, words used in the title of the document, and words used in chapter titles or subheadings may be given a higher importance than other words. Alternatively, words used in the summary section may be given a higher importance than other words. Similarly, when a specific format has been applied to some or all of the words in a pair, the pair may be given a higher importance (pair importance) than other pairs.

Alternatively, the change pattern may be determined by referring not only to the rate of change of the interest level RI but also to the user's actions or degree of concentration (concentration level). For example, the condition for determining that the change pattern at a certain time T1 is a sudden rise pattern may be not only that the rate of change of the interest level RI at time T1 is equal to or higher than the threshold βa, but also that the user performed a predetermined action at time T1 or that the concentration level at time T1 is equal to or higher than a predetermined value. The same applies to the determination of the gradual rise pattern.

This makes it possible to exclude cases in which the interest level rose suddenly for some other reason, such as the user being spoken to by someone, and thus improves the accuracy of the change pattern determination. Actions and the concentration level can be detected by known techniques.
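
The combined condition for the sudden rise pattern can be written as a single boolean expression. In this sketch the function and parameter names are assumptions, and how the action flag and the concentration level are measured is left to the known techniques mentioned above.

```python
def is_sudden_rise(change_rate, beta_a, action_performed,
                   concentration, min_concentration):
    """Return True when the change at time T1 counts as a sudden rise.

    The rate of change of the interest level RI must reach the threshold βa,
    and in addition the user must either have performed a predetermined
    action or have a concentration level at or above a predetermined value.
    """
    rate_ok = change_rate >= beta_a
    corroborated = action_performed or concentration >= min_concentration
    return rate_ok and corroborated
```

A spike in the interest level with no accompanying action and low concentration, as might happen when the user is spoken to, is therefore not classified as a sudden rise.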

The respective roles of the profile management server 1 and the terminal device 2 can be changed as appropriate.

For example, the terminal device 2 may determine the change pattern instead of the profile management server 1. Also, in the embodiment and the modifications described above, the profile management server 1 performed the search as well as the correction of the search query; instead, the result of the correction may be transmitted from the profile management server 1 to the terminal device 2, and the terminal device 2 may perform the search.

Alternatively, the terminal device 2 may perform all of the processing, such as the calculation of the pair importance S and the importance W and the generation of the profile data 6PD.

Alternatively, a higher importance W and pair importance S may be given when the user has had the image forming apparatus 4 print the current document.

In the embodiment and the modifications described above, the importance was adjusted using an algorithm that adds or multiplies by a predetermined value, but other algorithms may be used.

In addition, the configuration of the whole or each part of the document management system 1SY, the profile management server 1, and the terminal device 2, as well as the processing content, the processing order, the data structure, and the like, can be changed as appropriate in accordance with the spirit of the present invention.

1 Profile management server (word importance calculation device)
102 Interest level data acquisition unit (interest level input means)
104 Word importance calculation unit (importance calculation means)
105 Pair importance calculation unit (pair importance calculation means)
106 Change pattern determination unit (change pattern detection means)
107 Importance adjustment unit (importance calculation means, pair importance calculation means)
109 Search processing unit (search processing means)
123 Profile data storage unit (profile data storage means)
6PD Profile data
S Pair importance
W Importance

Claims

A word importance calculation device comprising:
interest level input means for inputting an interest level, which is the degree of a user's interest in a document at each time while the user is viewing the document;
change pattern detection means for detecting a pattern of change in the input interest level; and
importance calculation means for calculating, based on the detected pattern, an importance, which is the degree of importance to the user, for each word included in the document.

The word importance calculation device according to claim 1, wherein
the change pattern detection means detects, as the pattern, at least one of a first pattern, in which the rate of change of the interest level over a time of a first length is equal to or higher than a first rate, and a second pattern, in which the rate of change of the interest level over a time of a second length longer than the first length is equal to or higher than a second rate smaller than the first rate, and
the importance calculation means calculates, when the first pattern is detected, a higher value as the importance than when only the second pattern is detected.

The word importance calculation device according to claim 1 or claim 2, wherein
the importance calculation means calculates the importance of each of the words according to the frequency with which the word appears in the document.

The word importance calculation device according to any one of claims 1 to 3, wherein
the importance calculation means calculates the importance of each of the words when an intermediate value of the interest levels at the respective times is equal to or greater than a predetermined value, and otherwise does not calculate it.

The word importance calculation device according to any one of claims 1 to 4, wherein
the importance calculation means calculates, as the importance of a word included in a part of the document to which the user paid attention, a higher value than the importance of the other words.

The word importance calculation device according to any one of claims 1 to 5, wherein
the importance calculation means calculates, as the importance of a word expressed in a predetermined format, a higher value than the importance of the other words.

The word importance calculation device according to any one of claims 1 to 5, wherein
the importance calculation means calculates, as the importance of a word appearing in a predetermined section of the document, a higher value than the importance of the other words.

The word importance calculation device according to any one of claims 1 to 7, further comprising
pair importance calculation means for calculating, based on the detected pattern, a pair importance, which is the degree of importance to the user, for each pair consisting of a plurality of words included in the document.

The word importance calculation device according to claim 8, wherein
the pair importance calculation means calculates, when the first pattern is detected, a higher value as the pair importance than when only the second pattern is detected.

The word importance calculation device according to claim 8 or claim 9, wherein
the pair importance calculation means calculates, as the pair importance of a pair included in a part of the document to which the user paid attention, a higher value than the pair importance of the other pairs.

The word importance calculation device according to any one of claims 8 to 10, further comprising:
profile data storage means for storing profile data indicating the importance of each word and the pair importance of each pair; and
search processing means for correcting a search query specified by the user based on the profile data and searching for information based on the corrected search query.

A word importance calculation method in which a computer executes:
a process of receiving input of an interest level, which is the degree of a user's interest in a document at each time while the user is viewing the document;
a process of detecting a pattern of change in the input interest level; and
a process of calculating, based on the detected pattern, an importance, which is the degree of importance to the user, for each word included in the document.

A computer program that causes a computer to execute:
a process of receiving input of an interest level, which is the degree of a user's interest in a document at each time while the user is viewing the document;
a process of detecting a pattern of change in the input interest level; and
a process of calculating, based on the detected pattern, an importance, which is the degree of importance to the user, for each word included in the document.