JP2019164515A

JP2019164515A - Device for extracting portions to be improved in document, method for extracting portions to be improved in document and program

Info

Publication number: JP2019164515A
Application number: JP2018051512A
Authority: JP
Inventors: 宏和秋葉; Hirokazu Akiba
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2018-03-19
Filing date: 2018-03-19
Publication date: 2019-09-26

Abstract

To provide a device for extracting portions to be improved in a document, a method for extracting portions to be improved in a document, and a program, each capable of improving precision in extracting portions to be improved in a document. SOLUTION: A device 100 for extracting portions to be improved in a document includes: an information obtaining unit 101 that obtains visual-line record information for identifying a trajectory of a visual line when a user views a document; a standard reading speed calculating unit 102 that calculates a standard reading speed when the user views the document on the basis of the obtained visual-line record information; and an extracting unit 103 that extracts, on the basis of the obtained visual-line record information, portions to be improved in the document, which are portions where the moving speed of the visual line of the user decreases relative to the standard reading speed in the document. SELECTED DRAWING: Figure 1

Description

The present invention relates to a document improvement portion extraction apparatus and a document improvement portion extraction method for extracting hard-to-read portions, hard-to-understand portions, and the like of a document, and further relates to a program for realizing them.

Conventionally, a user who has purchased a product may read the instruction manual and terms of use in order to understand how to use the product. Likewise, a user of a service may read the manual and terms of use of that service in order to understand how to use it. In the following, these users are collectively referred to as the "user" or the "user who reads a document".

However, documents such as manuals and terms of use are not always written in text that is easy for users to read, and a user may end up using a service or product without fully understanding the contents of the document. In such a case, the user may not be able to fully enjoy the value of the service or product, or may suffer damage due to misuse.

In addition, users in many cases find it burdensome to ask the provider of the service or the provider of the product (hereinafter collectively referred to as the "provider") about the hard-to-read and hard-to-understand portions of the instruction manual and terms of use. For this reason, users often do not take the step of asking the provider about these portions, and it is therefore difficult for the provider to learn which portions of the instruction manual and terms of use are hard to read or hard to understand.

To deal with this problem, Patent Documents 1 and 2, for example, disclose techniques for presenting the portions of a document to be improved. Specifically, Patent Document 1 discloses a system that uses line-of-sight information identifying the line of sight of a user reading a document to evaluate whether the document is structured so that it conveys information efficiently.

Patent Document 2 discloses a system that presents the portions of a document to be improved based on a user's behavior history. Specifically, when a user who is reading through a document encounters a hard-to-understand portion and searches for information about that portion at that moment, the search is recorded in the user's behavior history. When a search by the user is recorded in the behavior history, the system disclosed in Patent Document 2 uses this record to identify the portion that the user found hard to understand and presents it as a portion of the document to be improved.

Patent Document 1: JP 2002-149633 A
Patent Document 2: JP 2017-215845 A

However, the system disclosed in Patent Document 1 does not take into account that the speed at which the line of sight moves, that is, the speed at which a document is read, differs from user to user. As a result, a discrepancy arises between the evaluation result of a document obtained using line-of-sight information from a fast reader and the evaluation result obtained using line-of-sight information from a slow reader. In other words, the system disclosed in Patent Document 1 has the problem that it is difficult to increase the accuracy of identifying the portions of a document to be improved.

The system disclosed in Patent Document 2, in turn, presupposes that the user performs a search at the moment of encountering a hard-to-understand portion. Consequently, if the user sends an inquiry e-mail instead of searching, or consults reference literature on their own, it is difficult for the system to identify the portion that the user found hard to understand. In other words, the system disclosed in Patent Document 2 also has the problem that it is difficult to increase the accuracy of identifying the portions of a document to be improved.

An example of an object of the present invention is to provide a document improvement portion extraction apparatus, a document improvement portion extraction method, and a program that solve the above problems and can improve the accuracy of extracting portions to be improved in a document.

In order to achieve the above object, a document improvement portion output apparatus according to one aspect of the present invention includes:
an information acquisition unit that acquires line-of-sight history information identifying the trajectory of a user's line of sight when the user reads a document;
a standard reading speed calculation unit that calculates, based on the acquired line-of-sight history information, a standard reading speed at which the user read the document; and
an improvement portion extraction unit that extracts, based on the acquired line-of-sight history information, a portion of the document where the movement speed of the user's line of sight is lower than the standard reading speed as a portion of the document to be improved.

In order to achieve the above object, a document improvement portion output method according to one aspect of the present invention includes:
(a) a step of acquiring line-of-sight history information identifying the trajectory of a user's line of sight when the user reads a document;
(b) a step of calculating, based on the acquired line-of-sight history information, a standard reading speed at which the user read the document; and
(c) a step of extracting, based on the acquired line-of-sight history information, a portion of the document where the movement speed of the user's line of sight is lower than the standard reading speed as a portion of the document to be improved.

Furthermore, in order to achieve the above object, a program according to one aspect of the present invention causes a computer to execute:
(a) a step of acquiring line-of-sight history information identifying the trajectory of a user's line of sight when the user reads a document;
(b) a step of calculating, based on the acquired line-of-sight history information, a standard reading speed at which the user read the document; and
(c) a step of extracting, based on the acquired line-of-sight history information, a portion of the document where the movement speed of the user's line of sight is lower than the standard reading speed as a portion of the document to be improved.

As described above, according to the present invention, the accuracy of extracting portions to be improved in a document can be improved.

FIG. 1 is a block diagram showing a schematic configuration of a document improvement portion extraction apparatus according to an embodiment of the present invention.
FIG. 2 is a block diagram showing a specific configuration of the document improvement portion extraction apparatus according to the embodiment of the present invention.
FIG. 3 is a diagram showing an example of line-of-sight history information stored in the line-of-sight history database in the present embodiment.
FIG. 4 is a diagram showing an example of a document read by a user in the embodiment of the present invention.
FIG. 5 is a diagram showing an example of areas set in the document shown in FIG. 4.
FIG. 6 is a diagram showing an example of line-of-sight positions detected on the document shown in FIG. 4.
FIG. 7 is a diagram showing an example of standard reading speed information stored in the reading speed database in the embodiment of the present invention.
FIG. 8 is a diagram showing an example of reading speed decrease occurrence location information stored in the improvement portion candidate database in the embodiment of the present invention.
FIG. 9 is a diagram showing an example of read-back occurrence location information stored in the improvement portion candidate database in the embodiment of the present invention.
FIG. 10 is a flowchart showing the operation of the document improvement portion extraction apparatus during the improvement portion extraction process in the embodiment of the present invention.
FIG. 11 is a flowchart showing the operation of the document improvement portion extraction apparatus during the improvement portion presentation process in the embodiment of the present invention.
FIG. 12 is a block diagram showing an example of a computer that implements the document improvement portion extraction apparatus according to the embodiment of the present invention.

(Embodiment)
Hereinafter, a document improvement portion extraction apparatus, a document improvement portion extraction method, and a program according to an embodiment of the present invention will be described with reference to FIGS. 1 to 12.

[Apparatus configuration]
First, a schematic configuration of the document improvement portion extraction apparatus according to the present embodiment will be described with reference to FIG. 1. FIG. 1 is a block diagram showing a schematic configuration of the document improvement portion extraction apparatus according to the embodiment of the present invention.

The document improvement portion extraction apparatus 100 according to the present embodiment, shown in FIG. 1, takes as input information on the movement of the line of sight of a user reading a document, and extracts the portions of the document that are hard for the user to read or hard to understand as portions of the document to be improved.

As shown in FIG. 1, the document improvement portion extraction apparatus 100 includes an information acquisition unit 101, a standard reading speed calculation unit 102, and an improvement portion extraction unit 103. Of these, the information acquisition unit 101 acquires line-of-sight history information identifying the trajectory of the user's line of sight when the user reads a document.

The standard reading speed calculation unit 102 calculates, based on the acquired line-of-sight history information, the standard reading speed at which the user read the document. The improvement portion extraction unit 103 extracts, based on the acquired line-of-sight history information, a portion of the document where the movement speed of the user's line of sight is lower than the standard reading speed as a portion of the document to be improved.

As described above, in the present embodiment, portions that are hard for a user to read or understand are extracted based on the standard reading speed specific to that user. According to the present embodiment, these portions can therefore be extracted without depending on the actions taken by the user and in a way that accommodates differences in reading speed among users. As a result, the accuracy of extracting portions to be improved in a document is improved.

Next, the configuration of the document improvement portion extraction apparatus according to the present embodiment will be described more specifically with reference to FIGS. 2 to 9. FIG. 2 is a block diagram showing a specific configuration of the document improvement portion extraction apparatus according to the embodiment of the present invention.

As shown in FIG. 2, in the present embodiment, the document improvement portion extraction apparatus 100 includes an improvement portion presentation unit 107 in addition to the information acquisition unit 101, the standard reading speed calculation unit 102, and the improvement portion extraction unit 103 described above.

In the document improvement portion extraction apparatus 100, the improvement portion extraction unit 103 includes a reading speed decrease portion extraction unit 104, a read-back portion extraction unit 105, and a presentation portion extraction unit 106.

Furthermore, as shown in FIG. 2, the document improvement portion extraction apparatus 100 is connected, via a network or the like, to a line-of-sight history database 111, a reading speed database 112, and an improvement portion candidate database 113. Some or all of these databases may be built inside the document improvement portion extraction apparatus 100. In the following description, "database" is abbreviated as "DB".

Of the databases to which the document improvement portion extraction apparatus 100 is connected, the line-of-sight history DB 111 stores the line-of-sight history information collected by a line-of-sight history information collection apparatus 200.

The line-of-sight history information collection apparatus 200 is an apparatus that, when a user views a document, extracts the trajectory of the user's line of sight and generates line-of-sight history information identifying the extracted trajectory. A specific example of the line-of-sight history information collection apparatus 200 is the line-of-sight detection apparatus disclosed in Patent Document 1. The line-of-sight history information collection apparatus 200 detects the position of the line of sight by irradiating the user's eyes with infrared light and identifying the direction in which the infrared light is reflected.

The line-of-sight history information collection apparatus 200 detects the position of the user's line of sight at fixed time intervals, generates line-of-sight history information 121 from the detected positions, and stores the generated line-of-sight history information 121 in the line-of-sight history DB 111.

FIG. 3 is a diagram showing an example of line-of-sight history information stored in the line-of-sight history database in the present embodiment. As shown in FIG. 3, the line-of-sight history information consists of the user ID of the user from whom the line of sight was acquired, the document ID of the document read by the user, and the line-of-sight history.

In FIG. 3, the sequence of numbers registered as the line-of-sight history indicates the position of the line of sight at each fixed time interval. This point will be described with reference to FIGS. 4 to 6. FIG. 4 is a diagram showing an example of a document read by a user in the embodiment of the present invention. FIG. 5 is a diagram showing an example of areas set in the document shown in FIG. 4. FIG. 6 is a diagram showing an example of line-of-sight positions detected on the document shown in FIG. 4.

Suppose, for example, that a user has read the document shown in FIG. 4. In this case, areas that divide the region of the document into multiple parts are set for the document shown in FIG. 4, and each area is assigned a number. The number of characters contained in each area is constant. For ease of explanation, the numbers assigned to some of the areas are omitted from FIG. 4.

When the user reads the document shown in FIG. 4, the line-of-sight history information collection apparatus 200 detects the position of the line of sight at fixed time intervals, as shown in FIG. 6, and records the number of the area corresponding to each detected position as the line-of-sight history.
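The recording step described above can be sketched as follows. This is a minimal illustration, not the apparatus's actual implementation: it assumes the document is treated as a linear stream of characters, that each gaze sample has already been resolved to a 0-based character offset, and that areas are numbered from 1. The names `area_number`, `record_gaze_history`, and `chars_per_area` are hypothetical.

```python
def area_number(char_offset: int, chars_per_area: int) -> int:
    """Return the 1-based number of the area containing a 0-based character offset."""
    if char_offset < 0 or chars_per_area <= 0:
        raise ValueError("offset must be >= 0 and chars_per_area must be > 0")
    # Every area holds the same number of characters, so integer division suffices.
    return char_offset // chars_per_area + 1


def record_gaze_history(sample_offsets, chars_per_area):
    """Convert gaze samples taken at a fixed interval into a list of area numbers."""
    return [area_number(offset, chars_per_area) for offset in sample_offsets]


# Seven samples taken at a fixed interval; the sixth one jumps back to area 2.
history = record_gaze_history([0, 7, 12, 12, 25, 18, 31], chars_per_area=10)
print(history)  # [1, 1, 2, 2, 3, 2, 4]
```

Because the samples are taken at a fixed interval, an area that appears several times in a row indicates slow reading, while an area that reappears later indicates that the reader went back.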

In the present embodiment, the information acquisition unit 101 acquires the line-of-sight history information 121 from the line-of-sight history DB 111 and inputs the acquired line-of-sight history information 121 to the standard reading speed calculation unit 102 and the improvement portion extraction unit 103.

The standard reading speed calculation unit 102 calculates the user's standard reading speed based on the line-of-sight history information 121 acquired by the information acquisition unit 101. Specifically, the standard reading speed calculation unit 102, for example, divides the number of characters in the document read by the user by the time from the start to the end of reading the document, and uses the resulting value as the standard reading speed.
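As a rough sketch of this calculation, the following assumes that the reading time can be approximated as the number of recorded samples multiplied by the detection interval; the text only states that the character count is divided by the time from the start to the end of reading. The function name and parameters are hypothetical.

```python
from typing import Sequence


def standard_reading_speed(total_chars: int,
                           gaze_history: Sequence[int],
                           interval_s: float) -> float:
    """Return characters per second over the whole reading session."""
    if not gaze_history or interval_s <= 0:
        raise ValueError("need at least one sample and a positive interval")
    # Assumption: each recorded sample stands for one detection interval.
    reading_time_s = len(gaze_history) * interval_s
    return total_chars / reading_time_s


# 600 characters read over 120 samples taken every 0.5 s (60 s in total).
speed = standard_reading_speed(600, [1] * 120, 0.5)
print(speed)  # 10.0
```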

The standard reading speed calculation unit 102 also passes the calculated standard reading speed to the reading speed DB 112 as a new record. The reading speed DB 112 stores the added record as standard reading speed information 122.

FIG. 7 is a diagram showing an example of standard reading speed information stored in the reading speed database in the embodiment of the present invention. As shown in FIG. 7, the standard reading speed of each user is registered in the reading speed DB 112 as the standard reading speed information 122.

The reading speed decrease portion extraction unit 104 first acquires the standard reading speed information 122 from the reading speed DB 112. Then, based on the line-of-sight history information 121 acquired by the information acquisition unit 101 and the standard reading speed information 122, the reading speed decrease portion extraction unit 104 extracts a portion of the document where the movement speed of the user's line of sight is greatly reduced compared with that user's standard reading speed as a portion of the document to be improved.

Specifically, the reading speed decrease portion extraction unit 104 first calculates a reference speed by dividing the user's standard reading speed, identified from the standard reading speed information 122, by a threshold. The reading speed decrease portion extraction unit 104 then identifies each portion (area) where the reading speed is lower than the reference speed and treats the identified portion (area) as a portion of the document to be improved.

In this case, the areas recorded in the line-of-sight history of the line-of-sight history information 121 represent the positions of the line of sight detected at fixed time intervals, and the number of characters contained in each area is constant (see FIG. 3). The reading speed in each area can therefore be obtained from Equation 1 below.

(Equation 1)
Reading speed = (area number - next registered area number) × number of characters per area / line-of-sight detection interval
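The threshold test and the per-area speed can be sketched together as follows. This is an illustration under stated assumptions rather than the apparatus's actual logic: the difference is taken as the next registered area number minus the current one, so that forward reading yields a positive speed, and backward moves are skipped here on the assumption that they are handled as read-backs. The names `area_speeds` and `slow_areas` are hypothetical.

```python
def area_speeds(history, chars_per_area, interval_s):
    """Return (area, speed) for each pair of consecutive gaze samples."""
    return [
        # Assumed sign convention: (next - current), so forward reading is positive.
        (cur, (nxt - cur) * chars_per_area / interval_s)
        for cur, nxt in zip(history, history[1:])
    ]


def slow_areas(history, chars_per_area, interval_s, standard_speed, threshold):
    """Return areas whose reading speed falls below standard_speed / threshold."""
    reference_speed = standard_speed / threshold
    slow = []
    for area, speed in area_speeds(history, chars_per_area, interval_s):
        # Assumption: negative speeds (backward moves) are not counted as slow reading.
        if 0 <= speed < reference_speed and area not in slow:
            slow.append(area)
    return slow


# The gaze stays on area 3 for three consecutive samples.
slow = slow_areas([1, 2, 3, 3, 3, 4, 5], chars_per_area=10, interval_s=1.0,
                  standard_speed=10.0, threshold=2.0)
print(slow)  # [3]
```

With a standard reading speed of 10 characters per second and a threshold of 2, the reference speed is 5 characters per second, so only the area where the gaze did not advance is flagged.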

The reading speed decrease portion extraction unit 104 also passes each identified area to the improvement portion candidate DB 113 as a new record. The improvement portion candidate DB 113 stores the added record as reading speed decrease occurrence location information 123.

FIG. 8 is a diagram showing an example of reading speed decrease occurrence location information stored in the improvement portion candidate database in the embodiment of the present invention. As shown in FIG. 8, the areas of each document where the reading speed is greatly reduced are registered in the improvement portion candidate DB 113 as the reading speed decrease occurrence location information 123.

The read-back portion extraction unit 105 extracts, based on the line-of-sight history information 121 acquired by the information acquisition unit 101, a portion of the document that the user has gone back and reread as a portion of the document to be improved, and further identifies the number of times the user has reread the extracted portion.

Specifically, the read-back portion extraction unit 105 identifies, from the line-of-sight history of the line-of-sight history information 121, the areas that are registered two or more times for the same document. Among the identified areas, it takes the area with the smallest number as the start point of the read-back and the area with the largest number as the end point. The read-back portion extraction unit 105 then extracts the range between the start-point area and the end-point area as a portion of the document to be improved.

The read-back portion extraction unit 105 also counts, from the line-of-sight history of the line-of-sight history information 121, the number of times the line of sight travels back and forth between the start-point area and the end-point area, and uses this count as the number of times the user reread the portion.

A threshold on the number of areas between the start-point area and the end-point area may be set in the read-back portion extraction unit 105 in advance, so that a jump of the line of sight to a far-removed passage is not counted as a read-back. In this case, when the number of areas between the start-point area and the end-point area exceeds the threshold, the read-back portion extraction unit 105 excludes these areas from extraction.
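The read-back extraction described above might look like the following sketch. Several details are assumptions made for illustration: consecutive repeats of the same area are collapsed first, so that merely dwelling on an area is not treated as rereading; the number of read-backs is counted as the number of backward moves inside the extracted range; and the span compared with the threshold is the end-point number minus the start-point number. The name `find_read_back` and the parameter `max_span` are hypothetical.

```python
from collections import Counter
from itertools import groupby


def find_read_back(history, max_span=None):
    """Return (start_area, end_area, read_back_count), or None if nothing qualifies."""
    # Assumption: collapse runs of the same area so that dwelling is not a revisit.
    visits = [area for area, _ in groupby(history)]
    counts = Counter(visits)
    repeated = [area for area, n in counts.items() if n >= 2]
    if not repeated:
        return None
    start, end = min(repeated), max(repeated)
    # Optional threshold: ignore jumps back over too many areas.
    if max_span is not None and end - start > max_span:
        return None
    # Assumption: one read-back per backward move that stays inside [start, end].
    times = sum(
        1 for cur, nxt in zip(visits, visits[1:])
        if nxt < cur and start <= nxt and cur <= end
    )
    return start, end, times


print(find_read_back([1, 2, 3, 4, 2, 3, 4, 5]))  # (2, 4, 1)
```

In this example areas 2 to 4 are each visited twice, so the range from area 2 to area 4 is extracted with one read-back.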

The read-back portion extraction unit 105 also passes the start-point area and the end-point area identified for each document to the improvement portion candidate DB 113 as a new record. The improvement portion candidate DB 113 stores the added record as read-back occurrence location information 124.

FIG. 9 is a diagram showing an example of read-back occurrence location information stored in the improvement portion candidate database in the embodiment of the present invention. As shown in FIG. 9, the read-back occurrence locations and the number of read-backs for each document are registered in the improvement portion candidate DB 113 as the read-back occurrence location information 124.

When a provider designates a document whose portions to be improved the provider wants to know and requests that those portions be presented, the presentation portion extraction unit 106 accepts the request. The presentation portion extraction unit 106 then identifies the portions to be improved that have been extracted for the designated document, based on the reading speed decrease occurrence location information 123 and the read-back occurrence location information 124 stored in the improvement portion candidate DB 113. Specifically, the presentation portion extraction unit 106 extracts the locations in the designated document where reading speed decreases and read-backs occur frequently.

The presentation portion extraction unit 106 does not need to extract every portion to be improved that has been extracted for the designated document. It may extract only those portions for which the occurrence rate of reading speed decrease, the occurrence rate of read-back, and the number of read-backs exceed predetermined thresholds.

The occurrence rate of reading speed decrease can be calculated by dividing the number of records in the reading speed decrease occurrence location information 123 in which the location in question is registered as a reading speed decrease occurrence location by the number of reading speed decrease occurrence location records for the entire document containing that location.

Similarly, the occurrence rate of read-back can be calculated by dividing the number of records in the read-back occurrence location information 124 in which the location in question is registered as a read-back occurrence location by the number of read-back occurrence location records for the entire document containing that location.
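Both occurrence rates follow the same pattern, which the following sketch illustrates. It assumes, for illustration only, that each record is a (document ID, location) pair, which is a simplification of the records shown in FIGS. 8 and 9. The names `occurrence_rates`, `locations_to_present`, and `min_rate` are hypothetical.

```python
from collections import Counter


def occurrence_rates(records, document_id):
    """Return {location: rate} for one document.

    rate = records registering the location / all records for the document.
    """
    locations = [loc for doc, loc in records if doc == document_id]
    total = len(locations)
    if total == 0:
        return {}
    return {loc: n / total for loc, n in Counter(locations).items()}


def locations_to_present(records, document_id, min_rate):
    """Return the locations whose occurrence rate exceeds the threshold."""
    rates = occurrence_rates(records, document_id)
    return sorted(loc for loc, rate in rates.items() if rate > min_rate)


# Hypothetical records gathered from several readers of documents D1 and D2.
records = [("D1", 3), ("D1", 3), ("D1", 7), ("D1", 3), ("D2", 3)]
print(occurrence_rates(records, "D1"))           # {3: 0.75, 7: 0.25}
print(locations_to_present(records, "D1", 0.5))  # [3]
```

The same functions can be applied to the reading speed decrease records and to the read-back records separately, with a separate threshold for each.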

The presentation portion extraction unit 106 also passes the portions to be improved that it extracted for the designated document to the improvement portion presentation unit 107 and instructs the improvement portion presentation unit 107 to present them. In response, the improvement portion presentation unit 107 creates information identifying the portions to be improved indicated by the presentation portion extraction unit 106 and transmits the created information to the terminal device of the provider who designated the document. As a result, the portions of the document to be improved are presented on the screen of the provider's terminal device.

[Apparatus operation]
Next, the operation of the document improvement portion extraction apparatus 100 according to the present embodiment will be described with reference to FIGS. 10 and 11. In the present embodiment, the document improvement portion extraction method is carried out by operating the document improvement portion extraction apparatus 100. The following description of the operation of the document improvement portion extraction apparatus therefore also serves as the description of the document improvement portion extraction method in the present embodiment.

First, the process of extracting improvement locations from a document read by a user will be described with reference to FIG. 10. FIG. 10 is a flowchart showing the operation of the document improvement location extraction device in the improvement location extraction process according to the embodiment of the present invention.

As a premise, when a user reads a document, the line-of-sight history information collection device 200 extracts the trajectory of the user's line of sight, generates information that identifies the extracted trajectory, and registers the generated information in the line-of-sight history DB 111 as line-of-sight history information 121. The line-of-sight history information collection device 200 may generate line-of-sight history information 121 for a plurality of users and a plurality of documents.

First, as shown in FIG. 10, the information acquisition unit 101 acquires the line-of-sight history information 121 from the line-of-sight history DB 111 (step A1). After executing step A1, the information acquisition unit 101 inputs the acquired line-of-sight history information 121 to the standard reading speed calculation unit 102 and the improvement location extraction unit 103.

Step A1 may be triggered by the registration of new line-of-sight history information 121 in the line-of-sight history DB 111, or it may be executed after a certain amount of time has elapsed since the registration.

Next, the standard reading speed calculation unit 102 calculates the standard reading speed at which the user read the document, based on the line-of-sight history information 121 acquired in step A1 (step A2).
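Step A2 can be sketched using the formulation given in Appendix 3, where the standard reading speed is derived from the number of characters in the document and the time from the start to the end of reading. The function name, the use of seconds, and the characters-per-second unit below are assumptions for illustration only.

```python
def standard_reading_speed(char_count, start_time_s, end_time_s):
    """Characters per second over one full reading of a document.

    char_count: number of characters in the document.
    start_time_s, end_time_s: reading start and end times in seconds
    (hypothetical units; the source does not fix a unit).
    """
    elapsed = end_time_s - start_time_s
    if elapsed <= 0:
        raise ValueError("reading must end after it starts")
    return char_count / elapsed


# A 1200-character document read in 240 seconds.
speed = standard_reading_speed(1200, 0.0, 240.0)
```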

After executing step A2, the standard reading speed calculation unit 102 creates a new record that associates the user ID included in the line-of-sight history information 121 with the calculated standard reading speed, and registers this record in the reading speed DB 112 as standard reading speed information 122 (step A3).

After step A3 is executed, the reading speed reduction location extraction unit 104 extracts the user information (user ID) from the line-of-sight history information 121 acquired in step A1 and, based on that user information, acquires the standard reading speed information 122 of the corresponding user from the reading speed DB 112 (step A4).

Next, based on the line-of-sight history information 121 and the standard reading speed information 122, the reading speed reduction location extraction unit 104 extracts locations where the movement speed of the user's line of sight is significantly lower than the user's standard reading speed (reading speed reduction locations) and treats them as improvement locations (step A5).
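Step A5 can be sketched along the lines of Appendix 4, which derives a reference speed from the standard reading speed and flags locations read more slowly than that reference. This is an illustrative sketch only: the segment layout, the function name, and the 0.5 scaling factor are assumptions, since the text here does not state how the reference speed is derived.

```python
def slow_locations(segments, standard_speed, ratio=0.5):
    """Return locations whose reading speed falls below a reference speed.

    segments: list of (location, char_count, duration_s) tuples
    (hypothetical layout).
    ratio: assumed factor applied to the standard reading speed to obtain
    the reference speed; 0.5 is an arbitrary illustrative value.
    """
    reference_speed = standard_speed * ratio
    slow = []
    for location, char_count, duration_s in segments:
        if duration_s <= 0:
            continue  # skip segments with no usable timing
        if char_count / duration_s < reference_speed:
            slow.append(location)
    return slow


# Segment speeds are 5.0, 2.0 and 4.0 characters per second.
segments = [("p1", 100, 20.0), ("p2", 100, 50.0), ("p3", 100, 25.0)]
result = slow_locations(segments, standard_speed=5.0)
```

Because the reference speed scales with each user's own standard reading speed, a slow reader and a fast reader are judged against different baselines, which matches the per-user comparison described in the step.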

Next, the reading speed reduction location extraction unit 104 creates a new record that associates the extracted reading speed reduction location with the document ID included in the line-of-sight history information 121, and registers this record in the improvement location candidate DB 113 as reading speed reduction occurrence location information 123 (step A6).

Next, based on the line-of-sight history information 121 acquired in step A1, the read-back location extraction unit 105 extracts the locations that the user has read back (locations where a read-back has occurred) and the number of read-backs at each of those locations (step A7).
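One way to picture step A7 is to treat the gaze trajectory as the ordered sequence of locations the line of sight visited and to count each backward jump. This is a sketch under stated assumptions: the integer location indices, the function name, and the choice to attribute each read-back to the location the gaze returns to are all illustrative and not taken from the source.

```python
from collections import Counter


def read_back_counts(visited):
    """Count read-backs per location from an ordered list of visited indices.

    visited: location indices (for example, line numbers) in the order the
    gaze reached them (hypothetical representation of the trajectory).
    A read-back is counted when the gaze moves to an earlier index, and it
    is attributed to the location the gaze returns to (an assumption).
    """
    counts = Counter()
    for previous, current in zip(visited, visited[1:]):
        if current < previous:
            counts[current] += 1
    return dict(counts)


# The reader returns to location 2 twice: once from 3 and once from 4.
visited = [1, 2, 3, 2, 3, 4, 2, 3, 4, 5]
counts = read_back_counts(visited)
```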

Next, the read-back location extraction unit 105 creates a new record that associates the read-back occurrence location with the number of read-backs, and registers this record in the improvement location candidate DB 113 as read-back occurrence location information 124 (step A8).

As described above, steps A1 to A8 are executed again whenever new line-of-sight history information 121 is registered in the line-of-sight history DB 111, and new reading speed reduction occurrence location information 123 and read-back occurrence location information 124 are registered.

Next, the process of presenting improvement locations when a document is specified and presentation of its improvement locations is requested will be described with reference to FIG. 11. FIG. 11 is a flowchart showing the operation of the document improvement location extraction device in the improvement location presentation process according to the embodiment of the present invention.

First, when a provider specifies, via a terminal device or the like, a document for which improvement locations are to be presented and requests their presentation, the presentation location extraction unit 106 accepts the request (step B1). Specifically, when the provider specifies, for example, the document ID shown in FIG. 2, the presentation location extraction unit 106 accepts the specified document ID.

Next, based on the document ID of the document accepted in step B1, the presentation location extraction unit 106 acquires the reading speed reduction occurrence location information 123 and the read-back occurrence location information 124 of the corresponding document from the improvement location candidate DB 113 (step B2).

Next, from the reading speed reduction occurrence location information 123 and the read-back occurrence location information 124 acquired in step B2, the presentation location extraction unit 106 extracts only those improvement locations for which the occurrence rate of reading speed reduction, the read-back occurrence rate, and the number of read-backs exceed the thresholds (step B3).
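The threshold filtering in step B3 can be sketched as follows. The text does not spell out whether a location must exceed all three thresholds or only one of them, so this illustrative sketch takes the combination rule as a parameter. The dictionary keys, the threshold values, and the function name are assumptions.

```python
def select_presented_locations(candidates, thresholds, combine=all):
    """Return the locations whose statistics exceed the given thresholds.

    candidates: {location: {metric_name: value}} (hypothetical layout).
    thresholds: {metric_name: limit}; a metric passes when value > limit.
    combine: built-in all (every metric must pass) or any (one suffices);
    which rule applies is an assumption left open by the source text.
    """
    selected = []
    for location, stats in candidates.items():
        checks = [stats[name] > limit for name, limit in thresholds.items()]
        if combine(checks):
            selected.append(location)
    return selected


thresholds = {"slow_rate": 0.2, "read_back_rate": 0.2, "read_backs": 3}
candidates = {
    "p3": {"slow_rate": 0.5, "read_back_rate": 0.4, "read_backs": 5},
    "p7": {"slow_rate": 0.3, "read_back_rate": 0.1, "read_backs": 1},
    "p9": {"slow_rate": 0.1, "read_back_rate": 0.0, "read_backs": 0},
}
strict = select_presented_locations(candidates, thresholds)
loose = select_presented_locations(candidates, thresholds, combine=any)
```

With these sample values, the strict rule keeps only the location that exceeds every threshold, while the loose rule also keeps a location that exceeds just one.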

Next, the presentation location extraction unit 106 passes the improvement locations extracted in step B3 to the improvement location presentation unit 107 and instructs it to present them (step B4).

The improvement location presentation unit 107 then creates information that identifies the improvement locations indicated in step B4 and transmits the created information to the terminal device of the provider who specified the document (step B5). As a result, the improvement locations of the document are presented on the screen of the provider's terminal device.

[Effects of the embodiment]
As described above, the present embodiment provides the following four effects.

The first effect is that the person who created a document (the provider) can identify the parts of the document that are difficult to read.

The second effect is that, as difficult-to-read parts of instruction manuals and terms of use are reduced, users can fully benefit from services and products and can avoid using them incorrectly.

The third effect is that, because each user's normal reading speed is taken into account when judging which parts of a document are difficult to read, an accurate evaluation result that reflects individual differences in reading speed can be obtained.

The fourth effect is that, because the input information is the act of "reading a document", which users always perform, there is almost no possibility of overlooking a part of the document that should be improved, and an accurate evaluation result is obtained.

[Program]
The program in the present embodiment may be any program that causes a computer to execute steps A1 to A8 shown in FIG. 10 and steps B1 to B5 shown in FIG. 11. By installing this program on a computer and executing it, the document improvement location extraction device 100 and the document improvement location extraction method in the present embodiment can be realized. In this case, the processor of the computer functions as the information acquisition unit 101, the standard reading speed calculation unit 102, the improvement location extraction unit 103, and the improvement location presentation unit 107, and performs the processing.

The program in the present embodiment may also be executed by a computer system constructed from a plurality of computers. In this case, for example, each computer may function as one of the information acquisition unit 101, the standard reading speed calculation unit 102, the improvement location extraction unit 103, and the improvement location presentation unit 107.

[Physical configuration]
A computer that realizes the document improvement location extraction device 100 by executing the program according to the present embodiment will now be described with reference to FIG. 12. FIG. 12 is a block diagram showing an example of a computer that realizes the document improvement location extraction device according to the embodiment of the present invention.

As shown in FIG. 12, the computer 10 includes a CPU (Central Processing Unit) 11, a main memory 12, a storage device 13, an input interface 14, a display controller 15, a data reader/writer 16, and a communication interface 17. These units are connected to one another via a bus 21 so that they can exchange data. The computer 10 may include a GPU (Graphics Processing Unit) or an FPGA (Field-Programmable Gate Array) in addition to, or in place of, the CPU 11.

The CPU 11 loads the program (code) of the present embodiment stored in the storage device 13 into the main memory 12 and executes it in a predetermined order, thereby performing various operations. The main memory 12 is typically a volatile storage device such as a DRAM (Dynamic Random Access Memory). The program in the present embodiment is provided in a state stored on a computer-readable recording medium 20. The program in the present embodiment may also be distributed over the Internet connected via the communication interface 17.

Specific examples of the storage device 13 include a hard disk drive and a semiconductor storage device such as a flash memory. The input interface 14 mediates data transmission between the CPU 11 and input devices 18 such as a keyboard and a mouse. The display controller 15 is connected to the display device 19 and controls display on the display device 19.

The data reader/writer 16 mediates data transmission between the CPU 11 and the recording medium 20, reads the program from the recording medium 20, and writes the results of processing in the computer 10 to the recording medium 20. The communication interface 17 mediates data transmission between the CPU 11 and other computers.

Specific examples of the recording medium 20 include general-purpose semiconductor storage devices such as CF (Compact Flash (registered trademark)) and SD (Secure Digital), magnetic recording media such as a flexible disk, and optical recording media such as a CD-ROM (Compact Disk Read Only Memory).

The document improvement location extraction device 100 according to the present embodiment can also be realized by using hardware corresponding to each unit, rather than by a computer on which the program is installed. Furthermore, part of the document improvement location extraction device 100 may be realized by a program and the remaining part by hardware.

Part or all of the above-described embodiment can be expressed by (Appendix 1) to (Appendix 15) below, but is not limited to the following description.

(Appendix 1)
A document improvement location extraction device comprising:
an information acquisition unit that acquires line-of-sight history information identifying the trajectory of a user's line of sight when the user read a document;
a standard reading speed calculation unit that calculates, based on the acquired line-of-sight history information, a standard reading speed at which the user read the document; and
an improvement location extraction unit that extracts, based on the acquired line-of-sight history information, a location in the document where the movement speed of the user's line of sight is lower than the standard reading speed, as an improvement location of the document.

(Appendix 2)
The document improvement location extraction device according to Appendix 1, wherein
the improvement location extraction unit further extracts, based on the acquired line-of-sight history information, a location in the document that the user has read back, as an improvement location of the document.

(Appendix 3)
The document improvement location extraction device according to Appendix 1 or 2, wherein
the standard reading speed calculation unit calculates the standard reading speed from the number of characters in the document and the time from the start to the end of reading the document.

(Appendix 4)
The document improvement location extraction device according to any one of Appendices 1 to 3, wherein
the improvement location extraction unit calculates a reference speed in accordance with the standard reading speed and extracts, as an improvement location of the document, a location where the movement speed of the user's line of sight is lower than the reference speed.

(Appendix 5)
The document improvement location extraction device according to any one of Appendices 1 to 4, further comprising
an improvement location presentation unit for presenting the improvement location, wherein
the improvement location extraction unit has extracted improvement locations for a plurality of documents and, when any one of the plurality of documents is specified from outside, identifies the improvement locations extracted for the specified document and causes the improvement location presentation unit to present the identified improvement locations.

(Appendix 6)
A document improvement location extraction method comprising:
(a) a step of acquiring line-of-sight history information identifying the trajectory of a user's line of sight when the user read a document;
(b) a step of calculating, based on the acquired line-of-sight history information, a standard reading speed at which the user read the document; and
(c) a step of extracting, based on the acquired line-of-sight history information, a location in the document where the movement speed of the user's line of sight is lower than the standard reading speed, as an improvement location of the document.

(Appendix 7)
The document improvement location extraction method according to Appendix 6, wherein
in step (c), a location in the document that the user has read back is further extracted, based on the acquired line-of-sight history information, as an improvement location of the document.

(Appendix 8)
The document improvement location extraction method according to Appendix 6 or 7, wherein
in step (b), the standard reading speed is calculated from the number of characters in the document and the time from the start to the end of reading the document.

(Appendix 9)
The document improvement location extraction method according to any one of Appendices 6 to 8, wherein
in step (c), a reference speed is calculated in accordance with the standard reading speed, and a location where the movement speed of the user's line of sight is lower than the reference speed is extracted as an improvement location of the document.

(Appendix 10)
The document improvement location extraction method according to any one of Appendices 6 to 9, further comprising
(d) a step of, in a case where improvement locations have been extracted for a plurality of documents in step (c), when any one of the plurality of documents is specified from outside, identifying the improvement locations extracted for the specified document and causing the identified improvement locations to be presented externally.

(Appendix 11)
A program that causes a computer to execute:
(a) a step of acquiring line-of-sight history information identifying the trajectory of a user's line of sight when the user read a document;
(b) a step of calculating, based on the acquired line-of-sight history information, a standard reading speed at which the user read the document; and
(c) a step of extracting, based on the acquired line-of-sight history information, a location in the document where the movement speed of the user's line of sight is lower than the standard reading speed, as an improvement location of the document.

(Appendix 12)
The program according to Appendix 11, wherein
in step (c), a location in the document that the user has read back is further extracted, based on the acquired line-of-sight history information, as an improvement location of the document.

(Appendix 13)
The program according to Appendix 11 or 12, wherein
in step (b), the standard reading speed is calculated from the number of characters in the document and the time from the start to the end of reading the document.

(Appendix 14)
The program according to any one of Appendices 11 to 13, wherein
in step (c), a reference speed is calculated in accordance with the standard reading speed, and a location where the movement speed of the user's line of sight is lower than the reference speed is extracted as an improvement location of the document.

(Appendix 15)
The program according to any one of Appendices 11 to 14, further causing the computer to execute
(d) a step of, in a case where improvement locations have been extracted for a plurality of documents in step (c), when any one of the plurality of documents is specified from outside, identifying the improvement locations extracted for the specified document and causing the identified improvement locations to be presented externally.

As described above, the present invention makes it possible to improve the accuracy of extracting improvement locations in a document. The present invention is applicable to the fields of text proofreading and program readability measurement. It is also expected to be applied to devices and the like that automatically output improvement locations in documents and programs.

DESCRIPTION OF SYMBOLS
10 Computer
11 CPU
12 Main memory
13 Storage device
14 Input interface
15 Display controller
16 Data reader/writer
17 Communication interface
18 Input device
19 Display device
20 Recording medium
21 Bus
100 Document improvement location extraction device
101 Information acquisition unit
102 Standard reading speed calculation unit
103 Improvement location extraction unit
104 Reading speed reduction location extraction unit
105 Read-back location extraction unit
106 Presentation location extraction unit
107 Improvement location presentation unit
111 Line-of-sight history database
112 Reading speed database
113 Improvement location candidate database
122 Standard reading speed information
123 Reading speed reduction occurrence location information
124 Read-back occurrence location information
200 Line-of-sight history information collection device

Claims

A document improvement location extraction device comprising:
an information acquisition unit that acquires line-of-sight history information identifying the trajectory of a user's line of sight when the user read a document;
a standard reading speed calculation unit that calculates, based on the acquired line-of-sight history information, a standard reading speed at which the user read the document; and
an improvement location extraction unit that extracts, based on the acquired line-of-sight history information, a location in the document where the movement speed of the user's line of sight is lower than the standard reading speed, as an improvement location of the document.

The document improvement location extraction device according to claim 1, wherein
the improvement location extraction unit further extracts, based on the acquired line-of-sight history information, a location in the document that the user has read back, as an improvement location of the document.

The document improvement location extraction device according to claim 1 or 2, wherein
the standard reading speed calculation unit calculates the standard reading speed from the number of characters in the document and the time from the start to the end of reading the document.

The document improvement location extraction device according to any one of claims 1 to 3, wherein
the improvement location extraction unit calculates a reference speed in accordance with the standard reading speed and extracts, as an improvement location of the document, a location where the movement speed of the user's line of sight is lower than the reference speed.

The document improvement location extraction device according to any one of claims 1 to 4, further comprising
an improvement location presentation unit for presenting the improvement location, wherein
the improvement location extraction unit has extracted improvement locations for a plurality of documents and, when any one of the plurality of documents is specified from outside, identifies the improvement locations extracted for the specified document and causes the improvement location presentation unit to present the identified improvement locations.

A document improvement location extraction method comprising:
(a) a step of acquiring line-of-sight history information identifying the trajectory of a user's line of sight when the user read a document;
(b) a step of calculating, based on the acquired line-of-sight history information, a standard reading speed at which the user read the document; and
(c) a step of extracting, based on the acquired line-of-sight history information, a location in the document where the movement speed of the user's line of sight is lower than the standard reading speed, as an improvement location of the document.

A program that causes a computer to execute:
(a) a step of acquiring line-of-sight history information identifying the trajectory of a user's line of sight when the user read a document;
(b) a step of calculating, based on the acquired line-of-sight history information, a standard reading speed at which the user read the document; and
(c) a step of extracting, based on the acquired line-of-sight history information, a location in the document where the movement speed of the user's line of sight is lower than the standard reading speed, as an improvement location of the document.