JP7187816B2

JP7187816B2 - Search method, search program and search device

Info

Publication number: JP7187816B2
Application number: JP2018091663A
Authority: JP
Inventors: 潤高橋; 淳哉斎藤; 哲典田口; 拓也上村; 健太郎村瀬; 清司大倉; 慎司菊池; 章中川; 俊之福岡
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2017-10-13
Filing date: 2018-05-10
Publication date: 2022-12-13
Anticipated expiration: 2038-05-10
Also published as: JP2019114224A

Description

本発明は、検索方法、検索プログラム及び検索装置に関する。 The present invention relates to a search method, search program, and search device.

従来から、ユーザにより入力されたクエリに基づく検索結果を、レーダチャートにより表示させるシステムが知られている。また、従来のシステムでは、例えば、レーダチャートを表示させる際に、ユーザにより入力されたクエリに基づく検索結果の集合から、軸となる項目（単語）を抽出することが知られている。 Conventionally, there has been known a system that displays search results based on a query input by a user using a radar chart. Further, in a conventional system, for example, when displaying a radar chart, it is known to extract a key item (word) from a set of search results based on a query input by a user.

特開２００８－００３８６９号公報Japanese Patent Application Laid-Open No. 2008-003869

従来の技術では、検索結果の集合からレーダチャートの軸となる単語を抽出しているため、ユーザにより入力されたクエリによっては、レーダチャートが適切な軸で表示されているにも関わらず、クエリと軸との対応関係が明確でない状況が生じうる。このような状況では、ユーザは、検索結果が自身の意図に適合したものであるか否かを速やかに評価することが困難である。 In the conventional technology, words that serve as the axis of the radar chart are extracted from a set of search results. Therefore, depending on the query input by the user, the query A situation may arise in which the corresponding relationship between and the axis is not clear. In such a situation, it is difficult for the user to quickly evaluate whether or not the search result matches his/her intention.

１つの側面では、本発明は、検索結果の評価を適切に行うことを目的としている。 In one aspect, an object of the present invention is to appropriately evaluate search results.

一つの態様では、コンピュータによる検索方法であって、該コンピュータが、クエリに基づき、検索語群を特定し、検索対象データより、前記検索語群を用いて検索結果集合データを取得し、単語と印象語との関連の強さを示す値と、前記関連の強さを示す値のばらつきを示す値とが対応付けられた印象語マップを用いて作成された変換モデルに対し、前記検索語群に含まれる単語を分散表現したベクトルを入力し、前記検索語群に含まれる単語毎に、前記印象語マップに含まれる各印象語との関連の強さを示す値と、前記ばらつきを示す値とを取得し、前記関連の強さを示す値に応じて、第１特徴語群となる印象語を抽出し、前記検索結果集合データを、前記第１特徴語群を項目として用いて表示させ、前記第１特徴語群に、前記ばらつきを示す値が閾値より大きい印象語が含まれる場合に、前記ばらつきを示す値が閾値より大きい印象語を特定し、特定された印象語の前記ばらつきを示す値を用いて、前記印象語マップに含まれる印象語のうち、前記第１特徴語群に含まれない印象語の中から、前記第１特徴語群の代わりに前記項目として用いることが可能な印象語である第３特徴語群を特定し、前記第３特徴語群を、前記第１特徴語群と入れ替えることが可能であることを示す通知を表示させる、検索方法である。
In one aspect, a search method by a computer, wherein the computer identifies a search term group based on a query, acquires search result set data using the search term group from search target data, and searches for words and A conversion model created using an impression word map in which a value indicating strength of relationship with an impression word and a value indicating variation in the value indicating strength of relationship are associated with each other; and a value indicating the strength of association with each impression word included in the impression word map and a value indicating the dispersion for each word included in the search word group. and extracting impression words that form the first characteristic word group according to the value indicating the strength of the relationship, and displaying the search result set data using the first characteristic word group as items. and if the first feature word group includes an impression word whose value indicating the variation is greater than the threshold, the impression word whose value indicating the variation is greater than the threshold is specified, and the variation of the identified impression word is determined. out of the impression words included in the impression word map, the impression words not included in the first feature word group may be used as the item instead of the first feature word group. In the retrieval method, a third characteristic word group, which is a possible impression word, is specified, and a notification indicating that the third characteristic word group can be replaced with the first characteristic word group is displayed .

上記各手順は、上記各手順を実現する機能部、各処理としても良く、各処理をコンピュータに実行させるプログラムを記憶したコンピュータ読み取り可能な記憶媒体とすることもできる。 Each of the above procedures may be a functional unit or each process for realizing each of the above procedures, or may be a computer-readable storage medium storing a program for causing a computer to execute each process.

検索結果の評価を適切に行うことができる。 Appropriate evaluation of search results.

第一の実施形態の検索システムのシステム構成の一例を示す図である。It is a figure showing an example of the system configuration of the search system of the first embodiment. 第一の実施形態の検索装置のハードウェア構成の一例を示す図である。It is a figure which shows an example of the hardware constitutions of the search device of 1st embodiment. 第一の実施形態の検索装置の機能を説明する図である。It is a figure explaining the function of the search device of 1st embodiment. インデックス生成部による処理を説明するフローチャートである。9 is a flowchart for explaining processing by an index generation unit; インデックス生成部により生成されたインデックス情報の一例を示す図である。It is a figure which shows an example of the index information produced|generated by the index production|generation part. 第一の実施形態の印象語マップの一例を示す図である。FIG. 4 is a diagram showing an example of an impression word map according to the first embodiment; FIG. 印象空間学習部による処理を説明するフローチャートである。4 is a flowchart for explaining processing by an impression space learning unit; 第一の実施形態の印象語スコアリストの一例を示す図である。FIG. 4 is a diagram showing an example of an impression word score list according to the first embodiment; FIG. 第一の実施形態の検索処理部の処理を説明するフローチャートである。9 is a flowchart for explaining processing of a search processing unit according to the first embodiment; 第一の実施形態の印象語スコア出力部及び軸決定部の処理を説明するフローチャートである。4 is a flow chart for explaining processing of an impression word score output unit and an axis determination unit according to the first embodiment; 印象語スコア出力部と軸決定部の処理を説明する図である。FIG. 4 is a diagram for explaining processing of an impression word score output unit and an axis determination unit; 第一の実施形態のレーダチャートの表示例を示す第一の図である。FIG. 4 is a first diagram showing a display example of a radar chart according to the first embodiment; 比較例となるレーダチャートの一例を示す図である。It is a figure which shows an example of the radar chart used as a comparative example. 第一の実施形態のレーダチャートの表示例を示す第二の図である。FIG. 4 is a second diagram showing a display example of a radar chart according to the first embodiment; 第二の実施形態の検索装置の機能を説明する図である。It is a figure explaining the function of the search device of 2nd embodiment. 第二の実施形態の検索処理部の処理を説明するフローチャートである。It is a flow chart explaining processing of a search processing part of a second embodiment. 第二の実施形態の軸決定部の処理を説明するフローチャートである。It is a flow chart explaining processing of an axis deciding part of a second embodiment. 第三の実施形態の印象語マップにおけるスコアのばらつきについて説明する図である。FIG. 11 is a diagram for explaining score variations in the impression word map of the third embodiment; 第三の実施形態の検索装置の機能を説明する図である。It is a figure explaining the function of the search device of 3rd embodiment. 第三の実施形態の印象語マップの例を示す図である。FIG. 13 is a diagram showing an example of an impression word map according to the third embodiment; FIG. 第三の実施形態の印象語スコアリストの一例を示す図である。FIG. 13 is a diagram showing an example of an impression word score list according to the third embodiment; FIG. 第三の実施形態の検索処理部の処理を説明するフローチャートである。It is a flow chart explaining processing of a search processing part of a third embodiment. 第三の実施形態の印象語スコア出力部、軸決定部の処理を説明するフローチャートである。FIG. 11 is a flow chart for explaining processing of an impression word score output unit and an axis determination unit according to the third embodiment; FIG. 第三の実施形態の検索処理部による軸の候補の抽出を説明する図である。It is a figure explaining extraction of the candidate of an axis by the search processing part of a third embodiment. 第三の実施形態の表示例を示す第一の図である。FIG. 11 is a first diagram showing a display example of the third embodiment; 第三の実施形態の表示例を示す第二の図である。FIG. 11 is a second diagram showing a display example of the third embodiment; 第四の実施形態の検索装置の機能を説明する図である。It is a figure explaining the function of the search device of 4th embodiment. 第四の実施形態における軸の変更の処理を説明するフローチャートである。FIG. 14 is a flowchart for explaining axis change processing in the fourth embodiment. FIG. 軸の変更について説明する図である。It is a figure explaining the change of an axis. 第五の実施形態の検索装置の機能を説明する図である。It is a figure explaining the function of the search device of 5th embodiment. 第五の実施形態の補正リスト生成部の処理を説明するフローチャートである。FIG. 14 is a flowchart for explaining processing of a correction list generation unit according to the fifth embodiment; FIG. 第五の実施形態の補正リストの一例を示す図である。It is a figure which shows an example of the correction list|wrist of 5th embodiment. 第五の実施形態の印象語スコア出力部、軸決定部、スコア補正部の処理を説明するフローチャートである。FIG. 14 is a flow chart for explaining processing of an impression word score output unit, an axis determination unit, and a score correction unit according to the fifth embodiment; FIG. 第六の実施形態の概要を説明する図である。FIG. 11 is a diagram explaining an outline of a sixth embodiment; 印象語のスコアの変更について説明する図である。FIG. 11 is a diagram for explaining how to change scores of impression words; 第六の実施形態の検索装置の機能を説明する図である。It is a figure explaining the function of the search device of 6th embodiment. 第六の実施形態の検索処理部の処理を説明する第一のフローチャートである。FIG. 14 is a first flowchart for explaining processing of a search processing unit according to the sixth embodiment; FIG. 第六の実施形態の検索処理部の処理を説明する第二のフローチャートである。FIG. 16 is a second flowchart for explaining processing of the search processing unit of the sixth embodiment; FIG. 第六の実施形態の検索処理部の処理を説明する第三のフローチャートである。FIG. 16 is a third flowchart for explaining processing of the search processing unit of the sixth embodiment; FIG. 第六の実施形態のスコアとスライダ値との対応付けを説明する図である。FIG. 20 is a diagram illustrating association between scores and slider values according to the sixth embodiment; 第六の実施形態の対応テーブルの一例を示す図である。It is a figure which shows an example of the correspondence table of 6th embodiment. 第六の実施形態の表示の一例を示す図である。FIG. 21 is a diagram showing an example of display according to the sixth embodiment; FIG. 第六の実施形態の表示の他の例を示す図である。FIG. 21 is a diagram showing another example of display in the sixth embodiment; FIG. 第七の実施形態のスコアとスライダ値との対応付けを説明する図である。FIG. 20 is a diagram illustrating association between scores and slider values according to the seventh embodiment; 第七の実施形態の検索処理部の処理を説明するフローチャートである。FIG. 16 is a flowchart for explaining processing of a search processing unit according to the seventh embodiment; FIG. 第七の実施形態の対応テーブルの一例を示す図である。It is a figure which shows an example of the correspondence table of 7th embodiment.

（第一の実施形態）
以下に図面を参照して、第一の実施形態について説明する。図１は、第一の実施形態の検索システムのシステム構成の一例を示す図である。 (First embodiment)
A first embodiment will be described below with reference to the drawings. FIG. 1 is a diagram showing an example of the system configuration of the search system of the first embodiment.

本実施形態の検索システム１００は、検索装置２００と、端末装置３００とを有する。検索装置２００と端末装置３００とは、ネットワークを介して接続される。また、本実施形態の検索装置２００は、検索装置２００による検索処理の対象となる情報が格納された検索対象文書データベース４００と通信を行う。本実施形態の検索対象文書データベース４００に格納された情報とは、例えば、文書データ等のテキストデータである。 A search system 100 of this embodiment has a search device 200 and a terminal device 300 . The search device 200 and the terminal device 300 are connected via a network. The search device 200 of this embodiment also communicates with a search target document database 400 that stores information to be searched by the search device 200 . The information stored in the search target document database 400 of this embodiment is, for example, text data such as document data.

本実施形態の検索対象文書データベース４００は、例えば、複数のデータベースを含んでも良い。また、本実施形態の検索対象文書データベース４００は、検索システム１００に含まれても良い。 The search target document database 400 of this embodiment may include, for example, a plurality of databases. Also, the search target document database 400 of this embodiment may be included in the search system 100 .

本実施形態の検索装置２００は、端末装置３００において、ユーザによってクエリが入力されると、入力されたクエリに基づき、検索対象文書データベース４００を検索する。そして、検索装置２００は、検索処理を行った結果として取得された検索結果を、レーダチャートによって、端末装置３００に表示させる。尚、クエリとは、例えば、端末装置３００や検索装置２００のユーザが、検索要求を行う際に入力する単語・フレーズ（複合語）のことを示す。 When a user inputs a query in the terminal device 300, the search device 200 of this embodiment searches the search target document database 400 based on the input query. Then, the search device 200 causes the terminal device 300 to display the search results acquired as a result of the search processing using a radar chart. A query is, for example, a word/phrase (compound word) input by a user of the terminal device 300 or the search device 200 when making a search request.

このとき、本実施形態の検索装置２００は、端末装置３００から入力されたクエリに基づき抽出された単語を軸としたレーダチャートを表示させる。 At this time, the search device 200 of the present embodiment displays a radar chart centered on words extracted based on the query input from the terminal device 300 .

本実施形態では、このように、ユーザにより入力されたクエリに基づき抽出された単語をレーダチャートの軸に用いることで、クエリとレーダチャートの軸との対応関係を明確にすることができる。したがって、本実施形態では、検索結果が自身の意図に適合したものであるか否かをユーザに容易に評価させることができる。このように、本実施形態によれば、検索結果の評価を適切に行うことができる。 In this embodiment, by using the words extracted based on the query input by the user as the axis of the radar chart, it is possible to clarify the correspondence between the query and the axis of the radar chart. Therefore, in this embodiment, it is possible for the user to easily evaluate whether or not the search result matches his/her intention. Thus, according to this embodiment, it is possible to appropriately evaluate search results.

本実施形態の検索装置２００は、検索対象インデックス記憶部２１０と、印象語マップ記憶部２２０と、検索対象スコア記憶部２３０、検索処理部２４０と、を有する。 The search device 200 of this embodiment has a search target index storage unit 210 , an impression word map storage unit 220 , a search target score storage unit 230 and a search processing unit 240 .

検索対象インデックス記憶部２１０は、検索対象文書データベース４００に格納された各文書データと、各文書データに含まれる単語とを対応付けたインデックス情報を格納する。印象語マップ記憶部２２０は、単語と印象語との関連を示す印象語マップを格納する。検索対象スコア記憶部２３０は、検索対象文書データベース４００に含まれる文書データについて、各文書データに含まれる単語と印象語との関連を示す印象語スコアリストを格納する。各記憶部の詳細は後述する。 The search target index storage unit 210 stores index information in which each document data stored in the search target document database 400 and words included in each document data are associated with each other. The impression word map storage unit 220 stores an impression word map indicating the relationship between words and impression words. The search target score storage unit 230 stores an impression word score list indicating the relationship between words and impression words included in each document data included in the search target document database 400 . Details of each storage unit will be described later.

検索処理部２４０は、端末装置３００からクエリの入力を受け付けると、検索対象インデックス記憶部２１０を参照して検索対象文書データベース４００に対する検索を行い、検索結果としての文書データを取得する。以下の説明では、検索処理部２４０による検索処理により取得された検索結果を検索結果集合データと呼ぶ。 When the search processing unit 240 receives a query input from the terminal device 300, the search processing unit 240 refers to the search target index storage unit 210, searches the search target document database 400, and obtains document data as a search result. In the following description, search results obtained by search processing by the search processing unit 240 will be referred to as search result set data.

また、検索処理部２４０は、印象語マップ記憶部２２０を参照して、クエリの内容と関連が強い印象語をレーダチャートの軸に選択し、検索結果集合データ毎のレーダチャートを端末装置３００に表示させる。 In addition, the search processing unit 240 refers to the impression word map storage unit 220, selects impression words that are strongly related to the content of the query as axes of the radar chart, and displays the radar chart for each set of search result data in the terminal device 300. display.

尚、図１の例では、クエリは端末装置３００から検索装置２００に入力され、レーダチャートは端末装置３００に表示されるものとしたが、これに限定されない。クエリは、検索装置２００の有する入力装置等によって入力されても良いし、レーダチャートは、検索装置２００の有する出力装置（ディスプレイ）等に表示されても良い。 In the example of FIG. 1, the query is input from the terminal device 300 to the search device 200, and the radar chart is displayed on the terminal device 300, but the present invention is not limited to this. The query may be input by an input device or the like of the search device 200 , and the radar chart may be displayed on an output device (display) or the like of the search device 200 .

以下に、本実施形態の検索装置２００について説明する。図２は、第一の実施形態の検索装置のハードウェア構成の一例を示す図である。 The search device 200 of this embodiment will be described below. FIG. 2 is a diagram showing an example of the hardware configuration of the search device of the first embodiment.

本実施形態の検索装置２００は、それぞれバスＢで相互に接続されている入力装置２１、出力装置２２、ドライブ装置２３、補助記憶装置２４、メモリ装置２５、演算処理装置２６及びインターフェース装置２７を含む情報処理装置である。 The search device 200 of this embodiment includes an input device 21, an output device 22, a drive device 23, an auxiliary storage device 24, a memory device 25, an arithmetic processing device 26, and an interface device 27, which are connected to each other via a bus B. It is an information processing device.

入力装置２１は、各種の情報の入力を行うための装置であり、例えばキーボードやポインティングデバイス等により実現される。出力装置２２は、各種の情報の出力を行うためものであり、例えばディスプレイ等により実現される。インターフェース装置２７は、ＬＡＮカード等を含み、ネットワークに接続する為に用いられる。 The input device 21 is a device for inputting various kinds of information, and is realized by, for example, a keyboard or pointing device. The output device 22 is for outputting various kinds of information, and is realized by, for example, a display. The interface device 27 includes a LAN card and the like, and is used for connecting to a network.

検索プログラムは、検索装置２００を制御する各種プログラムの少なくとも一部である。検索プログラムは例えば記憶媒体２８の配布やネットワークからのダウンロード等によって提供される。検索プログラムを記録した記憶媒体２８は、ＣＤ－ＲＯＭ、フレキシブルディスク、光磁気ディスク等の様に情報を光学的、電気的或いは磁気的に記録する記憶媒体、ＲＯＭ、フラッシュメモリ等の様に情報を電気的に記録する半導体メモリ等、様々なタイプの記憶媒体を用いることができる。 The search program is at least part of various programs that control the search device 200 . The search program is provided, for example, by distribution of the storage medium 28 or download from a network. The storage medium 28 in which the search program is recorded is a storage medium for optically, electrically or magnetically recording information such as a CD-ROM, a flexible disk, a magneto-optical disk, etc., or a storage medium for storing information such as a ROM, a flash memory, etc. Various types of storage media can be used, such as semiconductor memories that record electrically.

また、検索プログラムは、検索プログラムを記録した記憶媒体２８がドライブ装置２３にセットされると、記憶媒体２８からドライブ装置２３を介して補助記憶装置２４にインストールされる。ネットワークからダウンロードされた検索プログラムは、インターフェース装置２７を介して補助記憶装置２４にインストールされる。 Further, the search program is installed in the auxiliary storage device 24 from the storage medium 28 via the drive device 23 when the storage medium 28 recording the search program is set in the drive device 23 . A search program downloaded from the network is installed in the auxiliary storage device 24 via the interface device 27 .

補助記憶装置２４は、インストールされた検索プログラムを格納すると共に、必要なファイル、データ等を格納する。メモリ装置２５は、検索装置２００の起動時に補助記憶装置２４から検索プログラムを読み出して格納する。そして、演算処理装置２６はメモリ装置２５に格納された検索プログラムに従って、後述するような各種処理を実現している。 The auxiliary storage device 24 stores the installed search program, as well as necessary files and data. The memory device 25 reads and stores the search program from the auxiliary storage device 24 when the search device 200 is activated. The arithmetic processing unit 26 implements various processes described later according to the search program stored in the memory unit 25 .

本実施形態の端末装置３００のハードウェア構成は、検索装置２００と同様であるから、説明を省略する。尚、端末装置３００は、例えば、タブレット型の端末装置や、スマートフォン等であっても良く、入力装置２１と出力装置２２との代わりに、タッチパネル等により実現される表示操作装置を有していても良い。 Since the hardware configuration of the terminal device 300 of this embodiment is the same as that of the search device 200, description thereof will be omitted. Note that the terminal device 300 may be, for example, a tablet terminal device, a smartphone, or the like, and instead of the input device 21 and the output device 22, it has a display operation device realized by a touch panel or the like. Also good.

次に、図３を参照して、本実施形態の検索装置２００の機能について説明する。図３は、第一の実施形態の検索装置の機能を説明する図である。 Next, with reference to FIG. 3, functions of the search device 200 of this embodiment will be described. FIG. 3 is a diagram for explaining the functions of the search device of the first embodiment.

本実施形態の検索装置２００は、検索対象インデックス記憶部２１０、印象語マップ記憶部２２０、検索対象スコア記憶部２３０、検索処理部２４０を有する。 The search device 200 of this embodiment has a search target index storage unit 210 , an impression word map storage unit 220 , a search target score storage unit 230 and a search processing unit 240 .

検索対象インデックス記憶部２１０には、インデックス情報２１１が格納されている。本実施形態のインデックス情報２１１は、後述するインデックス生成部２４７による事前処理によって作成される。 Index information 211 is stored in the search target index storage unit 210 . The index information 211 of this embodiment is created by pre-processing by the index generation unit 247, which will be described later.

印象語マップ記憶部２２０には、印象語マップ２２１が格納されている。本実施形態の印象語マップ２２１は、検索装置２００の管理者等によって、予め与えられたものである。 An impression word map 221 is stored in the impression word map storage unit 220 . The impression word map 221 of this embodiment is provided in advance by the administrator of the search device 200 or the like.

検索対象スコア記憶部２３０には、印象語スコアリスト２３１が格納されている。印象語スコアリスト２３１は、後述する検索処理部２４０の処理によって作成される。 An impression word score list 231 is stored in the search target score storage unit 230 . The impression word score list 231 is created by the processing of the search processing unit 240, which will be described later.

本実施形態の検索処理部２４０は、入力受付部２４１、解析部２４２、検索部２４３、印象語スコア出力部２４４、軸決定部２４５、出力部２４６、インデックス生成部２４７、印象空間学習部２４８を有する。検索処理部２４０の有する各部は、検索装置２００の有する演算処理装置２６が、メモリ装置２５等に格納された検索プログラムを読み出して実行することで実現される。 The search processing unit 240 of this embodiment includes an input reception unit 241, an analysis unit 242, a search unit 243, an impression word score output unit 244, an axis determination unit 245, an output unit 246, an index generation unit 247, and an impression space learning unit 248. have. Each unit of the search processing unit 240 is implemented by the arithmetic processing unit 26 of the search device 200 reading out and executing a search program stored in the memory device 25 or the like.

本実施形態の入力受付部２４１は、検索装置２００に対する各種の入力を受け付ける。具体的には、入力受付部２４１は、検索装置２００に対するクエリの入力を受け付ける。 The input reception unit 241 of this embodiment receives various inputs to the search device 200 . Specifically, the input reception unit 241 receives input of a query to the search device 200 .

本実施形態の解析部２４２は、キーワードや自然文に対し、単語の分割、係り受けの解析、意味の解析等を行う。具体的には、解析部２４２は、クエリに基づき検索語群を特定したり、検索対象文書データベース４００から取得した検索結果の文書データから単語を特定したりする。つまり、本実施形態の解析部２４２は、クエリから検索語群を特定する特定部の機能を果たす。 The analysis unit 242 of this embodiment performs word segmentation, dependency analysis, semantic analysis, etc. on keywords and natural sentences. Specifically, the analysis unit 242 identifies a search term group based on the query, or identifies words from the document data of the search results obtained from the search target document database 400 . In other words, the analysis unit 242 of this embodiment functions as an identification unit that identifies a search term group from a query.

本実施形態の検索部２４３は、検索対象インデックス記憶部２１０を参照して、解析部２４２により得られた検索語群を含む文書データを特定し、検索対象文書データベース４００から、検索結果集合データを取得する。 The search unit 243 of this embodiment refers to the search target index storage unit 210 to specify document data containing the search term group obtained by the analysis unit 242, and retrieves search result set data from the search target document database 400. get.

印象語スコア出力部２４４は、クエリから得られた検索語群に含まれる各単語を、分散表現によってベクトルに変換する。そして、印象語スコア出力部２４４は、ベクトルに変換された各単語と、後述する変換モデル２５０とに基づき、検索語群と、印象語マップ２２１に含まれる印象語との関連の強さを示すスコアを出力する。 The impression word score output unit 244 converts each word included in the search word group obtained from the query into a vector using distributed representation. Then, the impression word score output unit 244 indicates the strength of the relationship between the search word group and the impression words included in the impression word map 221 based on each word converted into a vector and the conversion model 250 described later. Output the score.

言い換えれば、印象語スコア出力部２４４は、ベクトルで表現された単語を、変換モデル２５０に対する入力として、この単語と、印象語マップ２２１に含まれる各印象語との関連の強さを示すスコアを出力する。 In other words, the impression word score output unit 244 uses the word represented by the vector as an input to the conversion model 250 and generates a score indicating the strength of the relationship between this word and each impression word included in the impression word map 221. Output.

本実施形態の単語の分散表現による方法とは、別途与えられる大量の学習データを用いて各単語間の意味的な類似関係（類似度）を推定し、推定した単語間類似度と矛盾しないかたちで、あらかじめ定めた次元数のベクトル空間に各単語を配置することで、単語の意味をベクトルで表現する方法である。以下の説明では、単語の分散表現によって変換された、単語の意味を表現するベクトルを意味ベクトルと呼ぶ。 The method based on distributed representation of words in this embodiment estimates the semantic similarity (similarity) between words using a large amount of training data that is separately given, and In this method, each word is arranged in a vector space having a predetermined number of dimensions, and the meaning of the word is represented by a vector. In the following description, a vector that expresses the meaning of a word that has been converted by distributed representation of the word will be referred to as a semantic vector.

また、本実施形態の印象語とは、形容詞や形容動詞、副詞などの検索結果の印象を想起させる単語である。 Impression words in the present embodiment are words such as adjectives, adjective verbs, and adverbs that evoke impressions of search results.

軸決定部２４５は、印象語スコア出力部２４４により出力された単語毎のスコアに応じて、検索結果を表示するためのレーダチャートの軸となる印象語を決定する。レーダチャートの軸とは、言い換えれば、検索結果集合データを表示する際の項目の一つである。 The axis determination unit 245 determines impression words that serve as the axes of the radar chart for displaying the search results, according to the score for each word output by the impression word score output unit 244 . The axis of the radar chart is, in other words, one of the items when displaying the search result set data.

本実施形態の出力部２４６は、軸決定部２４５により決定された印象語を軸とした、検索結果集合データのレーダチャートを表示させる。 The output unit 246 of the present embodiment displays a radar chart of the search result set data with the impression word determined by the axis determining unit 245 as the axis.

つまり、本実施形態の出力部２４６は、クエリに基づく検索語群による検索処理で取得された検索結果集合データを表示する際に、検索語群と関連の強さに応じて抽出された印象語を項目として表示させる。 In other words, the output unit 246 of the present embodiment, when displaying the search result set data acquired by the search processing using the search term group based on the query, displays the impression words extracted according to the strength of the relationship with the search term group. is displayed as an item.

本実施形態のインデックス生成部２４７は、検索対象文書データベース４００から、検索対象インデックス記憶部２１０に格納されたインデックス情報２１１を作成する。 The index generation unit 247 of this embodiment creates the index information 211 stored in the search target index storage unit 210 from the search target document database 400 .

印象空間学習部２４８は、検索対象文書データベース４００に格納された検索対象の文書データと、印象語マップ記憶部２２０に格納された印象語マップ２２１とに基づき、変換モデル２５０を出力する。 The impression space learning section 248 outputs a conversion model 250 based on the document data to be searched stored in the document database 400 to be searched and the impression word map 221 stored in the impression word map storage section 220 .

本実施形態のインデックス生成部２４７と印象空間学習部２４８の処理は、クエリが入力されてから行われる検索処理の事前処理として、予め行われていることが好ましい。 The processing of the index generation unit 247 and the impression space learning unit 248 of the present embodiment is preferably performed in advance as pre-processing of search processing that is performed after a query is input.

以下に、本実施形態の事前処理について説明する。はじめに、図４及び図５を参照して、インデックス生成部２４７によるインデックス情報２１１の生成について説明する。 The pre-processing of this embodiment will be described below. First, generation of the index information 211 by the index generator 247 will be described with reference to FIGS. 4 and 5. FIG.

図４は、インデックス生成部による処理を説明するフローチャートである。 FIG. 4 is a flowchart for explaining processing by an index generator.

本実施形態のインデックス生成部２４７は、検索対象文書データベース４００を参照し、検索対象文書データベース４００に格納された文書データ毎に、ある単語が文書データに含まれるか否かを判定する（ステップＳ４０１）。 The index generation unit 247 of this embodiment refers to the search target document database 400, and determines whether or not a certain word is included in the document data for each document data stored in the search target document database 400 (step S401). ).

尚、本実施形態では、検索対象文書データベース４００に格納された各文書データに、文書データを特定するための識別情報として、文書ＩＤが含まれるものとした。 In this embodiment, each document data stored in the search target document database 400 includes a document ID as identification information for specifying the document data.

続いて、インデックス生成部２４７は、単語毎に、単語が含まれる文書データの集合を取得し、単語と、取得した文書データの数と、文書データを特定する識別情報である文書ＩＤとを対応付けたインデックス情報２１１を生成し（ステップＳ４０２）、処理を終了する。 Subsequently, the index generating unit 247 acquires a set of document data containing the word for each word, and associates the word with the number of acquired document data and the document ID, which is identification information for specifying the document data. The attached index information 211 is generated (step S402), and the process ends.

図５は、インデックス生成部により生成されたインデックス情報の一例を示す図である。図５（ａ）は、ステップＳ４０１の処理の結果として得られる中間データを示す図である。図５（ｂ）は、インデックス情報２１１の一例を示す図である。 FIG. 5 is a diagram illustrating an example of index information generated by an index generation unit; FIG. 5A is a diagram showing intermediate data obtained as a result of the process of step S401. FIG. 5B is a diagram showing an example of the index information 211. As shown in FIG.

本実施形態のインデックス生成部２４７は、例えば、文書データに、「これ」、「ペン」、「リンゴ」が含まれるとする。 The index generation unit 247 of this embodiment assumes that document data includes, for example, "this", "pen", and "apple".

この場合、インデックス生成部２４７は、検索対象文書データベース４００に格納された文書データ毎に、単語「これ」、「ペン」、「リンゴ」のそれぞれが、文書データに含まれるか否かを判定し、判定結果を示す中間データ２０５を保持する。 In this case, the index generator 247 determines whether or not the words "this", "pen", and "apple" are included in each piece of document data stored in the search target document database 400. , holds intermediate data 205 indicating the determination result.

図５（ａ）に示す中間データ２０５では、単語「これ」は、文書ＩＤ「１」の文書データと、文書ＩＤ「２」の文書データと、文書ＩＤ「Ｎ」の文書データに含まれることがわかる。また、単語「ペン」は、文書ＩＤ「１」の文書データに含まれることがわかる。また、単語「リンゴ」は、文書ＩＤ「２」の文書データに含まれることがわかる。 In the intermediate data 205 shown in FIG. 5A, the word "this" is included in the document data with the document ID "1", the document data with the document ID "2", and the document data with the document ID "N". I understand. Also, it can be seen that the word "pen" is included in the document data with the document ID "1". Also, it can be seen that the word "apple" is included in the document data with the document ID "2".

次に、インデックス生成部２４７は、各単語に、単語が含まれる文書データの数と、文書ＩＤとを対応付けたインデックス情報２１１を生成する。 Next, the index generating unit 247 generates index information 211 in which each word is associated with the number of document data containing the word and the document ID.

図５（ｂ）に示すインデックス情報２１１は、情報の項目として、キー、マッチ文書数、文書の配列を有する。項目「キー」の値は、文書データに含まれると推定される単語を示す。項目「マッチ文書数」の値は、キーとなる単語が含まれる文書データの数を示す。項目「文書の配列」の値は、キーとなる単語が含まれる文書データの文書ＩＤを示す。尚、項目「文書の配列」の値に含まれる文書ＩＤの数は、項目「マッチ文書数」の値と同数となる。 The index information 211 shown in FIG. 5B has, as information items, a key, the number of matching documents, and an arrangement of documents. The value of the item "key" indicates a word presumed to be included in the document data. The value of the item "number of matching documents" indicates the number of document data containing the key word. The value of the item "arrangement of documents" indicates the document ID of the document data containing the key word. The number of document IDs included in the value of the item "arrangement of documents" is the same as the value of the item "number of matching documents".

図５（ｂ）のインデックス情報２１１では、単語「これ」を含む文書データの数は、２３９個であり、単語「これ」を含む文書データの文書ＩＤは、文書ＩＤ「１」、「２」、・・・、「Ｎ」となる。 In the index information 211 of FIG. 5B, the number of document data containing the word "this" is 239, and the document IDs of the document data containing the word "this" are document IDs "1" and "2". , . . . become “N”.

本実施形態のインデックス生成部２４７は、以上のようにして、インデックス情報２１１を生成し、検索対象インデックス記憶部２１０に格納しておく。 The index generation unit 247 of this embodiment generates the index information 211 as described above and stores it in the search target index storage unit 210 .

次に、印象空間学習部２４８による処理の説明に先立ち、本実施形態の印象語マップ記憶部２２０に格納された印象語マップ２２１について説明する。 Next, before explaining the processing by the impression space learning section 248, the impression word map 221 stored in the impression word map storage section 220 of this embodiment will be explained.

図６は、第一の実施形態の印象語マップの一例を示す図である。本実施形態の印象語マップ２２１は、例えば、検索システム１００の管理者等により作成され、予め検索装置２００に与えられる。より具体的には、印象語マップ２２１は、例えば、単語と印象語との関連の強さを回答させるアンケート等を多数の人に対して実施し、その結果に基づき作成されたものが、検索装置２００に与えられていても良い。 FIG. 6 is a diagram showing an example of an impression word map according to the first embodiment. The impression word map 221 of the present embodiment is created, for example, by an administrator or the like of the search system 100 and given to the search device 200 in advance. More specifically, the impression word map 221 is created based on the results of, for example, conducting a questionnaire or the like for a large number of people to answer the strength of the relationship between words and impression words. It may be provided in device 200 .

本実施形態の印象語マップ２２１では、単語毎に、予め用意された印象語との関連の強さを示すスコア（値）が対応付けられている。尚、印象語マップ２２１に含まれる単語は、検索対象文書データベース４００に格納された文書データに含まれると推定される任意の単語であって良い。 In the impression word map 221 of the present embodiment, each word is associated with a score (value) indicating the strength of the relationship with an impression word prepared in advance. The words included in the impression word map 221 may be arbitrary words presumed to be included in the document data stored in the search target document database 400 .

また、本実施形態のスコアは、例えば、０から１０までの値であり、スコアが「０」の場合は、単語と印象語との関連がなく、スコアが大きくなるほど、単語と印象語との関連が強いことを示す。 In addition, the score of the present embodiment is, for example, a value from 0 to 10. When the score is "0", there is no relationship between the word and the impression word. Indicates that the association is strong.

図６に示す印象語マップ２２１では、印象語として、「プレミア」、「わいわい」、「隠れ家」、「ひっそり」、「ヘルシー」等が挙げられている。 In the impression word map 221 shown in FIG. 6, impression words include "premium", "waiwai", "retreat", "quietly", and "healthy".

例えば、印象語マップ２２１では、単語「デート」は、印象語「プレミア」との関連の強さを示すスコアは「９．２１」であり、印象語「わいわい」との関連の強さを示すスコアは「３．２３」である。 For example, in the impression word map 221, the word "date" has a score of "9.21" indicating the strength of the relationship with the impression word "premier", indicating the strength of the relationship with the impression word "waiwai". The score is "3.23".

したがって、単語「デート」は、印象語「わいわい」よりも、印象語「プレミア」との関連の方が強いことがわかる。 Therefore, it can be seen that the word "date" has a stronger relationship with the impression word "premier" than with the impression word "waiwai".

次に、図７を参照して、本実施形態の印象空間学習部２４８の処理について説明する。図７は、印象空間学習部による処理を説明するフローチャートである。 Next, the processing of the impression space learning unit 248 of this embodiment will be described with reference to FIG. FIG. 7 is a flowchart for explaining processing by the impression space learning unit.

本実施形態の印象空間学習部２４８は、検索対象文書データベース４００を参照し、検索対象文書データベース４００に含まれる各文書データを入力として、各文書データに含まれる単語の分散表現を、教師なし学習する（ステップＳ７０１）。言い換えれば、印象空間学習部２４８は、各文書データに含まれる単語の意味ベクトルを内部的に保持する。 The impression space learning unit 248 of this embodiment refers to the search target document database 400, receives each document data included in the search target document database 400 as an input, and performs unsupervised learning of the distributed representation of the words included in each document data. (step S701). In other words, the impression space learning unit 248 internally retains semantic vectors of words included in each document data.

続いて、印象空間学習部２４８は、印象語マップ２２１を参照し、各単語の分散表現に対する、印象語マップ２２１に含まれる印象語に対応するスコアを教師あり学習する（ステップＳ７０２）。言い換えれば、印象空間学習部２４８は、印象語マップ２２１を参照して、各単語の意味ベクトルと、この単語と対応付けられた各印象語との関連の強さを示すスコアとの関係を機械学習しておく。 Subsequently, the impression space learning unit 248 refers to the impression word map 221 and performs supervised learning of scores corresponding to the impression words included in the impression word map 221 for the distributed representation of each word (step S702). In other words, the impression space learning unit 248 refers to the impression word map 221 to mechanically calculate the relationship between the semantic vector of each word and the score indicating the strength of the relationship between the word and each impression word associated with the word. keep learning.

続いて、印象空間学習部２４８は、学習した結果を変換モデル２５０として出力し、保持する（ステップＳ７０３）。つまり、本実施形態の変換モデル２５０は、分散表現によって得られた単語の意味ベクトルを入力とし、この単語と対応付けられた印象語毎のスコアを出力とするモデルである。 Subsequently, the impression space learning unit 248 outputs and holds the learning result as the conversion model 250 (step S703). In other words, the conversion model 250 of this embodiment is a model that receives as input the semantic vector of a word obtained by distributed representation and outputs the score for each impression word associated with this word.

以上が、インデックス生成部２４７よる事前処理と、印象空間学習部２４８による事前処理である。 The preliminary processing by the index generating unit 247 and the preliminary processing by the impression space learning unit 248 have been described above.

次に、図８を参照して、本実施形態の検索対象スコア記憶部２３０に格納された印象語スコアリスト２３１について説明する。 Next, the impression word score list 231 stored in the search target score storage unit 230 of the present embodiment will be described with reference to FIG.

図８は、印象語スコアリストの一例を示す図である。本実施形態の印象語スコアリスト２３１は、例えば、検索処理部２４０の解析部２４２と印象語スコア出力部２４４の処理によって得られるリストである。本実施形態の印象語スコアリスト２３１は、検索部２４３による検索結果を出力部２４６によりレーダチャートとして表示させる際に、参照される。 FIG. 8 is a diagram showing an example of an impression word score list. The impression word score list 231 of the present embodiment is, for example, a list obtained by the processing of the analysis unit 242 and the impression word score output unit 244 of the search processing unit 240 . The impression word score list 231 of the present embodiment is referred to when the search result by the search unit 243 is displayed as a radar chart by the output unit 246 .

印象語スコアリスト２３１は、検索対象文書データベース４００に含まれる文書データの文書ＩＤと、印象語スコアリスト２３１に含まれる各印象語との関連の強さを示すスコアを示している。尚、印象語スコアリスト２３１に含まれる印象語は、印象語マップ２２１に含まれる印象語と同様である。 The impression word score list 231 indicates scores indicating the strength of the relationship between the document IDs of the document data included in the search target document database 400 and each impression word included in the impression word score list 231 . The impression words included in the impression word score list 231 are the same as the impression words included in the impression word map 221 .

本実施形態の検索処理部２４０は、例えば、解析部２４２により、検索対象文書データベース４００に格納された文書データから単語を抽出する。そして、検索処理部２４０は、抽出された単語を、印象語スコア出力部２４４によって意味ベクトルとした後に変換モデル２５０へ入力し、単語毎に、各印象語との関連の強さを示すスコアを取得する。 The search processing unit 240 of this embodiment, for example, extracts words from the document data stored in the search target document database 400 by the analysis unit 242 . Then, the search processing unit 240 converts the extracted words into semantic vectors by the impression word score output unit 244, inputs them to the conversion model 250, and calculates a score indicating the strength of the relationship with each impression word for each word. get.

本実施形態の印象語スコアリスト２３１では、このスコアを文書ＩＤと対応付けられた各印象語のスコアとしても良い。 In the impression word score list 231 of this embodiment, this score may be the score of each impression word associated with the document ID.

このようにすれば、例えば、検索対象文書データベース４００に格納された文書データから１つの単語が抽出された場合には、この単語と、各印象語との関連の強さを示すスコアが、この文書データと各印象語との関連の強さを示すスコアとなる。 In this way, for example, when one word is extracted from the document data stored in the search target document database 400, the score indicating the strength of the relationship between this word and each impression word is A score that indicates the strength of the relationship between the document data and each impression word.

また、例えば、文書データから複数の単語が抽出された場合、複数の単語について、各印象語との関連の強さを示すスコアが出力される。したがって、本実施形態では、各印象語について、複数の単語のスコアの最大値や平均値を求め、各印象語のスコアの最大値や平均値を、この文書データと各印象語との関連の強さを示すスコアとしても良い。 Also, for example, when a plurality of words are extracted from the document data, a score indicating the strength of the relationship between the plurality of words and each impression word is output. Therefore, in this embodiment, for each impression word, the maximum value and average value of the scores of a plurality of words are obtained, and the maximum value and average value of the scores of each impression word are used as the relationship between this document data and each impression word. It may be a score that indicates strength.

具体的には、例えば、文書データから「デート」と「おいしい」という２つの単語が抽出されたとする。この場合、検索処理部２４０は、単語「デート」と、印象語マップ２２１に含まれる各印象語との関連の強さを示すスコアと、単語「おいしい」と、印象語マップ２２１に含まれる各印象語との関連の強さを示すスコアと、を取得する。 Specifically, for example, it is assumed that two words "date" and "delicious" are extracted from the document data. In this case, the search processing unit 240 obtains a score indicating the strength of the relationship between the word “date” and each impression word included in the impression word map 221, the word “delicious”, Acquire a score indicating the strength of the relationship with the impression word.

そして、検索処理部２４０は、印象語毎に、単語「デート」と対応するスコアと、単語「おいしい」と対応するスコアとの最大値を求める。例えば、単語「デート」と印象語「プレミア」との関連の強さを示すスコアが９．２１であり、単語「おいしい」と印象語「プレミア」との関連の強さを示すスコアが９．３８であったとする。この場合、検索処理部２４０は、９．２１と９．３８における最大値９．３８を、この文書データと、印象語「プレミア」との関連の強さを示すスコアとすれば良い。 Then, the search processing unit 240 obtains the maximum value of the score corresponding to the word "date" and the score corresponding to the word "delicious" for each impression word. For example, the score indicating the strength of association between the word "date" and the impression word "premier" is 9.21, and the score indicating the strength of association between the word "delicious" and the impression word "premier" is 9.21. Suppose it was 38. In this case, the search processing unit 240 may set the maximum value of 9.38 between 9.21 and 9.38 as the score indicating the strength of the relationship between this document data and the impression word "premium".

次に、図９を参照して、本実施形態の検索処理部２４０の処理について説明する。図９は、第一の実施形態の検索処理部の処理を説明するフローチャートである。 Next, referring to FIG. 9, processing of the search processing unit 240 of this embodiment will be described. FIG. 9 is a flowchart for explaining the processing of the search processing section of the first embodiment.

本実施形態の検索処理部２４０は、入力受付部２４１がクエリの入力を受け付けると、解析部２４２により、クエリから形態素解析等により検索語群を取得し、検索部２４３により、インデックス情報２１１を検索語群で検索し、検索語群と対応する文書ＩＤを特定する（ステップＳ９０１）。尚、本実施形態では、ステップＳ９０１で特定された文書ＩＤが示す文書データが、検索結果集合データとなる。 When the input receiving unit 241 receives the input of the query, the search processing unit 240 of the present embodiment acquires a search word group from the query by morphological analysis or the like using the analyzing unit 242, and searches the index information 211 using the searching unit 243. A word group is searched, and a document ID corresponding to the search word group is specified (step S901). Note that, in this embodiment, the document data indicated by the document ID specified in step S901 becomes the search result set data.

続いて、検索処理部２４０は、印象語スコア出力部２４４により、入力されたクエリに基づき、クエリとの関連が明確な印象語を軸としてｎ個抽出し、軸決定部２４５によりα軸に設定する（ステップＳ９０２）。ステップＳ９０２における軸とは、レーダチャートの軸であり、検索結果集合データの属性とされる項目の一つである。ステップＳ９０２の処理の詳細は後述する。また、本実施形態のα軸は、クエリとの関連の強さを示すスコアの値が大きく、クエリとの関連が強く、対応関係が明確である、とされる印象語が設定される軸である。 Subsequently, the search processing unit 240 extracts n impression words that are clearly related to the query based on the input query by the impression word score output unit 244, and sets them to the α axis by the axis determination unit 245. (step S902). The axis in step S902 is the axis of the radar chart, and is one of the items that are attributes of the search result set data. Details of the processing in step S902 will be described later. In addition, the α-axis of the present embodiment is an axis on which impression words are set that have a large score indicating the strength of the relationship with the query, have a strong relationship with the query, and have a clear correspondence relationship. be.

続いて、検索処理部２４０は、軸決定部２４５により、予め設定された単語をｍ個抽出し、β軸に設定する（ステップＳ９０３）。ここで、予め設定される単語は、例えば、印象語マップ２２１に含まれる印象語の中から選択されて設定されていても良い。また、予め設定される単語は、インデックス情報２１１に含まれる単語から選択されて設定されても良い。言い換えれば、予め設定された単語とは、検索結果集合データに含まれる単語であっても良い。 Subsequently, the search processing unit 240 extracts m preset words by the axis determination unit 245 and sets them on the β axis (step S903). Here, the preset words may be selected and set from impression words included in the impression word map 221, for example. Also, the preset words may be selected from the words included in the index information 211 and set. In other words, the preset words may be words included in the search result set data.

また、本実施形態のβ軸とは、レーダチャートにおける全ての軸のうち、α軸以外の軸を示す。つまり、β軸は、α軸と比較して、クエリとの関連が強くない印象語が設定される軸と言える。 Further, the β-axis in this embodiment indicates an axis other than the α-axis among all the axes in the radar chart. In other words, the β axis can be said to be an axis on which impression words that are not strongly related to the query are set compared to the α axis.

続いて、出力部２４６は、α軸とβ軸とを出力軸とし（ステップＳ９０４）、出力部２４６により、ステップＳ９０１で検索部２４３が取得した文書ＩＤが示す検索結果集合データをレーダチャートに描画し（ステップＳ９０５）、処理を終了する。 Subsequently, the output unit 246 uses the α axis and the β axis as output axes (step S904), and draws the search result set data indicated by the document ID acquired by the search unit 243 in step S901 on a radar chart. (step S905), and the process ends.

次に、図１０を参照して、図９のステップＳ９０２の処理について説明する。図１０は、第一の実施形態の印象語スコア出力部及び軸決定部の処理を説明するフローチャートである。図１０の処理は、印象語スコア出力部２４４と軸決定部２４５の処理を示している。 Next, the processing of step S902 in FIG. 9 will be described with reference to FIG. FIG. 10 is a flowchart for explaining the processing of the impression word score output unit and the axis determination unit of the first embodiment. The processing of FIG. 10 shows the processing of the impression word score output section 244 and the axis determination section 245 .

本実施形態の印象語スコア出力部２４４は、ステップＳ９０１において取得した検索語群に含まれる各単語を、分散表現に変換する（ステップＳ１００１）。言い換えれば、印象語スコア出力部２４４は、検索語群に含まれる各単語について、分散表現により意味ベクトルに変換する。 The impression word score output unit 244 of the present embodiment converts each word included in the search word group acquired in step S901 into distributed representation (step S1001). In other words, the impression word score output unit 244 converts each word included in the search word group into a semantic vector using distributed representation.

続いて、印象語スコア出力部２４４は、変換モデル２５０を参照し、単語毎に、各印象語との関連の強さを示すスコアを取得する（ステップＳ１００２）。言い換えれば、印象語スコア出力部２４４は、変換モデル２５０に対して、ステップＳ１００１で求めた各単語の意味ベクトルを入力し、変換モデル２５０から出力された、各単語についての印象語毎のスコアを取得する。 Subsequently, the impression word score output unit 244 refers to the conversion model 250 and obtains a score indicating the strength of the relationship between each word and each impression word (step S1002). In other words, impression word score output unit 244 inputs the semantic vector of each word obtained in step S1001 to conversion model 250, and calculates the score for each impression word for each word output from conversion model 250. get.

続いて、軸決定部２４５は、各単語と対応する印象語毎のスコアの最大値を取得する（ステップＳ１００３）。 Subsequently, the axis determining unit 245 acquires the maximum score for each impression word corresponding to each word (step S1003).

続いて、軸決定部２４５は、印象語毎のスコアの最大値の上位ｎ個と対応する印象語を、クエリと関連するα軸として抽出する（ステップＳ１００４）。言い換えれば、軸決定部２４５は、印象語毎のスコアの最大値が大きい順に、ｎ個の印象語を抽出し、α軸とする。 Subsequently, the axis determination unit 245 extracts the impression words corresponding to the top n highest score values for each impression word as the α axis related to the query (step S1004). In other words, the axis determination unit 245 extracts n impression words in descending order of the maximum score for each impression word, and uses them as the α-axis.

以下に、図１１を参照して、本実施形態の印象語スコア出力部２４４と軸決定部２４５の処理を具体的に説明する。図１１は、印象語スコア出力部と軸決定部の処理を説明する図である。 The processing of the impression word score output unit 244 and the axis determination unit 245 of this embodiment will be specifically described below with reference to FIG. FIG. 11 is a diagram for explaining the processing of the impression word score output section and the axis determination section.

図１１では、ユーザにより入力されたクエリを「ひっそりとしたプレミアのレストラン」として説明する。 FIG. 11 describes the query entered by the user as "secret premier restaurants".

本実施形態の検索処理部２４０は、このクエリの入力を受け付けると、解析部２４２により、検索語群１１１を取得する。検索語群には、単語「ひっそり」、「プレミア」、「レストラン」が含まれる。 The search processing unit 240 of the present embodiment acquires the search term group 111 by the analysis unit 242 when receiving the input of this query. The search term group includes the words "quiet", "premium", and "restaurant".

次に検索処理部２４０は、印象語スコア出力部２４４により、検索語群１１１に含まれる各単語の意味ベクトル１１２を得る。 Next, the search processing unit 240 obtains the semantic vector 112 of each word included in the search word group 111 by the impression word score output unit 244 .

図１１に示す意味ベクトル１１２では、単語「ひっそり」、「プレミア」、「レストラン」のそれぞれが、ベクトルとして示される。 In the semantic vector 112 shown in FIG. 11, each of the words "quiet", "premium", and "restaurant" is shown as a vector.

印象語スコア出力部２４４は、意味ベクトル１１２が得られると、意味ベクトル１１２を、変換モデル２５０の入力として、各単語について、印象語マップ２２１に含まれる印象語毎のスコアを得る。図１１では、検索語群に含まれる各単語と、印象語毎のスコアとを対応付けた情報を、単語毎印象語スコア１１３と呼ぶ。 When the semantic vector 112 is obtained, the impression word score output unit 244 uses the semantic vector 112 as an input to the conversion model 250 to obtain a score for each impression word included in the impression word map 221 for each word. In FIG. 11, the information in which each word included in the search word group is associated with the score for each impression word is called an impression word score for each word 113 .

次に、軸決定部２４５は、単語毎印象語スコア１１３から、印象語毎のスコアの最大値を取得する。 Next, the axis determination unit 245 acquires the maximum score for each impression word from the impression word score for each word 113 .

例えば、単語毎印象語スコア１１３では、印象語「プレミア」に対して、単語「ひっそり」との関連の強さを示すスコアは３．４９であり、単語「プレミア」との関連の強さを示すスコアは１０．００（満点）であり、単語「レストラン」との関連の強さを示すスコアは７．９５である。 For example, in the impression word score 113 for each word, the score indicating the strength of the relationship between the impression word "premier" and the word "quietly" is 3.49. The indicated score is 10.00 (perfect score), and the score indicating the strength of association with the word "restaurant" is 7.95.

したがって、軸決定部２４５は、印象語「プレミア」のスコアの最大値として「１０．００」を取得する。 Therefore, the axis determining unit 245 acquires "10.00" as the maximum score of the impression word "premium".

また、単語毎印象語スコア１１３では、印象語「ヘルシー」に対して、単語「ひっそり」との関連の強さを示すスコアは３．４８であり、単語「プレミア」との関連の強さを示すスコアは８．３３であり、単語「レストラン」との関連の強さを示すスコアは６．７２である。 Further, in the impression word score 113 for each word, the score indicating the strength of the relationship between the impression word "healthy" and the word "quietly" is 3.48. The indicated score is 8.33 and the score indicating the strength of association with the word "restaurant" is 6.72.

したがって、軸決定部２４５は、印象語「ヘルシー」のスコアの最大値として「８．３３」を取得する。 Therefore, the axis determination unit 245 acquires "8.33" as the maximum score of the impression word "healthy".

また、単語毎印象語スコア１１３では、印象語「隠れ家」に対して、単語「ひっそり」との関連の強さを示すスコアは９．３８であり、単語「プレミア」との関連の強さを示すスコアは６．２３であり、単語「レストラン」との関連の強さを示すスコアは３．４６である。 Further, in the impression word score 113 for each word, the score indicating the strength of the relationship between the impression word "kakure" and the word "quietly" is 9.38, indicating the strength of the relationship with the word "premier". The indicated score is 6.23 and the score indicating the strength of association with the word "restaurant" is 3.46.

したがって、軸決定部２４５は、印象語「隠れ家」のスコアの最大値として「９．３８」を取得する。 Therefore, the axis determination unit 245 acquires "9.38" as the maximum score of the impression word "hideaway".

本実施形態の軸決定部２４５は、このようにして取得した印象語毎のスコアの最大値を、クエリと対応した印象語スコアリスト１１４として保持しても良い。この印象語スコアリスト１１４は、例えば、後述するレーダチャートの表示の際に、参照されても良い。 The axis determination unit 245 of the present embodiment may hold the maximum value of scores for each impression word thus obtained as the impression word score list 114 corresponding to the query. This impression word score list 114 may be referred to, for example, when displaying a radar chart, which will be described later.

続いて、軸決定部２４５は、印象語スコアリスト１１４において、スコアの最大値が大きい印象語から順に、ｎ個の印象語を抽出して、α軸とする。図１１の例では、ｎ＝２としている。 Subsequently, the axis determination unit 245 extracts n impression words in descending order of the maximum score from the impression word score list 114, and defines them as the α-axis. In the example of FIG. 11, n=2.

印象語スコアリスト１１４において、スコアの最大値が最も大きい印象語は「プレミア」であり、次にスコアの最大値が大きい印象語は「隠れ家」である。 In the impression word score list 114, the impression word with the highest maximum score is "premier", and the impression word with the next highest maximum score is "hideaway".

したがって、本実施形態の軸決定部２４５は、印象語「プレミア」と印象語「隠れ家」をレーダチャートのα軸に決定する。言い換えれば、軸決定部２４５は、印象語「プレミア」と印象語「隠れ家」とを、検索結果集合データを表示する際の項目とする。 Therefore, the axis determining unit 245 of the present embodiment determines the impression word "premier" and the impression word "hideaway" to be the α-axis of the radar chart. In other words, the axis determination unit 245 sets the impression word "premier" and the impression word "hideaway" as items when displaying the search result set data.

このように、本実施形態では、入力されたクエリから得られた検索語群に基づき、クエリと関連が強いされる印象語を抽出する。つまり、ここで抽出される印象語は、クエリから推定される内容の特徴を示す特徴語群（第１特徴語群）と言える。したがって、本実施形態の軸決定部２４５は、クエリから第１特徴語群を抽出する抽出部の機能を果たす。 As described above, in this embodiment, impression words strongly related to the query are extracted based on the search word group obtained from the input query. In other words, the impression words extracted here can be said to be a feature word group (first feature word group) indicating the feature of the content estimated from the query. Therefore, the axis determining unit 245 of this embodiment functions as an extracting unit that extracts the first feature word group from the query.

また、本実施形態の軸決定部２４５は、単語毎印象語スコア１１３が取得されると、印象語毎のスコアの最大値に応じて、α軸とする印象語を決定するものとしたが、これに限定されない。軸決定部２４５は、例えば、単語毎印象語スコア１１３が取得されると、各印象語について、スコアを正規化した値が大きい順にｎ個の印象語を抽出しても良い。また、軸決定部２４５は、単語毎印象語スコア１１３が取得されると、各印象語について、スコアの平均値が大きい順にｎ個の印象語を抽出しても良い。 Further, when the per-word impression word score 113 is obtained, the axis determining unit 245 of the present embodiment determines the impression word to be used as the α-axis according to the maximum value of the score per impression word. It is not limited to this. For example, when the per-word impression word score 113 is obtained, the axis determining unit 245 may extract n impression words in descending order of normalized scores for each impression word. Further, when the per-word impression word score 113 is obtained, the axis determination unit 245 may extract n impression words in descending order of the average score for each impression word.

言い換えれば、本実施形態の印象語スコアリスト１１４において、各印象語と対応付けられる値は、単語毎印象語スコア１１３における印象語毎のスコアの最大値に限定されない。印象語スコアリスト１１４において、各印象語と対応付けられる値は、単語毎印象語スコア１１３における印象語毎のスコアを正規化した値であっても良いし、単語毎印象語スコア１１３における印象語毎のスコアの平均値であっても良い。 In other words, in the impression word score list 114 of this embodiment, the value associated with each impression word is not limited to the maximum score for each impression word in the per-word impression word score 113 . In the impression word score list 114, the value associated with each impression word may be a value obtained by normalizing the score for each impression word in the impression word score for each word 113, or the score for each impression word in the impression word score for each word 113. It may be the average value of the scores for each.

また、本実施形態では、スコアと対応した印象語をα軸として抽出するものとしたが、これに限定されない。 Also, in the present embodiment, impression words corresponding to scores are extracted as the α axis, but the present invention is not limited to this.

軸決定部２４５は、例えば、印象語との関連の強さを示すスコアが最も大きい単語をα軸として抽出しても良い。 The axis determining unit 245 may extract, for example, the word with the highest score indicating the strength of association with the impression word as the α axis.

例えば、軸決定部２４５は、印象語毎のスコアが大きい順に、印象語「プレミア」、「隠れ家」を抽出する。このとき、軸決定部２４５は、単語毎印象語スコア１１３において、印象語「隠れ家」との関連の強さを示すスコアが最も大きい単語「ひっそり」をα軸として抽出しても良い。 For example, the axis determination unit 245 extracts the impression words "premier" and "hideout" in descending order of the score for each impression word. At this time, the axis determination unit 245 may extract the word "quietly" having the highest score indicating the strength of the relationship with the impression word "hide" in the word-by-word impression word score 113 as the α-axis.

このように、本実施形態では、ユーザが入力したクエリに含まれる検索語群から、単語の意味ベクトルを取得し、この意味ベクトルを変換モデル２５０の入力とする。そして、本実施形態では、変換モデル２５０の出力として、検索語群に含まれる単語毎に、複数の印象語との関連の強さを示すスコアを取得する。 As described above, in this embodiment, the semantic vector of a word is obtained from the search term group included in the query input by the user, and the semantic vector is used as the input of the conversion model 250 . Then, in this embodiment, as an output of the conversion model 250, a score indicating the strength of the relationship with a plurality of impression words is obtained for each word included in the search word group.

したがって、本実施形態では、予め用意された単語以外の新たな単語がクエリに含まれていても、新たな単語と、複数の印象語との関連の強さを示すスコアを取得することができる。このため、本実施形態によれば、ユーザが入力するあらゆるクエリに対して、クエリとの関連が強く、対応関係が明確である、とされる印象語（項目）を用いて、検索結果を表示させることができる。 Therefore, in this embodiment, even if a new word other than words prepared in advance is included in the query, it is possible to obtain a score indicating the strength of the relationship between the new word and a plurality of impression words. . Therefore, according to the present embodiment, the search results are displayed using impression words (items) that are strongly related to the query and have a clear correspondence relationship with respect to every query input by the user. can be made

次に、図１２乃至図１４を参照して、本実施形態の検索処理部２４０によるレーダチャートの表示について説明する。 Next, display of a radar chart by the search processing unit 240 of the present embodiment will be described with reference to FIGS. 12 to 14. FIG.

図１２は、第一の実施形態のレーダチャートの表示例を示す第一の図である。図１２に示す画面１２１は、入力されたクエリが「ひっそりとしたプレミアのレストラン」である場合の検索結果集合データとして得られた「○○レストラン」のレーダチャートを示している。 FIG. 12 is a first diagram showing a display example of a radar chart according to the first embodiment. A screen 121 shown in FIG. 12 shows a radar chart of "○○ restaurant" obtained as search result set data when the input query is "quiet premier restaurant".

尚、図１２の例では、検索結果集合データの例として、「○○レストラン」のみを示しているが、これに限定されない。検索結果集合データとして、複数の検索結果が取得された場合には、画面１２１には、検索結果毎のレーダチャートが表示されても良い。 In addition, in the example of FIG. 12, only "○○ restaurant" is shown as an example of the set data of search results, but it is not limited to this. When a plurality of search results are obtained as search result set data, the screen 121 may display a radar chart for each search result.

また、図１２の例では、検索結果集合データは、「○○レストラン」の説明文を示す文書データ等であっても良く、検索結果表示欄１２３に、「○○レストラン」の説明文が表示されても良い。 Further, in the example of FIG. 12, the search result set data may be document data or the like indicating the description of "○○ restaurant", and the description of "○○ restaurant" is displayed in the search result display field 123. May be.

また、本実施形態では、検索結果集合データに含まれる検索結果毎に、ランキングが付与されても良い。具体的には、本実施形態では、検索結果集合データ毎に、レーダチャートの軸毎の得点の合計点や、グラフで示される多角形の面積を、クエリと検索結果集合データとの関連の強さを示す値として算出し、この値が大きい順に、検索結果集合データにランキングを付与しても良い。 Also, in this embodiment, a ranking may be given to each search result included in the search result set data. Specifically, in this embodiment, for each set of search result data, the total score for each axis of the radar chart and the area of the polygon shown in the graph are used to determine the strength of the relationship between the query and the search result set data. It may be calculated as a value indicating the degree of difficulty, and the search result set data may be ranked in descending order of this value.

つまり、本実施形態では、検索結果集合データは、クエリとの関連の強い順にランキングが付与される。本実施形態では、このようにランキングを付与することで、ユーザに対し、クエリと関連が強い検索結果から順に提示することができる。 That is, in the present embodiment, search result set data is ranked in descending order of relevance to the query. In this embodiment, by assigning rankings in this way, it is possible to present search results to the user in descending order of relation to the query.

本実施形態の画面１２１には、クエリ表示欄１２２、検索結果表示欄１２３、検索結果表示欄１２４が含まれる。 The screen 121 of this embodiment includes a query display field 122 , a search result display field 123 , and a search result display field 124 .

クエリ表示欄１２２には、ユーザによって入力されたクエリが表示される。検索結果表示欄１２３には、検索結果集合データに含まれる検索結果と対応したレーダチャート１２５が表示される。レーダチャート１２５では、検索結果集合データと、レーダチャート１２５の軸とされた項目との関係性を示すグラフ１２６が表示されている。検索結果表示欄１２４には、レーダチャート１２５の軸毎の値を示す棒グラフ１２７が表示される。 A query input by the user is displayed in the query display field 122 . The search result display field 123 displays a radar chart 125 corresponding to the search results included in the search result set data. The radar chart 125 displays a graph 126 that shows the relationship between the search result set data and the items that are the axes of the radar chart 125 . A bar graph 127 showing values for each axis of the radar chart 125 is displayed in the search result display field 124 .

本実施形態のレーダチャート１２５は、α１軸、α２軸と、β１軸、β２軸、β３軸の５本の軸を有している。言い換えれば、レーダチャート１２５は、検索結果の「○○レストラン」を、クエリに基づき抽出された２つの印象語と、クエリにも基づき選択された印象語以外の３つの印象語と、を５つの項目として用いて表示している。 The radar chart 125 of this embodiment has five axes: α1 axis, α2 axis, β1 axis, β2 axis, and β3 axis. In other words, the radar chart 125 divides the search result “○○ restaurant” into five impression words, two impression words extracted based on the query, and three impression words other than the impression word selected also based on the query. It is used as an item and displayed.

α１軸に対応する項目は「隠れ家」であり、α２軸に対応する項目は「プレミア」である。また、β１軸に対応する項目は「落ち着き」であり、β２軸に対応する項目は「ガッツリ」であり、β３軸に対応する項目は「ヘルシー」である。 The item corresponding to the α1 axis is "hideaway", and the item corresponding to the α2 axis is "premium". The item corresponding to the β1 axis is "calm", the item corresponding to the β2 axis is "strong", and the item corresponding to the β3 axis is "healthy".

ここで、α１軸とα２軸に対応する項目は、図９のステップＳ９０２で抽出された印象語である。つまり、α１軸とα２軸に対応する項目は、クエリとの関連が明確な印象語である。 Here, the items corresponding to the α1 axis and the α2 axis are the impression words extracted in step S902 of FIG. In other words, the items corresponding to the α1 axis and the α2 axis are impression words that are clearly related to the query.

また、β１軸、β２軸、β３軸に対応する項目は、図９のステップＳ９０３で設定された単語である。つまり、β１軸、β２軸、β３軸に対応する項目は、予め設定された単語である。 Items corresponding to the β1 axis, β2 axis, and β3 axis are the words set in step S903 of FIG. That is, the items corresponding to the β1-axis, β2-axis, and β3-axis are preset words.

尚、図１２の例では、β１軸、β２軸、β３軸に対応する項目は、印象語マップ２２１から選択された印象語としているが、これに限定されない。 In the example of FIG. 12, the items corresponding to the β1-axis, β2-axis, and β3-axis are impression words selected from the impression word map 221, but the present invention is not limited to this.

β１軸、β２軸、β３軸に対応する項目は、例えば、インデックス情報２１１から選択された単語であっても良い。つまり、β１軸、β２軸、β３軸に対応する項目は、検索結果集合データから抽出された単語であっても良い。ここで、検索結果集合データから抽出された単語とは、検索結果集合データの特徴を示す特徴語群（第２特徴語群）と言える。 Items corresponding to the β1-axis, β2-axis, and β3-axis may be words selected from the index information 211, for example. That is, the items corresponding to the β1-axis, β2-axis, and β3-axis may be words extracted from the search result set data. Here, the words extracted from the search result set data can be said to be a characteristic word group (second characteristic word group) that indicates the characteristics of the search result set data.

したがって、本実施形態のレーダチャート１２５では、クエリから抽出された特徴語群と、検索結果集合データから抽出された特徴語群と、を項目として用いて検索結果集合データを表示させている。 Therefore, in the radar chart 125 of this embodiment, the search result set data is displayed using the feature word group extracted from the query and the feature word group extracted from the search result set data as items.

ここで、本実施形態の検索処理部２４０において、レーダチャート１２５の各軸に対応する項目が決定された後の、出力部２４６によるグラフ１２６の表示について説明する。 Here, the display of the graph 126 by the output unit 246 after the items corresponding to the respective axes of the radar chart 125 are determined in the search processing unit 240 of this embodiment will be described.

本実施形態の出力部２４６は、レーダチャート１２５の各軸に対応する項目が決まると、印象語スコアリスト２３１を参照する。 The output unit 246 of this embodiment refers to the impression word score list 231 when the items corresponding to the respective axes of the radar chart 125 are determined.

そして、出力部２４６は、印象語スコアリスト２３１において、検索結果集合データとして取得された文書データの文書ＩＤと対応する印象語毎のスコアのうち、項目とされた印象語と対応するスコアを取得し、レーダチャートにおける項目の値とする。 Then, the output unit 246 acquires the score corresponding to the impression word selected from among the scores for each impression word corresponding to the document ID of the document data acquired as the set data of the search results in the impression word score list 231. be the value of the item on the radar chart.

例えば、検索結果集合データとして取得された文書データが、文書ＩＤ「１」の文書データであったとする。 For example, assume that the document data acquired as the search result set data is the document data with the document ID "1".

この場合、出力部２４６は、印象語スコアリスト２３１を参照し、文書ＩＤ「１」と対応する印象語毎のスコアのうち、α１軸と対応する項目とされた「隠れ家」と対応するスコア「９．２」を取得し、レーダチャート１２５におけるα１軸の値とする。また、出力部２４６は、文書ＩＤ「１」と対応する印象語毎のスコアのうち、α２軸と対応する項目とされた「プレミア」と対応するスコア「９．０」を取得し、レーダチャート１２５におけるα２軸の値とする。 In this case, the output unit 246 refers to the impression word score list 231, and among the scores for each impression word corresponding to the document ID "1", the score corresponding to the item "hideout" corresponding to the α1 axis. 9.2” and set it as the value of the α1 axis in the radar chart 125 . In addition, the output unit 246 acquires the score “9.0” corresponding to “premium”, which is the item corresponding to the α2 axis, among the scores for each impression word corresponding to the document ID “1”, and obtains the score “9.0”. 125 is the value of the α2 axis.

本実施形態の出力部２４６は、同様にして、印象語スコアリスト２３１から、β１軸～β３の値を取得する。そして、出力部２４６は、各軸の値に基づき、グラフ１２６を表示させる。 Similarly, the output unit 246 of this embodiment acquires the values of the β1 axis to β3 from the impression word score list 231 . Then, the output unit 246 displays the graph 126 based on the values of each axis.

尚、本実施形態の印象語スコアリスト２３１は、事前処理により生成されていても良い。また、印象語スコアリスト２３１は、検索部２４３による検索によって検索結果集合データが取得されたときに、検索結果集合データから、印象語毎のスコアを求めて印象語スコアリスト２３１に格納しても良い。 Note that the impression word score list 231 of the present embodiment may be generated by preprocessing. Further, the impression word score list 231 may obtain a score for each impression word from the search result set data and store it in the impression word score list 231 when the search result set data is acquired by the search unit 243 . good.

検索結果集合データから印象語毎のスコアを求める場合、検索処理部２４０は、解析部２４２により、検索結果集合データから単語群を取得した後に、図１０のステップＳ１００１からステップＳ１００３までの処理を実行すれば良い。そして、検索処理部２４０は、この処理を実行した結果である、印象語毎のスコアを、検索結果集合データを示す文書ＩＤと対応付けて印象語スコアリスト２３１に格納すれば良い。 When obtaining the score for each impression word from the search result set data, the search processing unit 240 executes the processing from step S1001 to step S1003 in FIG. 10 after acquiring word groups from the search result set data by the analysis unit 242 do it. Then, the search processing unit 240 may store the score for each impression word, which is the result of executing this process, in the impression word score list 231 in association with the document ID indicating the set data of the search results.

本実施形態では、このように、レーダチャート１２５の軸を、クエリに基づき抽出された、クエリと関連の強い印象語とすることで、検索結果がユーザの意図したものであることをユーザに容易に判断させることができる。したがって、本実施形態によれば、検索システム１００のユーザに対し、検索結果に対する納得感を提供することができる。 In this embodiment, by setting the axis of the radar chart 125 to the impression words strongly related to the query extracted based on the query, it is easy for the user to know that the search result is what the user intended. can be made to judge. Therefore, according to this embodiment, it is possible to provide the user of the search system 100 with a sense of satisfaction with respect to the search results.

以下に、図１３を参照して、本実施形態を適用しない場合の比較例について説明する。図１３は、比較例となるレーダチャートの一例を示す図である。 A comparative example in which the present embodiment is not applied will be described below with reference to FIG. 13 . FIG. 13 is a diagram showing an example of a radar chart serving as a comparative example.

図１３に示す画面１３１は、入力されたクエリが表示されるクエリ表示欄１３２と、検索結果が表示される検索結果表示欄１３３と、レーダチャート１３４とが表示されている。 A screen 131 shown in FIG. 13 displays a query display field 132 in which an input query is displayed, a search result display field 133 in which search results are displayed, and a radar chart 134 .

図１３に示すレーダチャート１３４に示す軸１～５に対応する項目は、検索結果として取得された「○○レストラン」と対応付けられた文書データの中から選択された単語である。 Items corresponding to the axes 1 to 5 shown in the radar chart 134 shown in FIG. 13 are words selected from document data associated with "○○ restaurant" obtained as a search result.

例えば、軸１と対応する項目は「安い」であり、軸２と対応する項目は「解放感」であり、軸３と対応する項目は「個室あり」であり、軸４と対応する項目は「わいわい」であり、軸５と対応する項目は「ガッツリ」である。 For example, the item corresponding to axis 1 is "Cheap", the item corresponding to axis 2 is "feeling of openness", the item corresponding to axis 3 is "with private room", and the item corresponding to axis 4 is The item that corresponds to Axis 5 is ``Gattsuri''.

この軸１～５に対応する項目に対して、入力されたクエリは「ひっそりとしたプレミアのレストラン」である。 For items corresponding to this axis 1-5, the entered query is "secret premier restaurant".

このため、例えば、軸４と対応する項目「わいわい」は、クエリから連想されにくい項目であり、クエリとの対応関係が明確でない。 Therefore, for example, the item "wai wai" corresponding to axis 4 is an item that is difficult to associate with the query, and the correspondence with the query is not clear.

このように、検索結果を示すレーダチャートにおいて、ユーザが入力したクエリから連想されにくい単語が項目として用いられた場合、ユーザは、この検索結果が自身の意図した内容のものであるか否かを容易に判断できず、検索結果に対して納得感が得られない。 As described above, in the radar chart showing the search results, when words that are difficult to associate with the query input by the user are used as items, the user can determine whether the search results are what he or she intended. It is not easy to judge, and the search results are not convincing.

これに対し、本実施形態では、クエリに基づき抽出された、クエリと関連が強い印象語を項目として用いたレーダチャートにより、検索結果集合データを表示させるため、検索結果の評価を適切に行うことができる。 On the other hand, in the present embodiment, since search result set data is displayed by a radar chart using impression words that are extracted based on the query and are strongly related to the query as items, it is possible to appropriately evaluate the search results. can be done.

次に、図１４を参照して、本実施形態のレーダチャートの表示例について、さらに説明する。図１４は、第一の実施形態のレーダチャートの表示例を示す第二の図である。 Next, a display example of a radar chart according to the present embodiment will be further described with reference to FIG. 14 . FIG. 14 is a second diagram showing a display example of the radar chart according to the first embodiment.

図１４に示す画面１２１Ａでは、検索結果表示欄１２３Ａと検索結果表示欄１２４Ａとを含む。 A screen 121A shown in FIG. 14 includes a search result display column 123A and a search result display column 124A.

検索結果表示欄１２３Ａに表示されるレーダチャート１２５Ａには、グラフ１２６と、グラフ１２８とが表示される。 A graph 126 and a graph 128 are displayed on the radar chart 125A displayed in the search result display field 123A.

グラフ１２８は、クエリと、軸とされた項目と、の関係性を示すグラフである。グラフ１２８は、例えば、レーダチャート１２５Ａのα１軸、α２軸と対応する項目を抽出する処理で取得された印象語スコアリスト１１４を参照して、表示されても良い（図１１参照）。 A graph 128 is a graph showing the relationship between the query and the items used as axes. The graph 128 may be displayed by referring to the impression word score list 114 obtained by the process of extracting items corresponding to the α1 axis and α2 axis of the radar chart 125A (see FIG. 11).

より具体的には、出力部２４６は、印象語スコアリスト１１４において、α１軸、α２、β１、β２、β３のそれぞれと対応する項目とされた印象語のスコアを取得し、各軸の値として、グラフ１２８を表示させても良い。 More specifically, the output unit 246 acquires the score of the impression word corresponding to each of the α1 axis, α2, β1, β2, and β3 in the impression word score list 114, and sets , graph 128 may be displayed.

また、画面１２１Ａの検索結果表示欄１２４Ａには、クエリの各軸の値を示す棒グラフ１２９が表示される。 A bar graph 129 showing the values of each axis of the query is displayed in the search result display field 124A of the screen 121A.

本実施形態では、このように、レーダチャート１２５Ａにおいて、検索結果集合データと軸とされた項目との関係を示すグラフ１２６と、クエリと軸とされた項目との関係を示すグラフ１２８とを表示させる。したがって、本実施形態によれば、ユーザに対して、検索結果集合データとクエリとの関連の度合いを容易に把握させることができる。 In the present embodiment, the radar chart 125A thus displays the graph 126 showing the relationship between the search result set data and the axis item, and the graph 128 showing the relationship between the query and the axis item. Let Therefore, according to this embodiment, it is possible to allow the user to easily grasp the degree of association between the search result set data and the query.

（第二の実施形態）
以下に図面を参照して第二の実施形態について説明する。第二の実施形態は、軸βと対応する項目をクエリに基づき決定する点が、第一の実施形態と相違する。よって、以下の第二の実施形態の説明では、第一の実施形態との相違点についてのみ説明し、第一の実施形態と同様の機能構成を有するものには、第一の実施形態の説明で用いた符号と同様の符号を付与し、その説明を省略する。 (Second embodiment)
A second embodiment will be described below with reference to the drawings. The second embodiment differs from the first embodiment in that the items corresponding to the axis β are determined based on the query. Therefore, in the following description of the second embodiment, only the differences from the first embodiment will be described. are assigned the same reference numerals as those used in , and the description thereof is omitted.

図１５は、第二の実施形態の検索装置の機能を説明する図である。本実施形態の検索装置２００Ａは、検索処理部２４０Ａを有する。 FIG. 15 is a diagram for explaining the functions of the search device of the second embodiment. The search device 200A of this embodiment has a search processing unit 240A.

本実施形態の検索処理部２４０Ａは、入力受付部２４１、解析部２４２、検索部２４３、印象語スコア出力部２４４、軸決定部２４５Ａ、出力部２４６、インデックス生成部２４７、印象空間学習部２４８を有する。 The search processing unit 240A of this embodiment includes an input reception unit 241, an analysis unit 242, a search unit 243, an impression word score output unit 244, an axis determination unit 245A, an output unit 246, an index generation unit 247, and an impression space learning unit 248. have.

本実施形態の軸決定部２４５Ａは、印象語スコア出力部２４４により単語毎印象語スコア１１３を取得すると、単語毎印象語スコア１１３に含まれる印象語毎のスコアの最大値を示す印象語スコアリスト１１４を取得して保持する。そして、本実施形態の軸決定部２４５Ａは、印象語スコアリスト１１４を参照して、レーダチャートの軸α及び軸βと対応する印象語を決定する。 When the impression word score for each word 113 is acquired by the impression word score output unit 244, the axis determination unit 245A of the present embodiment generates an impression word score list indicating the maximum score for each impression word included in the impression word score for each word 113. 114 is acquired and held. Then, the axis determination unit 245A of the present embodiment refers to the impression word score list 114 to determine impression words corresponding to the axes α and β of the radar chart.

図１６は、第二の実施形態の検索処理部の処理を説明するフローチャートである。図１６のステップＳ１６０１の処理は、図９のステップＳ９０１の処理と同様であるから、説明を省略する。 FIG. 16 is a flow chart for explaining the processing of the search processing unit of the second embodiment. Since the processing in step S1601 in FIG. 16 is the same as the processing in step S901 in FIG. 9, description thereof is omitted.

続いて、検索処理部２４０Ａは、印象語スコア出力部２４４により、入力されたクエリに基づき、クエリとの関連が明確な印象語を軸としてｎ個抽出し、軸決定部２４５Ａにより印象語スコアリスト１１４を生成して保持し、α軸に設定する（ステップＳ１６０２）。 Next, based on the input query, the search processing unit 240A uses the impression word score output unit 244 to extract n impression words that are clearly related to the query, and the axis determination unit 245A generates an impression word score list. 114 is generated and held, and set to the α axis (step S1602).

続いて、軸決定部２４５Ａは、クエリに基づき生成された印象語スコアリスト１１４を参照して、β軸と対応する項目をｍ個抽出し、β軸に設定する（ステップＳ１６０３）。ステップＳ１６０３の処理の詳細は後述する。 Subsequently, the axis determining unit 245A refers to the impression word score list 114 generated based on the query, extracts m items corresponding to the β axis, and sets them on the β axis (step S1603). Details of the processing in step S1603 will be described later.

図１６のステップＳ１６０４とステップＳ１６０５の処理は、図９のステップＳ９０４とステップＳ９０５の処理と同様であるから、説明を省略する。 The processing in steps S1604 and S1605 in FIG. 16 is the same as the processing in steps S904 and S905 in FIG. 9, so description thereof will be omitted.

次に、図１７を参照して、本実施形態の軸決定部２４５Ａの処理について説明する。図１７は、第二の実施形態の軸決定部の処理を説明するフローチャートである。図１７の処理は、図１６のステップＳ１６０３の処理の詳細を示している。 Next, with reference to FIG. 17, processing of the axis determination unit 245A of this embodiment will be described. FIG. 17 is a flow chart for explaining the processing of the axis determination unit of the second embodiment. The processing in FIG. 17 shows details of the processing in step S1603 in FIG.

本実施形態の軸決定部２４５Ａは、ステップＳ１６０２において印象語スコアリスト１１４が生成されると、印象語スコアリスト１１４から、印象語毎に対応付けられたスコアのうち、大きさが上位ｎ＋１番目からｍ番目までの各スコアと、各スコアと対応する印象語を抽出し、保持する（ステップＳ１７０１）。 When the impression word score list 114 is generated in step S1602, the axis determination unit 245A of the present embodiment selects from the impression word score list 114 the scores associated with each impression word, from the top (n+1)th in magnitude. Each score up to m-th and the impression words corresponding to each score are extracted and stored (step S1701).

続いて、軸決定部２４５Ａは、検索結果集合データのうち、検索語群との一致度が高い順にｚ件の検索結果集合データを取得し、ｚ件の検索結果集合データについて、印象語スコアリスト２３１を参照して、ステップＳ１７０１で保持された印象語と対応するスコアを取得する（ステップＳ１７０２）。 Subsequently, the axis determining unit 245A acquires z pieces of search result set data in descending order of degree of matching with the search term group from among the search result set data, and creates an impression word score list for the z pieces of search result set data. 231, the score corresponding to the impression word held in step S1701 is obtained (step S1702).

以下に、ステップＳ１７０２の処理について説明する。本実施形態の軸決定部２４５Ａは、ステップＳ１６０１による検索部２４３の検索により取得された検索結果集合データから、検索語群との一致度が高い順に、ｚ件の検索結果集合データを取得する。検索語群との一致度とは、検索結果集合データに含まれる検索語群の数や、検索結果集合データにおける検索語群に含まれる各単語の出現頻度等によって、算出されても良い。また、一致度は、一般的に情報検索をする場合の一致度を用いても良い。 The processing of step S1702 will be described below. The axis determination unit 245A of the present embodiment acquires z pieces of search result set data in descending order of degree of matching with the search term group from the search result set data obtained by the search by the search unit 243 in step S1601. The degree of matching with a search term group may be calculated from the number of search term groups included in the search result set data, the appearance frequency of each word included in the search term group in the search result set data, and the like. As the degree of matching, the degree of matching generally used for information retrieval may be used.

軸決定部２４５Ａは、ｚ件の検索結果集合データを取得すると、印象語スコアリスト２３１を参照し、各検索結果集合データの文書ＩＤ毎に、ステップＳ１７０１で保持された印象語と対応付けられたスコアを取得する。 When acquiring z pieces of search result set data, the axis determination unit 245A refers to the impression word score list 231, and associates each document ID of each search result set data with the impression word held in step S1701. Get score.

例えば、ステップＳ１７０１において保持された印象語が、「ヘルシー」、「ひっそり」、「おしゃれ」、「わいわい」、「ガッツリ」、「落ち着き」であったとする。また、ｚ件の検索結果集合データに、文書ＩＤ「２」の検索結果集合データが含まれたとする。 For example, it is assumed that the impression words retained in step S1701 are "healthy", "quiet", "stylish", "noisy", "full of energy", and "calm". Assume also that search result set data with document ID "2" is included in z sets of search result set data.

この場合、軸決定部２４５Ａは、ステップＳ１７０２において、印象語スコアリスト２３１を参照し、文書ＩＤ「２」と対応する印象語「ヘルシー」のスコアである９．５３、印象語「ひっそり」のスコアである０．１２、印象語「おしゃれ」のスコアである８．９３を取得する（図８参照）。同様に、軸決定部２４５Ａは、印象語、「わいわい」、「ガッツリ」、「落ち着き」についても、対応するスコアを取得する。 In this case, in step S1702, the axis determination unit 245A refers to the impression word score list 231, the score of the impression word "healthy" corresponding to the document ID "2" is 9.53, and the score of the impression word "quietly" is 9.53. is 0.12, and the score of the impression word "fashionable" is 8.93 (see FIG. 8). Similarly, the axis determination unit 245A also acquires the corresponding scores for the impression words "wai wai", "guts", and "calmness".

本実施形態の軸決定部２４５Ａは、ｚ件の検索結果集合データ毎に、上述したように印象語毎のスコアを取得し、保持する。 The axis determination unit 245A of the present embodiment acquires and holds the score for each impression word as described above for each of z sets of search result set data.

続いて、軸決定部２４５Ａは、ステップＳ１７０１で保持された印象語毎に、ステップＳ１７０２で取得された検索結果集合データ毎のスコアの分散を算出する（ステップＳ１７０３）。 Subsequently, the axis determination unit 245A calculates the score variance for each set of search result data acquired in step S1702 for each impression word held in step S1701 (step S1703).

続いて、軸決定部２４５Ａは、ステップＳ１７０１で抽出された印象語を、スコアの分散の値によりソートし、分散の値が大きい印象語から順に、ｋ個の印象語を抽出し、抽出された印象語をβ軸に設定する（ステップＳ１７０４）。ここで、ｋ個とは、レーダチャートに設定される軸の数からｎを減算した数であり、β軸の本数と同数である。 Subsequently, the axis determination unit 245A sorts the impression words extracted in step S1701 according to the score variance value, extracts k impression words in descending order of the variance value, and extracts k impression words. The impression word is set on the β axis (step S1704). Here, k is the number obtained by subtracting n from the number of axes set in the radar chart, which is the same number as the number of β axes.

このように、本実施形態の軸決定部２４５Ａは、α軸と比較して、クエリとの関連の強さを示すスコアが小さい印象語の中から、検索結果集合データとの関連の強さにばらつきがある印象語を抽出し、β軸と対応する項目に設定している。 In this way, the axis determination unit 245A of the present embodiment selects the impression words having a smaller score indicating the strength of the relation with the query compared to the α-axis, and Impression words with variations are extracted and set as items corresponding to the β axis.

本実施形態では、このようにしてβ軸の項目を設定することで、レーダチャートにおいて、各軸の値が近い値となることや、複数の検索結果集合データのレーダチャート同士が似たようなものとなることを抑制している。したがって、本実施形態によれば、検索結果の評価を適切に行い、検索結果集合データ毎のレーダチャートを表示させた際に、検索結果集合データ毎の特徴を捉えやすく表示させることができる。 In this embodiment, by setting the items of the β axis in this way, the values of each axis are close to each other in the radar chart, and the radar charts of a plurality of sets of search result data are similar to each other. It restrains itself from becoming a thing. Therefore, according to the present embodiment, search results can be evaluated appropriately, and when a radar chart for each piece of search result set data is displayed, it is possible to easily grasp the features of each piece of search result set data.

（第三の実施形態）
以下に図面を参照して、第三の実施形態について説明する。第三の実施形態は、軸の決定の際に、単語と印象語との関連の強さを示すスコアを求める際に得られる分散も用いる点が、第一の実施形態と相違する。よって、以下の第三の実施形態の説明では、第一の実施形態との相違点についてのみ説明し、第一の実施形態と同様の機能構成を有するものには、第一の実施形態の説明で用いた符号と同様の符号を付与し、その説明を省略する。 (Third embodiment)
A third embodiment will be described below with reference to the drawings. The third embodiment differs from the first embodiment in that, when determining the axis, the variance obtained when calculating the score indicating the strength of the relationship between the word and the impression word is also used. Therefore, in the following description of the third embodiment, only differences from the first embodiment will be described. are assigned the same reference numerals as those used in , and the description thereof is omitted.

はじめに、図１８を参照して、後述する本実施形態の印象語マップの作成について説明する。 First, with reference to FIG. 18, creation of an impression word map according to this embodiment, which will be described later, will be described.

本実施形態の印象語マップは、例えば、単語と印象語との関連の強さを示すスコアを回答させるアンケート等を多数の人に対して実施し、その結果に基づいて作成される。 The impression word map of the present embodiment is created, for example, based on the results of a questionnaire or the like in which a large number of people are asked to answer a score indicating the strength of the relationship between a word and an impression word.

より具体的には、本実施形態では、上述したアンケート結果等を収集し、単語と印象語との関連の強さを示すスコアの平均値や最大値等を、単語と印象語との関連を示すスコアとして、印象語マップに保持させる。 More specifically, in the present embodiment, the results of the above-mentioned questionnaire are collected, and the average value and maximum value of the scores indicating the strength of the relationship between the word and the impression word are calculated. It is stored in the impression word map as a score to show.

ところで、単語の中には、単語に対して、多くの人が関連が強いと感じる印象語と、人によって感じる関連の強さが異なる印象語とが存在するものがある。前者の印象語は、単語との関連の強さを示すスコアのばらつきが小さくなることが推測される。また、後者の印象語は、単語と印象語との関連の強さを示すスコアのばらつきが大きくなることが推測される。本実施形態では、この印象語毎のスコアのばらつきに着目した。 By the way, among words, there are impression words that many people feel strongly related to, and impression words that feel different depending on people. The former impression word is presumed to have a smaller variation in the score indicating the strength of the relationship with the word. In addition, it is presumed that the latter impression word has a large variation in the score indicating the strength of the relationship between the word and the impression word. In the present embodiment, attention is focused on the variation in score for each impression word.

図１８は、第三の実施形態の印象語マップにおけるスコアのばらつきについて説明する図である。図１８（ａ）は、単語「天然」と印象語「美味しい」との関連の強さを示すスコアのばらつきを示す図である。図１８（ｂ）は、単語「天然」と、印象語「香りが良い」との関連の強さを示すスコアのばらつきを示す図である。 FIG. 18 is a diagram for explaining variations in scores in the impression word map of the third embodiment. FIG. 18(a) is a diagram showing variations in scores indicating the strength of the relationship between the word "natural" and the impression word "delicious." FIG. 18(b) is a diagram showing variations in scores indicating the strength of the relationship between the word "natural" and the impression word "fragrance".

図１８（ａ）、（ｂ）において、横軸は、アンケートにおいて回答されたスコアの値を示す。尚、図１８の例では、単語と印象語との関連の強さを示すスコアは、０～５の間で回答されるものとし、スコアの値が大きいほど、アンケートの回答者が、単語と印象語との関連が強いと感じていることを示すものとした。 In FIGS. 18A and 18B, the horizontal axis indicates score values answered in the questionnaire. In the example of FIG. 18, the score indicating the strength of the relationship between the word and the impression word is answered between 0 and 5. It was used to indicate that they felt a strong connection with the impression word.

また、図１８（ａ）、（ｂ）において、縦軸は、アンケートに回答した全員の人数に対する、スコア毎の回答数の割合を示す。 In FIGS. 18A and 18B, the vertical axis indicates the ratio of the number of responses for each score to the total number of people who responded to the questionnaire.

図１８（ａ）では、単語「天然」と、印象語「美味しい」との関連の強さについて、スコア「４」という回答数が最も多く、スコアの分布が単峰型である。つまり、図１８（ａ）では、スコアのばらつきが小さいことがわかる。これは、多くの人が、単語「天然」と、印象語「美味しい」とは関連が強いと感じることを示している。 In FIG. 18(a), regarding the strength of association between the word "natural" and the impression word "delicious", the number of responses with a score of "4" was the largest, and the score distribution is unimodal. In other words, in FIG. 18A, it can be seen that the variation in scores is small. This indicates that many people feel that the word "natural" and the impression word "delicious" are closely related.

図１８（ｂ）では、単語「天然」と、印象語「香りが良い」との関連の強さについて、スコア「２」という回答数と、スコア「４」という回答数が多く、スコアの分布が双峰型である。つまり、図１８（ｂ）では、図１８（ａ）と比較して、スコアのばらつきが大きいことがわかる。これは、単語「天然」と印象語「香りが良い」との関連の強さは、人によって感じ方が異なることを示している。 In FIG. 18(b), regarding the strength of the relationship between the word "natural" and the impression word "fragrance", the number of responses with a score of "2" and the number of responses with a score of "4" were high. is bimodal. That is, in FIG. 18(b), it can be seen that the variation in scores is greater than in FIG. 18(a). This indicates that the strength of the association between the word "natural" and the impression word "fragrant" differs from person to person.

本実施形態では、このような人による感じ方のばらつきを考慮し、例えば、単語と印象語との組み合わせ毎に、アンケートの回答結果として取得したスコアの平均値と、スコアのばらつきを示す値とを対応付けて、印象語マップに保持させる。 In this embodiment, in consideration of such variations in how people feel, for example, for each combination of a word and an impression word, an average score obtained as a result of answering a questionnaire and a value indicating the variation of the score are combined. are associated with each other and stored in the impression word map.

図１８の例では、単語「天然」と印象語「美味しい」との組み合わせでは、アンケートの回答として取得したスコアの平均値は、３．８であり、ばらつきを示す標準偏差は０．６８である。したがって、本実施形態では、印象語マップにおいて、単語「天然」と印象語「美味しい」との関連の強さを示すスコア「３．８」と標準偏差「０．６８」とを対応付けて保持する。 In the example of FIG. 18, for the combination of the word “natural” and the impression word “delicious”, the average score obtained as answers to the questionnaire is 3.8, and the standard deviation indicating the variation is 0.68. . Therefore, in the present embodiment, in the impression word map, the score "3.8" indicating the strength of the relationship between the word "natural" and the impression word "delicious" and the standard deviation "0.68" are stored in association with each other. do.

また、図１８の例では、単語「天然」と印象語「香りが良い」との組み合わせでは、アンケートの回答として取得したスコアの平均値は、３．２５であり、ばらつきを示す標準偏差は１．１６である。したがって、本実施形態では、印象語マップにおいて、単語「天然」と印象語「香りが良い」との関連の強さを示すスコア「３．２５」と標準偏差「１．１６」とを対応付けて保持する。 In the example of FIG. 18, for the combination of the word “natural” and the impression word “smells good,” the average score obtained as answers to the questionnaire is 3.25, and the standard deviation indicating variation is 1. .16. Therefore, in the present embodiment, in the impression word map, the score "3.25" indicating the strength of the relationship between the word "natural" and the impression word "fragrant" is associated with the standard deviation "1.16". hold.

本実施形態では、このようにして、スコアとスコアのばらつきとが対応付けられた印象語マップを用いることで、例えば、標準偏差が大きい印象語が軸とされた際に、この軸に代わる他の印象語を抽出することができる。言い換えれば、本実施形態では、単語との関連の強さの感じ方が人によって異なる印象語が軸として抽出された場合に、この印象語と入れ替えるための他の印象語を抽出し、ユーザに提示することができる。 In this embodiment, by using an impression word map in which scores and variations in scores are associated with each other in this way, for example, when an impression word with a large standard deviation is used as an axis, other impression words can be extracted. In other words, in the present embodiment, when an impression word is extracted as an axis for which people feel differently about the strength of the relationship with the word, another impression word to be replaced with this impression word is extracted and given to the user. can be presented.

したがって、本実施形態によれば、クエリに基づく検索語群による検索処理で取得された検索結果集合データを表示する際に、ユーザの意図を反映した印象語を、項目（軸）の候補として表示させることができる。 Therefore, according to this embodiment, when displaying search result set data obtained by search processing using a search word group based on a query, impression words that reflect the user's intention are displayed as item (axis) candidates. can be made

以下に、本実施形態の検索装置について説明する。図１９は、第三の実施形態の検索装置の機能を説明する図である。本実施形態の検索装置２００Ｂは、インデックス記憶部２１０、印象語マップ記憶部２２０Ａ、検索対象スコア記憶部２３０Ａ、検索処理部２４０Ｂを有する。 The search device of this embodiment will be described below. FIG. 19 is a diagram for explaining the functions of the search device of the third embodiment. The search device 200B of this embodiment has an index storage unit 210, an impression word map storage unit 220A, a search target score storage unit 230A, and a search processing unit 240B.

印象語マップ記憶部２２０Ａには、印象語マップ２２１Ａが格納されている。印象語マップ２２１Ａは、印象語毎に、印象語と単語との関連を示すスコアと、スコアを取得する際に得られる標準偏差とが対応付けられている。印象語マップ２２１Ａの詳細は後述する。 An impression word map 221A is stored in the impression word map storage unit 220A. Impression word map 221A associates, for each impression word, a score indicating the relationship between the impression word and the word and a standard deviation obtained when obtaining the score. Details of the impression word map 221A will be described later.

検索対象スコア記憶部２３０Ａには、印象語スコアリスト２３１Ａが格納されている。印象語スコアリスト２３１Ａは、検索処理部２４０Ｂの処理によって作成される。印象語スコアリスト２３１Ａの詳細は後述する。 An impression word score list 231A is stored in the search target score storage unit 230A. Impression word score list 231A is created by the processing of search processing unit 240B. Details of the impression word score list 231A will be described later.

本実施形態の検索処理部２４０Ｂは、入力受付部２４１、解析部２４２、検索部２４３、印象語スコア出力部２４４Ａ、軸決定部２４５Ａ、出力部２４６、インデックス生成部２４７、印象空間学習部２４８Ａ、候補通知部２４９を有する。 The search processing unit 240B of the present embodiment includes an input reception unit 241, an analysis unit 242, a search unit 243, an impression word score output unit 244A, an axis determination unit 245A, an output unit 246, an index generation unit 247, an impression space learning unit 248A, It has a candidate notification unit 249 .

本実施形態の印象語スコア出力部２４４Ａは、印象語マップ２２１Ａと、後述する印象空間学習部２４８Ａにより出力される変換モデル２５０Ａと、に基づき、ベクトルで表現された単語と印象語との関連の強さを示すスコアと、スコアの標準偏差とを出力する。 The impression word score output unit 244A of the present embodiment is based on an impression word map 221A and a conversion model 250A output by an impression space learning unit 248A, which will be described later. Output the strength score and the standard deviation of the score.

本実施形態の軸決定部２４５Ａは、印象語スコア出力部２４４Ａにより出力された印象語毎のスコアと標準偏差とに基づき、検索結果を表示するためのレーダチャートの軸となる印象語と、この印象語と入れ替えることが可能な印象語とを出力する。尚、軸に決定さされた印象語と入れ替えることが可能な印象語を、軸の候補と呼ぶ。つまり、本実施形態の軸の候補となる印象語は、言い換えれば、クエリから推定される内容の特徴を示す特徴語群（第１特徴語群）の候補となる第３特徴語群である。 Based on the score and standard deviation for each impression word output by the impression word score output unit 244A, the axis determination unit 245A of the present embodiment determines the impression word as the axis of the radar chart for displaying the search results, and An impression word and an impression word that can be replaced are output. An impression word that can be replaced with an impression word determined for an axis is called an axis candidate. In other words, the impression words that are candidates for the axis in the present embodiment are, in other words, the third feature word group that are candidates for the feature word group (first feature word group) that indicates the feature of the content estimated from the query.

本実施形態の印象空間学習部２４８Ａは、検索対象文書データベース４００に格納された検索対象の文書データと、印象語マップ２２１Ａとに基づき、変換モデル２５０Ａを出力する。尚、本実施形態の変換モデル２５０Ａは、分散表現によって得られた単語の意味ベクトルを入力とし、この単語と対応付けられた印象語毎のスコアと、スコアの標準偏差とを出力とするモデルである。 The impression space learning unit 248A of the present embodiment outputs a conversion model 250A based on the document data to be searched stored in the search target document database 400 and the impression word map 221A. Note that the conversion model 250A of the present embodiment is a model in which the semantic vector of a word obtained by distributed representation is input, and the score for each impression word associated with this word and the standard deviation of the score are output. be.

本実施形態の候補通知部２４９は、軸決定部２４５Ａにより出力された軸の候補となる印象語を通知する。 The candidate notification unit 249 of the present embodiment notifies impression words that are candidates for the axis output by the axis determination unit 245A.

以下に、図２０を参照して、本実施形態の印象語マップ２２１Ａについて説明する。図２０は、第三の実施形態の印象語マップの例を示す図である。 The impression word map 221A of this embodiment will be described below with reference to FIG. FIG. 20 is a diagram showing an example of an impression word map according to the third embodiment.

本実施形態の印象語マップ２２１Ａは、予め作成されて検索装置２００Ｂに与えられる。本実施形態の印象語マップ２２１Ａは、単語毎に、予め用意された印象語との関連の強さを示すスコア（値）と、スコアのばらつきを示す標準偏差とが対応付けられている。 The impression word map 221A of this embodiment is created in advance and given to the search device 200B. In the impression word map 221A of the present embodiment, each word is associated with a score (value) indicating the strength of association with an impression word prepared in advance and a standard deviation indicating the variation of the scores.

図２０では、例えば、単語「レストラン」と印象語「わいわい」との関連の強さを示すスコアは１．６であり、標準偏差σは０．５である。また、単語「レストラン」と印象語「おなかいっぱい」との関連の強さを示すスコアは３．１であり、標準偏差σは１．７である。したがって、印象語「わいわい」と印象語「おなかいっぱい」では、印象語「わいわい」の方が、多くの人が単語「レストラン」と関連が強いと感じていることがわかる。 In FIG. 20, for example, the score indicating the strength of the relationship between the word "restaurant" and the impression word "waiwai" is 1.6, and the standard deviation σ is 0.5. Also, the score indicating the strength of the relationship between the word "restaurant" and the impression word "full stomach" is 3.1, and the standard deviation σ is 1.7. Therefore, it can be seen that many people feel that the impression word "wai wai" is more closely related to the word "restaurant" than the impression word "wai wai" and the impression word "full of stomach".

尚、本実施形態では、単語と印象語との関連の強さを示すスコアのばらつきを示す値を標準偏差としたが、これに限定されない。単語と印象語との関連の強さを示すスコアのばらつきを示す値は、分散として保持されても良い。 Note that in the present embodiment, the standard deviation is the value indicating the variation in the score indicating the strength of the relationship between the word and the impression word, but the present invention is not limited to this. A value indicating the dispersion of the score indicating the strength of the relationship between the word and the impression word may be held as a variance.

次に、図２１を参照して、本実施形態の印象語スコアリスト２３１Ａについて説明する。 Next, the impression word score list 231A of this embodiment will be described with reference to FIG.

図２１は、第三の実施形態の印象語スコアリストの一例を示す図である。図２１では、検索文書対象データベース４００のうち飲食店等に関する文書データから得た印象語スコアリスト２３１Ａの例を示している。また、印象語スコアリスト２３１Ａは、予め作成されて検索装置２００Ｂに与えられていても良い。 FIG. 21 is a diagram showing an example of an impression word score list according to the third embodiment. FIG. 21 shows an example of an impression word score list 231A obtained from document data relating to restaurants, etc., in the retrieval document target database 400. In FIG. Impression word score list 231A may be created in advance and provided to search device 200B.

印象語スコアリスト２３１Ａは、文書データを特定する文書ＩＤと、文書データが示す店舗の名称と、文書データの内容を示すテキストデータと、印象語スコアリスト２３１Ａに含まれる各印象語との関連の強さを示すスコアと、が対応付けられている。尚、印象語スコアリスト２３１Ａに含まれる印象語は、印象語マップ２２１Ａに含まれる印象語と同様である。 Impression word score list 231A contains a document ID that specifies document data, the name of a store indicated by the document data, text data indicating the contents of the document data, and relationships between impression words included in impression word score list 231A. are associated with a score indicating strength. The impression words included in the impression word score list 231A are the same as the impression words included in the impression word map 221A.

次に、図２２を参照して、本実施形態の検索処理部２４０Ｂの処理について説明する。図２２は、第三の実施形態の検索処理部の処理を説明するフローチャートである。 Next, referring to FIG. 22, processing of the search processing unit 240B of this embodiment will be described. FIG. 22 is a flowchart for explaining the processing of the search processing section of the third embodiment.

本実施形態の検索処理部２４０Ｂは、解析部２４２により、クエリから形態素解析等により検索語群を取得し、検索部２４３により、インデックス情報２１１を検索語群で検索し、検索語群と対応する文書ＩＤを特定する（ステップＳ２２０１）。 The search processing unit 240B of this embodiment uses the analysis unit 242 to acquire a search term group from the query through morphological analysis or the like, and the search unit 243 searches the index information 211 using the search term group, and searches for the search term group. A document ID is specified (step S2201).

続いて、検索処理部２４０Ｂは、印象語スコア出力部２４４Ａにより、入力されたクエリに基づき、クエリとの関連が強いとされるｎ個の印象語を軸として抽出してα軸に設定し、さらに、軸の候補となる印象語を抽出する（ステップＳ２２０２）。ステップＳ２２０２の処理の詳細は後述する。 Next, the search processing unit 240B uses the impression word score output unit 244A to extract n impression words that are considered to be strongly related to the query based on the input query, and set them on the α axis, Furthermore, impression words that are candidates for the axis are extracted (step S2202). Details of the processing in step S2202 will be described later.

図２２のステップＳ２２０３からステップＳ２２０５までの処理は、図９のステップＳ９０３からステップＳ９０５までの処理と同様であるから、説明を省略する。 Since the processing from step S2203 to step S2205 in FIG. 22 is the same as the processing from step S903 to step S905 in FIG. 9, description thereof is omitted.

ステップＳ２２０５において出力部２４６がレーダチャートを描画すると、検索処理部２４０Ｂは、候補通知部２４９により、ステップＳ２２０２において、軸の候補となる印象語が抽出されたか否かを判定する（ステップＳ２２０６）。ステップＳ２２０６において、該当する印象語が抽出されていない場合、検索処理部２４０Ｂは、処理を終了する。 When the output unit 246 draws the radar chart in step S2205, the search processing unit 240B determines whether or not the candidate impression word for the axis has been extracted in step S2202 by the candidate notification unit 249 (step S2206). In step S2206, if the corresponding impression word has not been extracted, search processing unit 240B terminates the process.

ステップＳ２２０６において、該当する印象語が抽出された場合、候補通知部２４９は、ステップＳ２２０２で軸の候補として抽出された印象語を、レーダチャートが表示された画面に出力し、レーダチャートの軸と、軸の候補との入れ替えが可能であることを通知し（ステップＳ２２０７）、処理を終了する。 In step S2206, when the corresponding impression word is extracted, the candidate notification unit 249 outputs the impression word extracted as the axis candidate in step S2202 to the screen displaying the radar chart, and outputs the impression word as the axis of the radar chart. , is notified that it can be replaced with the axis candidate (step S2207), and the process ends.

次に、図２３を参照して、図２２のステップＳ２２０２の処理の詳細について説明する。図２３は、第三の実施形態の印象語スコア出力部、軸決定部の処理を説明するフローチャートである。 Next, details of the processing in step S2202 in FIG. 22 will be described with reference to FIG. FIG. 23 is a flow chart for explaining the processing of the impression word score output section and the axis determination section of the third embodiment.

本実施形態の印象語スコア出力部２４４Ａは、ステップＳ２２０１において取得した検索語群に含まれる各単語を、分散表現に変換する（ステップＳ２３０１）。 The impression word score output unit 244A of the present embodiment converts each word included in the search word group acquired in step S2201 into distributed representation (step S2301).

続いて、印象語スコア出力部２４４Ａは、変換モデル２５０Ａを参照し、単語毎に、各印象語との関連の強さを示すスコアと標準偏差とを取得し（ステップＳ２３０２）、ステップＳ２３０３へ進む。言い換えれば、印象語スコア出力部２４４は、変換モデル２５０Ａに対して、ステップＳ２３０１で求めた各単語の意味ベクトルを入力し、変換モデル２５０Ａから出力された、各単語についての印象語毎のスコアと標準偏差とを取得する。 Subsequently, the impression word score output unit 244A refers to the conversion model 250A, acquires the score and standard deviation indicating the strength of the relationship with each impression word for each word (step S2302), and proceeds to step S2303. . In other words, impression word score output unit 244 inputs the semantic vector of each word obtained in step S2301 to conversion model 250A, and converts the score for each impression word of each word output from conversion model 250A into to get the standard deviation and

ステップＳ２３０３とステップＳ２３０４の処理は、図１０のステップＳ１００３とステップＳ１００４の処理と同様であるから、説明を省略する。 The processing in steps S2303 and S2304 is the same as the processing in steps S1003 and S1004 in FIG. 10, and thus description thereof is omitted.

ステップＳ２３０４に続いて、検索処理部２４０Ｂは、軸決定部２４５Ａにより、ステップＳ２３０４で抽出された印象語の標準偏差の中に、標準偏差の値が閾値より大きい印象語が存在するか否かを判定する（ステップＳ２３０５）。 Following step S2304, the search processing unit 240B causes the axis determination unit 245A to determine whether or not there is an impression word whose standard deviation value is greater than the threshold among the standard deviations of the impression words extracted in step S2304. Determine (step S2305).

ステップＳ２３０５において、該当する印象語が存在しない場合、検索処理部２４０Ｂは、処理を終了し、ステップＳ２２０３へ進む。 In step S2305, if the corresponding impression word does not exist, search processing section 240B terminates the process and advances to step S2203.

ステップＳ２３０５において、該当する印象語が存在する場合、軸決定部２４５Ａは、抽出された印象語から、標準偏差が閾値より大きい印象語を特定する（ステップＳ２３０６）。 In step S2305, if there is a corresponding impression word, the axis determining unit 245A identifies an impression word whose standard deviation is larger than the threshold from the extracted impression words (step S2306).

続いて、軸決定部２４５Ａは、スコアの最大値が上位ｎ＋１位以降であり、且つ、スコアの最大値が、ステップＳ２３０６で特定された印象語のスコアの最大値から標準偏差を減算した値よりも大きい印象語をｍ個抽出する（ステップＳ２３０７）。 Subsequently, axis determination unit 245A determines that the maximum score value is in the top (n+1) or higher, and that the maximum score value is less than the value obtained by subtracting the standard deviation from the maximum score value of the impression word identified in step S2306. m impression words are extracted (step S2307).

続いて、軸決定部２４５Ａは、抽出したｍ個の印象語を、ステップＳ２３０６で特定した印象語と入れ替えが可能な軸の候補として保持し（ステップＳ２３０８）、ステップＳ２２０３へ進む。 Subsequently, the axis determination unit 245A holds the extracted m impression words as candidates for the axis that can be replaced with the impression words identified in step S2306 (step S2308), and proceeds to step S2203.

以下に、図２４を参照して、本実施形態の検索処理部２４０Ｂによる軸の候補の抽出について、具体的に説明する。図２４は、第三の実施形態の検索処理部による軸の候補の抽出を説明する図である。 Extraction of axis candidates by the search processing unit 240B of the present embodiment will be specifically described below with reference to FIG. FIG. 24 is a diagram illustrating extraction of axis candidates by the search processing unit of the third embodiment.

図２４では、「天然の素材や野菜を使うレストラン」というクエリが入力され、解析部２４２により検索語群１１１Ａとして、「天然」、「素材」、「野菜」、「レストラン」が抽出された場合を示している。 In FIG. 24, the query "restaurant using natural ingredients and vegetables" is input, and the analysis unit 242 extracts "nature", "ingredients", "vegetables", and "restaurant" as the search term group 111A. is shown.

検索処理部２４０Ｂは、印象語スコア出力部２４４Ａにより、検索語群１１１Ａに含まれる各単語の意味ベクトルを取得して変換モデル２５０Ａの入力とし、各単語について、印象語マップ２２１Ａに含まれる印象語毎のスコアと標準偏差を得る。 Search processing unit 240B uses impression word score output unit 244A to acquire the semantic vector of each word included in search word group 111A and input it to conversion model 250A. Obtain the score and standard deviation for each

図２４では、検索語群１１１Ａに含まれる各単語と、印象語毎のスコアと標準偏差とを対応付けた情報を、単語毎印象語スコア１１３Ａと呼ぶ。 In FIG. 24, information in which each word included in the search word group 111A is associated with the score and standard deviation for each impression word is referred to as an impression word score for each word 113A.

次に、軸決定部２４５Ａは、単語毎印象語スコア１１３Ａから、印象語毎のスコアの最大値を取得する。 Next, the axis determination unit 245A acquires the maximum score for each impression word from the impression word score for each word 113A.

図２４の単語毎印象語スコア１１３Ａでは、印象語「美味しい」に対して、単語「レストラン」との関連の強さを示すスコアと標準偏差は３．４と±０．２であり、単語「野菜」との関連の強さを示すスコアと標準偏差は３．７と０．３である。また、単語「天然」との関連の強さを示すスコアと標準偏差は３．８と０．７であり、単語「素材」との関連の強さを示すスコアと標準偏差は３．２と０．５である。 In the word-by-word impression word score 113A in FIG. The score and standard deviation showing the strength of the association with "vegetables" are 3.7 and 0.3. In addition, the score and standard deviation indicating the strength of association with the word "natural" are 3.8 and 0.7, and the score and standard deviation indicating the strength of association with the word "material" are 3.2. 0.5.

したがって、軸決定部２４５Ａは、印象語「美味しい」のスコアの最大値として「３．８」を取得し、印象語「美味しい」の標準偏差として０．７を取得する。 Therefore, the axis determination unit 245A obtains "3.8" as the maximum score of the impression word "delicious" and obtains 0.7 as the standard deviation of the impression word "delicious".

また、単語毎印象語スコア１１３Ａでは、印象語「ヘルシー」に対して、単語「レストラン」との関連の強さを示すスコアと標準偏差は３．０と０．５であり、単語「野菜」との関連の強さを示すスコアと標準偏差は４．２と０．３である。また、単語「天然」との関連の強さを示すスコアと標準偏差は３．１と１．２であり、単語「素材」との関連の強さを示すスコアと標準偏差は３．０と０．７である。 In addition, in the impression word score for each word 113A, the score and standard deviation indicating the strength of the relationship between the impression word "healthy" and the word "restaurant" are 3.0 and 0.5, respectively. The scores and standard deviations showing the strength of the association with are 4.2 and 0.3. The scores and standard deviations indicating the strength of association with the word "natural" are 3.1 and 1.2, and the scores and standard deviation indicating the strength of association with the word "material" are 3.0. 0.7.

したがって、軸決定部２４５Ａは、印象語「ヘルシー」のスコアの最大値として「４．２」を取得し、印象語「ヘルシー」の標準偏差として０．３を取得する。 Therefore, the axis determination unit 245A acquires "4.2" as the maximum score of the impression word "healthy" and acquires 0.3 as the standard deviation of the impression word "healthy".

本実施形態の軸決定部２４５Ａは、このようにして取得した印象語毎のスコアの最大値と標準偏差とを、印象語スコアリスト１１４Ａとして保持する。 The axis determination unit 245A of the present embodiment holds the maximum value and standard deviation of the score for each impression word thus obtained as an impression word score list 114A.

続いて、軸決定部２４５Ａは、印象語スコアリスト１１４Ａにおいて、スコアの最大値が大きい印象語から順に、ｎ個の印象語を抽出して、α軸とする。図２４の例では、ｎ＝３としている。 Subsequently, the axis determination unit 245A extracts n impression words in descending order of the maximum score from the impression word score list 114A, and defines them as the α-axis. In the example of FIG. 24, n=3.

印象語スコアリスト１１４Ａにおいて、スコアの最大値が最も大きい印象語は「ヘルシー」であり、次にスコアの最大値が大きい印象語は「美味しい」であり、その次にスコアの最大値が大きい印象語は「香りが良い」である。 In the impression word score list 114A, the impression word with the highest maximum score is "healthy", the next highest maximum score is "tasty", and the next highest impression is the impression with the highest maximum score. The word is "fragrant".

したがって、本実施形態の軸決定部２４５Ａは、印象語「ヘルシー」と印象語「美味しい」と印象語「香りが良い」をα軸に決定する。言い換えれば、軸決定部２４５Ａは、印象語「ヘルシー」と印象語「美味しい」と印象語「香りが良い」とを、検索結果集合データを表示する際の項目とする。 Therefore, the axis determination unit 245A of the present embodiment determines the impression word "healthy", the impression word "delicious", and the impression word "fragrant" as the α-axis. In other words, the axis determination unit 245A sets the impression word "healthy", the impression word "delicious", and the impression word "fragrant" as items when displaying the search result set data.

さらに、本実施形態の軸決定部２４５Ａは、印象語スコアリスト１１４Ａを参照し、α軸として決定された印象語「ヘルシー」と印象語「美味しい」と印象語「香りが良い」のそれぞれの標準偏差が所定の閾値より大きいか否かを判定する。 Furthermore, the axis determination unit 245A of the present embodiment refers to the impression word score list 114A, and determines the standard scores of the impression word "healthy", the impression word "delicious", and the impression word "fragrant" determined as the α axis. Determine whether the deviation is greater than a predetermined threshold.

ここでは、標準偏差に対する所定の閾値を１．０として説明する。尚、所定の閾値は、任意の値に設定することができる。 Here, the predetermined threshold for the standard deviation is assumed to be 1.0. Note that the predetermined threshold can be set to any value.

印象語スコアリスト１１４Ａにおいて、印象語「ヘルシー」の標準偏差は０．３であり、印象語「美味しい」の標準偏差は０．７であり、両者とも所定の閾値以下である。 In the impression word score list 114A, the standard deviation of the impression word "healthy" is 0.3 and the standard deviation of the impression word "tasty" is 0.7, both of which are equal to or less than the predetermined threshold.

それに対して、印象語「香りが良い」の標準偏差は１．２であり、所定の閾値より大きい。これは、印象語「香りが良い」と、検索語群１１１Ａに含まれる単語との関連の強さは、人によって感じ方が異なることを示している。 On the other hand, the standard deviation of the impression word "smells good" is 1.2, which is larger than the predetermined threshold. This indicates that the strength of the relationship between the impression word "smells good" and the words included in the search word group 111A is perceived differently by different people.

そこで、本実施形態では、印象語スコアリスト１１４Ａを参照し、スコアが、上位ｎ＋１位以降であって、且つ、印象語「香りが良い」のスコア３．２から標準偏差１．２を減算した値である２．０以上の印象語を、スコアが大きい順にｍ個、軸の候補として抽出する。 Therefore, in the present embodiment, the impression word score list 114A is referred to, and the score of the impression word "smells good" whose score is in the top (n+1) or higher is subtracted from the score of 3.2 by the standard deviation of 1.2. m impression words with a value of 2.0 or more are extracted as axis candidates in descending order of score.

図２４の例では、ｍ＝２としている。よって、軸決定部２４５Ａは、印象語スコアリスト１１４Ａにおいて、スコアが上位４位以下であり、且つ、２．０以上の印象語を、スコアが大きい順に２個抽出する。 In the example of FIG. 24, m=2. Therefore, the axis determination unit 245A extracts two impression words whose scores are in the top four or lower and whose scores are 2.0 or higher in descending order of the scores in the impression word score list 114A.

図２４では、スコアが３．１の印象語「おなかいっぱい」と、スコアが２．８の印象語「おなかにやさしい」とが、軸の候補として抽出される。 In FIG. 24, the impression word "full stomach" with a score of 3.1 and the impression word "easy on the stomach" with a score of 2.8 are extracted as axis candidates.

続いて、検索処理部２４０Ｂの出力部２４６は、検索結果集合データを表示する画面１１５において、印象語「ヘルシー」と印象語「美味しい」と印象語「香りが良い」とを、検索結果集合データを示す項目（軸）１１６として表示させる。また、出力部２４６は、各項目（軸）の値を示す棒グラフ１１７を画面１１５に表示させる。尚、図２４の例で波、検索結果集合データとして、「京料理△△」が抽出された例を示している。 Subsequently, the output unit 246 of the search processing unit 240B converts the impression words "healthy", "delicious", and "fragrant" into the search result set data on the screen 115 displaying the search result set data. is displayed as an item (axis) 116 indicating . In addition, the output unit 246 causes the screen 115 to display a bar graph 117 showing the values of each item (axis). In addition, in the example of FIG. 24, an example in which "Kyoto cuisine △△" is extracted as the search result set data is shown.

このとき、候補通知部２４９は、印象語「香りが良い」と対応付けて、この印象語を、軸の候補として抽出さたれ印象語「おなかいっぱい」又は印象語「おなかにやさしい」に変更することが可能であることを示す通知欄１１８を表示させる。 At this time, the candidate notifying unit 249 associates the impression word with the impression word "smells good" and changes this impression word to the impression word "full stomach" or the impression word "gentle on the stomach" extracted as the candidate for the axis. A notification field 118 indicating that it is possible is displayed.

本実施形態では、このように、通知欄１１８を表示させることで、例えば、ユーザが、検索語群１１１Ａと、項目として表示された印象語「香りが良い」とは、関連していないと感じた場合等に、この項目を他の印象語に変更させることができる。また、本実施形態では、通知欄１１８を表示させることで、検索語群１１１Ａと、印象語「香りが良い」とは関連していない、と感じるユーザに対し、ユーザの意図（感じ方）を反映させることが可能であることを知らせることができる。 In the present embodiment, by displaying the notification field 118 in this way, for example, the user may feel that the search word group 111A and the impression word "smells good" displayed as an item are not related. In such a case, this item can be changed to another impression word. In addition, in the present embodiment, by displaying the notification field 118, the user's intention (how to feel) is communicated to the user who feels that the search word group 111A and the impression word "smells good" are not related. It is possible to inform that it is possible to reflect.

以下に、図２５及び図２６を参照して、本実施形態の表示例について説明する。図２５は、第三の実施形態の表示例を示す第一の図である。 Display examples according to the present embodiment will be described below with reference to FIGS. 25 and 26. FIG. FIG. 25 is a first diagram showing a display example of the third embodiment.

図２５に示す画面１１５Ａでは、検索結果集合データを示す軸（項目）１１６と、各軸（項目）の値を示す棒グラフ１１７と、印象語「香りが良い」の代わりとなる軸の候補の通知欄１１８に加え、クエリの各軸の値を示す棒グラフ１１９が表示されている。 The screen 115A shown in FIG. 25 includes an axis (item) 116 indicating search result aggregate data, a bar graph 117 indicating the value of each axis (item), and notification of candidates for the axis that can be substituted for the impression word "smells good". In addition to columns 118, a bar graph 119 is displayed showing the values for each axis of the query.

図２６は、第三の実施形態の表示例を示す第二の図である。 FIG. 26 is a second diagram showing a display example of the third embodiment.

図２６に示す画面１１５Ｂでは、検索結果集合データをレーダチャート１３０によって表示させている。 In screen 115B shown in FIG. 26, search result set data is displayed by radar chart 130. FIG.

レーダチャート１３０には、グラフ１１７Ａとグラフ１１９Ａとが表示されている。グラフ１１７Ａは、検索結果集合データと軸とされた項目との関係性を示しており、グラフ１１９Ａは、クエリと軸とされた項目との関係性を示している。 A graph 117A and a graph 119A are displayed on the radar chart 130 . A graph 117A shows the relationship between the search result set data and the axis item, and the graph 119A shows the relationship between the query and the axis item.

また、レーダチャート１３０では、α１軸を「ヘルシー」、α２軸を「美味しい」、α３軸を「香りが良い」としており、軸の候補として抽出された印象語「おなかいっぱい」、「おなかにやさしい」を、それぞれβ１軸、β２軸としている。 In the radar chart 130, the α1 axis is "healthy", the α2 axis is "delicious", and the α3 axis is "fragrant". ” are defined as the β1 axis and the β2 axis, respectively.

このように、本実施形態では、レーダチャートを表示させる際に、軸の候補として抽出された印象語を、β軸としても良い。尚、本実施形態では、レーダチャートを表示させる際に、β軸は、第一及び第二の実施形態と同様の手法で決定し、α軸とされた印象語のうち、標準偏差が所定の閾値より大きい印象語と対応付けて、他の軸の候補を通知しても良い。 As described above, in the present embodiment, the impression word extracted as an axis candidate may be used as the β axis when the radar chart is displayed. In this embodiment, when the radar chart is displayed, the β axis is determined by the same method as in the first and second embodiments, and among the impression words on the α axis, the standard deviation Other axis candidates may be notified in association with impression words larger than the threshold.

図２６の画面１１５Ｂでは、例えば、レーダチャート１３０のβ軸を、例えば、クエリから抽出された検索語群１１１Ａの中から抽出した単語とし、α３軸と対応付けて、通知欄１１８を表示されることになる。 On the screen 115B of FIG. 26, for example, the β axis of the radar chart 130 is the word extracted from the search term group 111A extracted from the query, and the notification column 118 is displayed in association with the α3 axis. It will be.

このように、本実施形態では、軸とされた印象語のスコアに一定以上のばらつきが存在する場合には、軸となり得る他の印象語をユーザに通知することで、ユーザの意図を反映した検索結果を表示させることができる。 As described above, in this embodiment, when there is a certain amount of variation in the score of the impression word used as the axis, the user is notified of other impression words that can be used as the axis, thereby reflecting the user's intention. Search results can be displayed.

（第四の実施形態）
以下に図面を参照して、第四の実施形態について説明する。第四の実施形態は、軸の候補として表示された印象語に対する選択を受けて、軸を変更する点が第三の実施形態と相違する。よって、以下の第四の実施形態の説明では、第三の実施形態との相違点についてのみ説明し、第三の実施形態と同様の機能構成を有するものには、第三の実施形態の説明で用いた符号と同様の符号を付与し、その説明を省略する。 (Fourth embodiment)
A fourth embodiment will be described below with reference to the drawings. The fourth embodiment differs from the third embodiment in that the axis is changed in response to selection of impression words displayed as axis candidates. Therefore, in the following description of the fourth embodiment, only differences from the third embodiment will be described. are assigned the same reference numerals as those used in , and the description thereof is omitted.

図２７は、第四の実施形態の検索装置の機能を説明する図である。本実施形態の検索装置２００Ｃは、検索処理部２４０Ｃを有する。 FIG. 27 is a diagram for explaining the functions of the search device of the fourth embodiment. The search device 200C of this embodiment has a search processing unit 240C.

本実施形態の検索処理部２４０Ｃは、検索処理部２４０Ｂの有する各部に加え、軸変更部２５１を有する。 240 C of search processing parts of this embodiment have the axis change part 251 in addition to each part which the search processing part 240B has.

本実施形態の軸変更部２５１は、例えば、端末装置３００等において、軸と対応付けて表示された軸の候補の選択を受け付けると、出力部２４６に対し、選択に応じて軸を変更したグラフを表示させる。また、本実施形態の軸変更部２５１は、ユーザにより軸が指定されている場合には、軸の候補の選択を受け付けて、指定された軸以外の軸とされた印象語のうち、最もスコアが小さい印象語を、選択された軸の候補である印象語に変更する。 For example, when the terminal device 300 or the like accepts a selection of an axis candidate displayed in association with an axis, the axis changing unit 251 of the present embodiment outputs a graph in which the axis is changed according to the selection to the output unit 246. display. In addition, when the axis is specified by the user, the axis changing unit 251 of the present embodiment accepts the selection of the candidate for the axis, and among the impression words with the axis other than the specified axis, Change the impression word with a smaller value to an impression word that is a candidate for the selected axis.

以下に、図２８を参照して、本実施形態の検索処理部２４０Ｃの軸変更部２５１の処理について説明する。 Processing of the axis changing unit 251 of the search processing unit 240C of this embodiment will be described below with reference to FIG.

図２８は、第四の実施形態における軸の変更の処理を説明するフローチャートである。本実施形態の検索処理部２４０Ｃにおいて、軸変更部２５１は、軸の候補の通知において、候補された軸の選択を受け付けたか否かを判定する（ステップＳ２８０１）。ステップＳ２８０１において、選択を受け付けない場合、軸変更部２５１は、選択を受け付けるまで待機する。 FIG. 28 is a flowchart for explaining axis change processing in the fourth embodiment. In the search processing unit 240C of the present embodiment, the axis changing unit 251 determines whether or not selection of a candidate axis has been received in notification of the axis candidate (step S2801). If the selection is not accepted in step S2801, the axis changing unit 251 waits until the selection is accepted.

ステップＳ２８０１において、軸の選択を受け付けた場合、軸変更部２５１は、ユーザにより指定されている軸が存在するか否かを判定する（ステップＳ２８０２）。尚、ここでの指定とは、表示を維持させるための指定である。したがって、本実施形態では、指定された軸の表示は維持される。 When the axis selection is accepted in step S2801, the axis changing unit 251 determines whether or not there is an axis designated by the user (step S2802). The designation here is a designation for maintaining the display. Therefore, in this embodiment, the display of the specified axis is maintained.

ステップＳ２８０２において、指定されている軸が存在する場合、軸変更部２５１は、ユーザに指定されていない軸（印象語）のうち、最もスコアの最大値が小さい軸を、選択された軸の候補に変更する（ステップＳ２８０３）。 In step S2802, if there is a specified axis, the axis changing unit 251 selects the axis with the smallest maximum score among the axes (impression words) not specified by the user as a candidate for the selected axis. (step S2803).

また、ステップＳ２８０２において、指定されている軸が存在しない場合、軸変更部２５１は、表示されているグラフの軸のうち、最もスコアが低い軸を、選択された軸の候に変更する（ステップＳ２８０４）。 Further, in step S2802, if the specified axis does not exist, the axis changing unit 251 changes the axis with the lowest score among the axes of the displayed graph to the selected axis (step S2802). S2804).

ステップＳ２８０３、２８０４に続いて、軸変更部２５１は、出力部２４６に対して、軸を変更したグラフの表示を指示し（ステップＳ２８０５）、処理を終了する。 Following steps S2803 and 2804, the axis changing unit 251 instructs the output unit 246 to display the graph with the changed axis (step S2805), and ends the process.

以下に、図２９を参照して、軸の変更について具体的に説明する。図２９は、軸の変更について説明する図である。 The change of the axis will be specifically described below with reference to FIG. 29 . FIG. 29 is a diagram for explaining the change of the axis.

図２９（ａ）では、検索結果集合データを表示する画面１１５に、α軸として抽出された印象語を指定するための指定欄２９２が表示された画面２９１を示している。 FIG. 29(a) shows a screen 291 on which a designation column 292 for designating an impression word extracted as the α-axis is displayed on the screen 115 displaying search result set data.

図２９（ｂ）に示す画面２９１Ａは、指定欄２９２における軸の指定が行われず、通知欄１１８において表示された軸の候補「おなかいっぱい」が選択された場合に、画面２９１から遷移する画面の例である。 A screen 291A shown in FIG. 29B is a screen transition from the screen 291 when no axis is specified in the specification field 292 and the axis candidate "full" displayed in the notification field 118 is selected. For example.

画面２９１において、軸１１６として抽出された印象語は、「ヘルシー」、「美味しい」、「香りが良い」である。ここで、通知欄１１８において、「おなかいっぱい」が選択されると、軸変更部２５１は、指定欄２９２における指定がないため、印象語「ヘルシー」、「美味しい」、「香りが良い」のうち、最もスコアの最大値が小さい印象語を、印象語「おなかいっぱい」に変更する（図２４参照）。 In the screen 291, the impression words extracted as the axis 116 are "healthy", "delicious", and "fragrant". Here, when "full stomach" is selected in the notification field 118, the axis changing unit 251 selects one of the impression words "healthy", "tasty", and "fragrant" because there is no designation in the designation field 292. , the impression word with the smallest maximum score value is changed to the impression word "full stomach" (see FIG. 24).

ここで、軸１１６の中で、最もスコアの最大値が小さい印象語は、「香りが良い」である。したがって、画面２９１Ａでは、印象語「香りが良い」を印象語「おなかいっぱい」に変更した軸１１６Ａが表示される。また、画面２９１Ａでは、軸１１６Ａに合わせた棒グラフ１１７Ａが表示される。 Here, the impression word with the smallest maximum score on the axis 116 is "fragrant". Therefore, on screen 291A, axis 116A is displayed in which the impression word "smells good" is changed to the impression word "full". A bar graph 117A aligned with the axis 116A is displayed on the screen 291A.

さらに、本実施形態では、画面２９１Ａに、変更された印象語「香りが良い」と、通知欄１１８において軸の候補とされていた印象語「おなかにやさしい」とを、印象語「おかないっぱい」と入れ代えることができる軸の候補として、通知欄１１８Ａに表示させる。 Furthermore, in the present embodiment, the changed impression word "smells good" and the impression word "stomach-friendly" which was a candidate for the axis in the notification column 118 are displayed on the screen 291A. ” is displayed in the notification column 118A as an axis candidate that can be replaced with “.

本実施形態では、このように、変更された軸を、軸の候補として通知するため、変更前のグラフを表示させたい場合には、通知欄１１８Ａに表示された、変更された軸「香りが良い」を選択すれば良く、簡単に軸を変更する前の表示に戻すことができる。 In this embodiment, the changed axis is notified as a candidate for the axis in this way. Therefore, when the graph before the change is desired to be displayed, the changed axis "fragrance" displayed in the notification column 118A is displayed. You can easily return to the display before changing the axis by selecting "OK".

図２９（Ｃ）に示す画面２９１Ｂは、画面２９１の指定欄２９２において、軸とされた印象語「香りが良い」が指定された状態で、軸の候補とされた印象語「おなかいっぱい」が選択された場合に、画面２９１から遷移する画面の例である。 A screen 291B shown in FIG. 29C shows a state in which the impression word "smells good" is specified in the specification field 292 of the screen 291, and the impression word "full stomach" is specified as a candidate for the axis. This is an example of a screen transitioned from the screen 291 when selected.

画面２９１Ｂの指定欄２９２Ａでは、印象語「香りが良い」を指定するチェックが表示されている。よって、画面２９１Ｂでは、印象語「香りが良い」は、軸として表示が維持される。 In the specification column 292A of the screen 291B, a check for specifying the impression word "fragrance is good" is displayed. Therefore, on the screen 291B, the impression word "smells good" is kept displayed as an axis.

また、軸１１６のうち、印象語「香りが良い」以外の印象語で、最もスコアの最大値が小さい印象語は、「美味しい」である。したがって、画面２９１Ｂでは、印象語「美味しい」を印象語「おなかいっぱい」に変更した軸１１６Ｂが表示される。また、画面２９１Ｂでは、軸１１６Ａに合わせた棒グラフ１１７Ｂが表示される。 Further, among the impression words other than the impression word "fragrant" on the axis 116, the impression word with the smallest maximum score value is "delicious". Therefore, on screen 291B, axis 116B is displayed in which the impression word "delicious" is changed to the impression word "full". A bar graph 117B aligned with the axis 116A is displayed on the screen 291B.

さらに、画面２９１Ｂには、変更された印象語「美味しい」と、通知欄１１８において軸の候補とされていた印象語「おなかにやさしい」とを、印象語「おかないっぱい」と入れ代えることができる軸の候補として、通知欄１１８Ｂに表示させる。 Further, on screen 291B, it is possible to replace the changed impression word "delicious" with the impression word "stomach-friendly" which was a candidate for the axis in the notification column 118 with the impression word "full of food." This is displayed in the notification column 118B as possible axis candidates.

このように、本実施形態によれば、軸を変更する場合に、ユーザが指定した軸を維持することができる。 Thus, according to this embodiment, when changing the axis, it is possible to maintain the axis specified by the user.

尚、図２９では、棒グラフの例を説明したが、レーダチャートが表示された画面においても、同様に軸の変更と指定された軸の維持を行うことができる。 Although FIG. 29 illustrates an example of a bar graph, it is also possible to change the axis and maintain the designated axis on the screen displaying the radar chart as well.

（第五の実施形態）
以下に図面を参照して第五の実施形態について説明する。第五の実施形態は、軸が変更された場合に、この変更を受けて印象語マップのスコアを補正する点が第四の実施形態と相違する。よって、以下の第五の実施形態の説明では、第四の実施形態との相違点について説明し、第四の実施形態と同様の機能構成を有するものには、第四の実施形態の説明で用いた符号と同様の符号を付与し、その説明を省略する。 (Fifth embodiment)
A fifth embodiment will be described below with reference to the drawings. The fifth embodiment differs from the fourth embodiment in that when the axis is changed, the score of the impression word map is corrected in response to this change. Therefore, in the following description of the fifth embodiment, the differences from the fourth embodiment will be described, and those having the same functional configuration as the fourth embodiment will be described in the fourth embodiment. The same reference numerals as the used reference numerals are given, and the explanation thereof is omitted.

図３０は、第五の実施形態の検索装置の機能を説明する図である。本実施形態の検索装置２００Ｄは、インデックス記憶部２１０、印象語マップ記憶部２２０Ｂ、検索対象スコア記憶部２３０Ａ、検索処理部２４０Ｄ、ユーザ情報記憶部２７０を有する。 FIG. 30 is a diagram explaining the functions of the search device of the fifth embodiment. The search device 200D of this embodiment has an index storage unit 210, an impression word map storage unit 220B, a search target score storage unit 230A, a search processing unit 240D, and a user information storage unit 270.

本実施形態の印象語マップ記憶部２２０Ｂは、印象語マップ２２１Ａと、補正リスト２２２とを格納する。補正リスト２２２は、印象語スコア出力部２４４Ａにおいて取得されたスコアを補正する際に参照される。 The impression word map storage unit 220B of the present embodiment stores an impression word map 221A and a correction list 222. FIG. The correction list 222 is referred to when correcting the score obtained by the impression word score output unit 244A.

本実施形態の検索処理部２４０Ｄは、第四の実施形態の検索処理部２４０Ｃに加え、補正リスト生成部２５２と、スコア補正部２５３とを有する。 The search processing unit 240D of this embodiment has a correction list generation unit 252 and a score correction unit 253 in addition to the search processing unit 240C of the fourth embodiment.

補正リスト生成部２５２は、軸変更部２５１により軸が変更された場合に、軸の変更又は表示の維持に応じて、印象語スコア出力部２４４Ａから出力されるスコアを補正するための補正リスト２２２を更新していく。補正リスト２２２の詳細は後述する。 When the axis is changed by the axis changing unit 251, the correction list generation unit 252 generates a correction list 222 for correcting the score output from the impression word score output unit 244A according to the change of the axis or the maintenance of the display. will be updated. Details of the correction list 222 will be described later.

スコア補正部２５３は、印象語スコア出力部２４４Ａにより出力されたスコアを、補正リスト２２２に基づいて補正する。 The score correction unit 253 corrects the score output by the impression word score output unit 244A based on the correction list 222. FIG.

本実施形態のユーザ情報記憶部２７０は、ユーザ情報２７１を格納している。ユーザ情報２７１は、検索装置２００Ｄを含む検索システムを利用するユーザを認証するための情報である。具体的には、ユーザ情報２７１は、ユーザを識別するユーザＩＤと、パスワード等である。本実施形態のユーザ情報２７１は、予め検索装置２００Ｄに与えられていても良い。 The user information storage unit 270 of this embodiment stores user information 271 . The user information 271 is information for authenticating a user who uses the search system including the search device 200D. Specifically, the user information 271 includes a user ID for identifying the user, a password, and the like. The user information 271 of this embodiment may be given to the search device 200D in advance.

次に、図３１を参照して、本実施形態の補正リスト生成部２５２の処理について説明する。図３１は、第五の実施形態の補正リスト生成部の処理を説明するフローチャートである。 Next, with reference to FIG. 31, processing of the correction list generation unit 252 of this embodiment will be described. FIG. 31 is a flow chart for explaining processing of the correction list generation unit of the fifth embodiment.

図３１のステップＳ３１０１からステップＳ３１０５までの処理は、図２８のステップＳ２８０１からステップＳ２８０５までの処理と同様であるから、説明を省略する。 Since the processing from step S3101 to step S3105 in FIG. 31 is the same as the processing from step S2801 to step S2805 in FIG. 28, description thereof is omitted.

検索処理部２４０Ｄは、軸の候補が変更されると、補正リスト生成部２５２により、変更を行ったユーザのユーザ情報と、軸の変更又は軸の指定に応じて、補正リスト２２２の更新を行い（ステップＳ３１０６）、処理を終了する。 When the candidate for the axis is changed, the search processing unit 240D updates the correction list 222 by the correction list generation unit 252 according to the user information of the user who made the change and the change of the axis or designation of the axis. (Step S3106), the process ends.

以下に、ステップＳ３１０６の処理について、さらに説明する。はじめに、軸を変更する際に、ユーザによる軸の指定が行われなかった場合について説明する。 The processing of step S3106 will be further described below. First, the case where the user does not specify the axis when changing the axis will be described.

この場合、補正リスト生成部２５２は、変更の対象とされる軸を示す印象語と、この印象語のスコアから減算する値と、クエリから取得された検索語群と、ユーザ情報とを対応付けて、補正リスト２２２へ格納する。尚、変更の対象とされる軸を示す印象語とは、印象語スコア出力部２４４Ａにより軸として抽出された印象語のうち、スコアの最大値が最も小さい印象語である。 In this case, the correction list generation unit 252 associates the impression word indicating the axis to be changed, the value to be subtracted from the score of this impression word, the search word group obtained from the query, and the user information. and store it in the correction list 222 . The impression word indicating the axis to be changed is the impression word with the smallest maximum score among the impression words extracted as the axis by the impression word score output unit 244A.

次に、軸を変更する際に、ユーザによる軸の指定が行われた場合について説明する。この場合、補正リスト生成部２５２は、ユーザにより指定された軸を示す印象語と、この印象語のスコアに加算する値と、クエリから取得された検索語群と、ユーザ情報とを対応付けて、補正リスト２２２へ格納する。 Next, the case where the user designates an axis when changing the axis will be described. In this case, the correction list generation unit 252 associates the impression word indicating the axis specified by the user, the value to be added to the score of this impression word, the search word group acquired from the query, and the user information. , is stored in the correction list 222 .

以下に、図３２を参照して、本実施形態の補正リスト２２２について説明する。図３２は、第五の実施形態の補正リストの一例を示す図である。 The correction list 222 of this embodiment will be described below with reference to FIG. FIG. 32 is a diagram showing an example of a correction list according to the fifth embodiment.

本実施形態の補正リスト２２２は、情報の項目として、ユーザＩＤ、検索語群、変更点を有し、これらが対応付けられている。項目「ユーザＩＤ」の値は、ユーザを識別するためのユーザＩＤを示す。項目「検索語群」の値は、入力されたクエリから取得された検索語群を示す。項目「変更点」の値は、変更された印象語と、この印象語のスコアから減算される値、又は、ユーザに表示を維持する軸として指定された印象語と、この印象語のスコアに加算する値を示す。 The correction list 222 of this embodiment has, as information items, a user ID, a search term group, and a point of change, which are associated with each other. The value of the item "user ID" indicates a user ID for identifying the user. The value of the item "search term group" indicates the search term group acquired from the input query. The value of the item “change point” is the value subtracted from the changed impression word and the score of this impression word, or the impression word specified as the axis to keep displayed to the user and the score of this impression word. Indicates the value to add.

以下の説明では、補正リスト２２２において、項目「ユーザＩＤ」の値と、その他の値とを含む情報を、補正情報と呼ぶ。また、以下の説明では、印象語のスコアに加算する値（第一の補正値）又は印象語のスコアから減算する値（第二の補正値）を、スコア補正値と呼ぶ。スコア補正値は、予め設定されて、補正リスト生成部２５２に保持されていても良い。 In the following description, information including the value of the item “user ID” and other values in the correction list 222 is called correction information. Further, in the following description, a value to be added to the impression word score (first correction value) or a value to be subtracted from the impression word score (second correction value) will be referred to as a score correction value. The score correction value may be set in advance and held in the correction list generator 252 .

図３２の例では、ユーザＩＤ「０００１」を含む補正情報２２２－１では、項目「検索語群」の値が、「天然」、「素材」、「野菜」、「レストラン」であり、項目「変更点」の値が、「香りが良い」、「－０．１」である。 In the example of FIG. 32, in the correction information 222-1 including the user ID "0001", the values of the item "search word group" are "nature", "ingredients", "vegetables", and "restaurant", and the item " The values of "change point" are "fragrance is good" and "-0.1".

よって、補正情報２２２－１は、ユーザＩＤ「０００１」のユーザが入力したクエリから取得された検索語群「天然」、「素材」、「野菜」、「レストラン」で検索を行った検索結果集合データを表示させた際に、軸とされた印象語「香りが良い」が他の印象語に変更されたことを示している。また、補正情報２２２－１は、このとき、ユーザによって、軸の中に、表示を維持するように指定された印象語がなく、印象語「香りが良い」からスコア補正値である「０．１」が減算されることを示している。 Therefore, the correction information 222-1 is a set of search results obtained by searching with the search term group "nature", "ingredients", "vegetables", and "restaurant" acquired from the query input by the user with the user ID "0001". This indicates that the impression word "smells good" used as the axis was changed to another impression word when the data was displayed. Further, the correction information 222-1 is, at this time, the score correction value "0. 1” is subtracted.

また、図３２の補正情報２２２－２では、ユーザＩＤ「０００２」のユーザが入力したクエリから取得された検索語群「がっつり」、「中華」で検索を行った検索結果集合データを表示させ際に、ユーザによって、軸のうち、印象語「おなかいっぱい」の表示を維持するように指定されたことを示す。また、補正情報２２２－２は、印象語「おなかいっぱい」のスコアにスコア補正値である「０．１」が加算されることを示している。 Further, in the correction information 222-2 of FIG. 32, when displaying the search result set data obtained by searching with the search word group "Gattsuri" and "Chinese" acquired from the query input by the user with the user ID "0002", indicates that the user has designated to keep the display of the impression word "full" among the axes. Further, the correction information 222-2 indicates that the score correction value "0.1" is added to the score of the impression word "full".

次に、図３３を参照して、本実施形態の印象語スコア出力部２４４Ａ、軸決定部２４５Ａ、スコア補正部２５３の処理について説明する。図３３は、第五の実施形態の印象語スコア出力部、軸決定部、スコア補正部の処理を説明するフローチャートである。 Next, the processing of the impression word score output unit 244A, the axis determination unit 245A, and the score correction unit 253 of this embodiment will be described with reference to FIG. FIG. 33 is a flowchart for explaining the processing of the impression word score output section, axis determination section, and score correction section of the fifth embodiment.

尚、本実施形態の検索装置２００Ｄでは、ユーザがクエリを入力する前に、ユーザ情報の入力を受け付けて、ログイン処理が行われるものとした。 In addition, in the search device 200D of the present embodiment, before the user inputs a query, input of user information is accepted and login processing is performed.

図３３のステップＳ３３０１とステップＳ３３０２の処理は、図２３のステップＳ２３０１とステップＳ２３０２の処理と同様であるから、説明を省略する。 The processing in steps S3301 and S3302 in FIG. 33 is the same as the processing in steps S2301 and S2302 in FIG. 23, so description thereof will be omitted.

ステップＳ３３０２において、印象語毎のスコアと標準偏差を取得すると、スコア補正部２５３により、補正リスト２２２に、入力されたユーザＩＤを含む補正情報が存在するか否かを判定する（ステップＳ３３０３）。ステップＳ３３０３において、該当する補正情報が存在しない場合、検索処理部２４０Ｄは、後述するステップＳ３３０６へ進む。 After obtaining the score and standard deviation for each impression word in step S3302, the score correction unit 253 determines whether correction information including the input user ID exists in the correction list 222 (step S3303). In step S3303, if the corresponding correction information does not exist, the search processing unit 240D proceeds to step S3306, which will be described later.

ステップＳ３３０３において、該当する補正情報が存在する場合、スコア補正部２５３は、該当する補正情報に、入力されたクエリから取得した検索語群と、検索語群が一致する補正情報が存在するか否かを判定する（ステップＳ３３０７）。ステップＳ３３０７において、該当する補正情報が存在しない場合、検索処理部２４０Ｄは、後述するステップＳ３３０６へ進む。 In step S3303, if the corresponding correction information exists, the score correction unit 253 determines whether or not the corresponding correction information contains correction information in which the search term group matches the search term group obtained from the input query. (step S3307). In step S3307, if the corresponding correction information does not exist, the search processing unit 240D proceeds to step S3306, which will be described later.

ステップＳ３３０７において、該当する補正情報が存在する場合、スコア補正部２５３は、この補正情報の項目「変更点」の値を参照し、ステップＳ３３０２で取得した印象語毎のスコアを補正し（ステップＳ３３０５）、ステップＳ３３０６へ進む。 In step S3307, if the corresponding correction information exists, the score correction unit 253 refers to the value of the item "change point" of this correction information, and corrects the score for each impression word acquired in step S3302 (step S3305). ), and the process advances to step S3306.

ステップＳ３３０６からステップＳ３３１１までの処理は、図２３のステップＳ２３０３からステップＳ２３０８までの処理と同様であるから、説明を省略する。 Since the processing from step S3306 to step S3311 is the same as the processing from step S2303 to step S2308 in FIG. 23, description thereof is omitted.

以下に、図２４を参照して、図３３の処理を具体的に説明する。ここでは、例えば、ユーザＩＤ「０００１」のユーザが、「天然の素材や野菜を使うレストラン」というクエリを入力し、検索語群１１１Ａが取得された場合について説明する。 The processing of FIG. 33 will be specifically described below with reference to FIG. Here, for example, a case where the user with the user ID "0001" inputs the query "restaurants using natural ingredients and vegetables" and the search term group 111A is acquired will be described.

この場合、検索処理部２４０Ｄは、印象語スコア出力部２４４Ａにより、単語毎印象語スコア１１３Ａを得る。このとき、スコア補正部２５３は、補正リスト２２２を参照し、ユーザＩＤ「０００１」を含む補正情報が存在するか否かを判定する。 In this case, the search processing unit 240D obtains the impression word score for each word 113A from the impression word score output unit 244A. At this time, the score correction unit 253 refers to the correction list 222 and determines whether correction information including the user ID "0001" exists.

ここで、補正リスト２２２には、ユーザＩＤ「０００１」を含む補正情報２２２－１が存在する（図３２参照）。したがって、スコア補正部２５３は、補正情報２２２－１の項目「検索語群」の値と、入力さたれクエリから取得した検索語群１１１Ａとが一致するか否かを判定する。 Here, the correction information 222-1 including the user ID "0001" exists in the correction list 222 (see FIG. 32). Therefore, the score correction unit 253 determines whether or not the value of the item “search term group” in the correction information 222-1 matches the search term group 111A acquired from the input query.

ここでは、両者は一致するため、スコア補正部２５３は、補正情報２２２－１の項目「変更点」を参照し、単語毎印象語スコア１１３Ａの印象語「香りが良い」と対応するスコアから０．１を減算する補正を行う。 Here, since both match, the score correction unit 253 refers to the item “change point” of the correction information 222-1, and the score corresponding to the impression word “smells good” in the impression word score for each word 113A is reduced to 0. .A correction is made by subtracting 1.

このように、本実施形態では、ユーザが一度軸となる印象語を変更したり、表示を維持する印象語を指定した場合に、この操作の履歴を補正情報としてユーザ毎に保持しておく。そして、本実施形態では、クエリから取得される検索語群が一致する場合には、この補正情報を用いて印象語のスコアを補正することで、検索結果集合データを表示させる際の項目に、ユーザの意図を反映させることができる。 As described above, in this embodiment, when the user once changes the impression word used as the axis or designates an impression word to be displayed, the history of this operation is held as correction information for each user. Then, in this embodiment, when the search term group obtained from the query matches, the score of the impression word is corrected using this correction information. User's intention can be reflected.

（第六の実施形態）
以下に図面を参照して第六の実施形態について説明する。第六の実施形態は、出力軸上で印象語のスコアを変更する操作を受け付けた際に、適切な数の検索結果集合データを表示させる点が、第一の実施形態と相違する。よって、以下の第六の実施形態の説明では、第一の実施形態との相違点について説明し、第一の実施形態と同様の機能構成を有するものには、第一の実施形態の説明で用いた符号と同様の符号を付与し、その説明を省略する。 (Sixth embodiment)
A sixth embodiment will be described below with reference to the drawings. The sixth embodiment differs from the first embodiment in that an appropriate number of search result set data is displayed when an operation to change the impression word score is received on the output axis. Therefore, in the following description of the sixth embodiment, differences from the first embodiment will be described, and those having the same functional configuration as the first embodiment will be described in the first embodiment. The same reference numerals as the used reference numerals are given, and the explanation thereof is omitted.

図３４は、第六の実施形態の表示例について説明する図である。図３４に示す画面３４１では、複数の出力軸３４２と、複数の検索結果集合データ３４３とが表示されている。 FIG. 34 is a diagram explaining a display example of the sixth embodiment. A screen 341 shown in FIG. 34 displays a plurality of output shafts 342 and a plurality of search result set data 343 .

画面３４１において、出力軸３４２は、入力されたクエリに基づく検索語群と関連が強いとされた印象語を示す。 On the screen 341, an output axis 342 indicates impression words strongly related to the search word group based on the input query.

また、画面３４１では、出力軸３４２とされた印象語毎に、印象語のスコアを変更するためのスケール３４４とスライダ３４５とが表示される。本実施形態では、スケール３４４上でスライダ３４５を移動させ、スライダ３４５がスケール３４４上に予め設けられた目盛り上で停止すると、印象語のスコアが、スライダ３４５が停止した位置と対応する値に変更される。印象語のスコアが変更されると、検索結果集合データ３４３は、変更後のスコアを用いた検索結果となる。 Also, on the screen 341, a scale 344 and a slider 345 for changing the score of the impression word are displayed for each impression word on the output axis 342. FIG. In this embodiment, when the slider 345 is moved on the scale 344 and the slider 345 stops on a scale provided in advance on the scale 344, the impression word score changes to a value corresponding to the position where the slider 345 stops. be done. When the impression word score is changed, the search result set data 343 becomes search results using the changed score.

尚、以下の説明では、スケール上のスライダは、目盛り上において停止するものとし、スケール上に付された目盛りをスライダ停止位置と呼ぶ。また、以下の説明では、スケールにおいてスライダ停止位置が示す値（目盛り）をスライダ値と表現することがある。 In the following description, it is assumed that the slider on the scale stops on the scale, and the scale marked on the scale is called the slider stop position. Further, in the following description, the value (scale) indicated by the slider stop position on the scale may be expressed as the slider value.

検索結果集合データ３４３は、検索処理により特定された文書ＩＤが示す文書データである。画面３４１では、クエリと最も一致の度合いが高い文書データにおける、出力軸「プレミア」、「落ち着き」、「ガッツリ」のそれぞれのスコアと、複数の検索結果集合データとして抽出された他の文書データとして表示される。 The search result set data 343 is document data indicated by the document ID specified by the search process. On the screen 341, the scores for each of the output axis "Premium", "Calmness", and "Gattsuri" in the document data with the highest degree of matching with the query, and other document data extracted as a plurality of sets of search result data Is displayed.

ここで、印象語のスコアが変更された場合の検索結果集合データの表示について説明する。印象語のスコアと対応する検索結果（検索結果集合データ）の数は、抽出された複数の検索結果集合データにおけるスコアの分布によって異なる。このため、スケール上でスライダを移動させてスコアの値を変更した場合、多数の検索結果集合データが抽出されたり、わずかな検索結果集合データしか抽出されなかったりする可能性がある。 Here, the display of the search result set data when the score of the impression word is changed will be described. The impression word score and the number of corresponding search results (search result set data) differ depending on the score distribution in the plurality of extracted search result set data. Therefore, when the score value is changed by moving the slider on the scale, a large amount of search result set data may be extracted, or only a small amount of search result set data may be extracted.

図３５は、印象語のスコアの変更について説明する図である。図３５（ａ）は、出力軸とされた印象語「落着き」、「ガッツリ」のそれぞれのスコアの分布を示す図である。図３５（ｂ）は、スライダが示すスコアと検索結果集合データの関係を示す図である。 FIG. 35 is a diagram explaining how to change the scores of impression words. FIG. 35(a) is a diagram showing the score distribution of each of the impression words "calm" and "gatsuri" that are used as the output axis. FIG. 35(b) is a diagram showing the relationship between the score indicated by the slider and the set data of search results.

図３５（ａ）に示すように、印象語「落着き」は、検索結果集合データに対するスコアの偏りが小さく、印象語「ガッツリ」は、検索結果集合データに対するスコアの偏りが大きい。 As shown in FIG. 35(a), the impression word "calm" has a small bias in score against the search result set data, and the impression word "gatsuri" has a large score bias against the search result set data.

ここで、図３５（ｂ）に示すように、例えば、スケールの始点であるスライダ停止位置（スライダ値「０」）と、スケールの終点であるスライダ停止位置（スライダ値「６」）との間に、等間隔に５つのスライダ停止位置（スライダ値「１」～「５」）を設け、各スライダ停止位置にスライダを合わせてスコアを変更する場合を考える。 Here, as shown in FIG. 35B, for example, between the slider stop position (slider value "0") that is the start point of the scale and the slider stop position (slider value "6") that is the end point of the scale. , five slider stop positions (slider values "1" to "5") are provided at equal intervals, and the score is changed by adjusting the slider to each slider stop position.

この場合、印象語「落着き」は、スコアの偏りが小さいため、スライダをどのスライダ停止位置に合わせても、ある程度の数の検索結果集合データの数が出力される。 In this case, since the score bias for the impression word "calm" is small, a certain number of pieces of search result set data are output no matter which slider stop position the slider is set to.

これに対し、印象語「ガッツリ」は、スコアの偏りが大きいため、スライダ停止位置によっては、検索結果集合データがまったく出力されなかったり、多数の検索結果集合データが出力されたりする可能性がある。言い換えれば、印象語「ガッツリ」は、スケール上でスライダを移動させてスコアを変更した場合に、検索結果集合データが適切に出力されない可能性がある。 On the other hand, the impression word "gutsuri" has a large score bias, so depending on the slider stop position, there is a possibility that no search result set data is output or a large amount of search result set data is output. . In other words, for the impression word "gutsuri", if the slider is moved on the scale to change the score, there is a possibility that the search result set data will not be output appropriately.

そこで、本実施形態では、出力軸とされた印象語のスコアの分布を算出し、スケールにおけるスライダ停止位置とスコアの分布に応じたスコアの変動幅とを対応付ける。言い換えれば、本実施形態によれば、出力軸毎に、スコアの変動幅と対応して表示される検索結果集合データの数が所定数となるように、スケールにおける各スライダ停止位置とスコアの変動幅とを対応づけて表示させる。尚、所定数とは、一定の数であっても良いし、所定の範囲内の数であっても良い。具体的には、スコアの変動幅に対応しで表示される検索結果集合データの数を１０件としても良いし、８～１２件としても良い。 Therefore, in the present embodiment, the distribution of scores of impression words used as the output axis is calculated, and the slider stop position on the scale and the variation range of scores according to the distribution of scores are associated with each other. In other words, according to the present embodiment, each slider stop position on the scale and score fluctuation are adjusted so that the number of pieces of search result set data displayed corresponding to the score fluctuation range for each output axis is a predetermined number. The width is displayed in correspondence with the width. The predetermined number may be a fixed number or a number within a predetermined range. Specifically, the number of pieces of search result set data to be displayed corresponding to the fluctuation range of the score may be 10, or may be 8 to 12.

本実施形態では、この対応付けを行うことで、スケール上でスライダを移動させた際に、一定数の検索結果集合データを表示させることができる。 In this embodiment, by performing this correspondence, it is possible to display a certain number of search result set data when the slider is moved on the scale.

また、本実施形態によれば、出力軸（印象語）のスコアを変更し、変更後のスコアと対応する検索結果集合データをユーザに提示する。したがって、本実施形態によれば、例えば、クエリから推定されるスコアからユーザが望む情報が得られなかった場合等には、印象語のスコアを変更した新たな検索結果をユーザに提供することができる。 Further, according to the present embodiment, the score of the output axis (impression word) is changed, and the search result set data corresponding to the changed score is presented to the user. Therefore, according to the present embodiment, for example, when the information desired by the user cannot be obtained from the score estimated from the query, it is possible to provide the user with a new search result in which the score of the impression word is changed. can.

以下に、図３６を参照して、本実施形態の検索装置２００Ｅの機能について説明する。
図３６は、第六の実施形態の検索装置の機能を説明する図である。 The functions of the search device 200E of this embodiment will be described below with reference to FIG.
FIG. 36 is a diagram explaining the functions of the search device of the sixth embodiment.

本実施形態の検索装置２００Ｅは、検索対象インデックス記憶部２１０、印象語マップ記憶部２２０、検索対象スコア記憶部２３０Ａ、検索処理部２４０Ｅを有する。 The search device 200E of this embodiment has a search target index storage unit 210, an impression word map storage unit 220, a search target score storage unit 230A, and a search processing unit 240E.

本実施形態の検索処理部２４０Ｅは、入力受付部２４１、解析部２４２、検索部２４３、印象語スコア出力部２４４、軸決定部２４５、出力部２４６Ａ、インデックス生成部２４７、印象空間学習部２４８、スコア分布算出部２６０、スケール算出部２６１を有する。 The search processing unit 240E of the present embodiment includes an input reception unit 241, an analysis unit 242, a search unit 243, an impression word score output unit 244, an axis determination unit 245, an output unit 246A, an index generation unit 247, an impression space learning unit 248, It has a score distribution calculator 260 and a scale calculator 261 .

本実施形態のスコア分布算出部２６０は、軸決定部２４５により、出力軸に決定された印象語毎に、スコアの分布を算出する。より具体的には、スコア分布算出部２６０は、軸決定部２４５により、出力軸に決定された印象語毎に、印象語スコアリスト２３１Ａにおけるスコアの分布を示すヒストグラムを生成する。 The score distribution calculation unit 260 of the present embodiment calculates the score distribution for each impression word determined as the output axis by the axis determination unit 245 . More specifically, score distribution calculating section 260 generates a histogram showing the distribution of scores in impression word score list 231A for each impression word determined as the output axis by axis determining section 245 .

本実施形態のスケール算出部２６１は、スコア分布算出部２６０により算出されたスコアの分布に基づき、印象語のスコアを示すスケールに付されたスライダ停止位置と対応するスコアの変動幅を算出する。具体的には、スケール算出部２６１は、スライダ値にスコアの変動幅を対応付けた対応テーブル４１０を作成し、保持する。 Based on the score distribution calculated by the score distribution calculator 260, the scale calculator 261 of the present embodiment calculates the variation range of the score corresponding to the slider stop position attached to the scale indicating the score of the impression word. Specifically, the scale calculator 261 creates and holds a correspondence table 410 in which the slider value is associated with the score fluctuation range.

スコア分布算出部２６０、スケール算出部２６１の処理と、対応テーブル４１０の詳細は後述する。 Details of the processing of the score distribution calculation unit 260 and the scale calculation unit 261 and the correspondence table 410 will be described later.

本実施形態の出力部２４６Ａは、対応テーブル４１０を参照して、検索結果集合データのレーダチャートを表示させる。 The output unit 246A of this embodiment refers to the correspondence table 410 and displays the radar chart of the search result set data.

以下に、図３７を参照して、本実施形態の検索処理部２４０Ｅの処理について説明する。図３７は、第六の実施形態の検索処理部の処理を説明する第一のフローチャートである。 The processing of the search processing unit 240E of this embodiment will be described below with reference to FIG. FIG. 37 is the first flow chart for explaining the processing of the search processing section of the sixth embodiment.

図３７のステップＳ３７０１からステップＳ３７０４までの処理は、図９のステップＳ９０１からステップＳ９０４までの処理と同様であるから、説明を省略する。 The processing from step S3701 to step S3704 in FIG. 37 is the same as the processing from step S901 to step S904 in FIG. 9, so description thereof will be omitted.

ステップＳ３７０４において、軸決定部２４５により、クエリとの関連が明確なｎ個の印象語が抽出されて、α軸に設定されると、検索処理部２４０Ｅは、スコア分布算出部２６０により、ステップＳ３７０１で特定された複数の検索結果集合データにおける出力軸（α軸）毎のスコアの分布を算出する（ステップＳ３７０５）。 In step S3704, the axis determining unit 245 extracts n impression words clearly related to the query and sets them on the α-axis. Score distribution for each output axis (α-axis) in the plurality of search result set data specified in (step S3705).

続いて、検索処理部２４０Ｅは、スケール算出部２６１により、出力軸毎に、スライダ停止位置とスコアの変動幅とを対応付ける（ステップＳ３７０６）。 Subsequently, the search processing unit 240E causes the scale calculation unit 261 to associate the slider stop position with the score fluctuation range for each output axis (step S3706).

続いて、検索処理部２４０Ｅは、出力部２４６Ａにより、α軸を設定したときの印象語のスコアをスケール上の初期値に設定する（ステップＳ３７０７）。 Subsequently, the search processing unit 240E uses the output unit 246A to set the score of the impression word when the α-axis is set to the initial value on the scale (step S3707).

続いて、出力部２４６Ａは、検索結果集合データを、出力軸となる印象語と対応するスケール及びスライダと共に描画し（ステップＳ３７０８）、処理を終了する。 Subsequently, the output unit 246A draws the search result set data together with the scale and slider corresponding to the impression word as the output axis (step S3708), and ends the process.

次に、図３８を参照して、本実施形態のスコア分布算出部２６０の処理について説明する。図３８は、第六の実施形態の検索処理部の処理を説明する第二のフローチャートである。図３８では、図３７のステップＳ３７０５の処理の詳細を示している。 Next, with reference to FIG. 38, processing of the score distribution calculation unit 260 of this embodiment will be described. FIG. 38 is a second flowchart for explaining the processing of the search processing section of the sixth embodiment. FIG. 38 shows details of the processing in step S3705 of FIG.

本実施形態のスコア分布算出部２６０は、軸決定部２４５によって決定された出力軸のうち、一つを選択し、他の出力軸のスコアを固定する（ステップＳ３８０１）。続いて、スコア分布算出部２６０は、選択した出力軸のスコアを変更する（ステップＳ３８０２）。尚、ここで、選択された出力軸のスコアは、所定間隔毎に増減されても良い。より具体的には、例えば、出力軸のスコアの最小値を０、最大値を１０とした場合、スコアを０、１、２、・・・、１０というように変更していく。 The score distribution calculation unit 260 of the present embodiment selects one of the output axes determined by the axis determination unit 245, and fixes the scores of the other output axes (step S3801). Subsequently, the score distribution calculator 260 changes the score of the selected output axis (step S3802). Here, the score of the selected output axis may be increased or decreased at predetermined intervals. More specifically, for example, if the minimum value of the output axis score is 0 and the maximum value is 10, the score is changed to 0, 1, 2, .

続いて、スコア分布算出部２６０は、選択された出力軸のスコアが変更された状態で、各出力軸のスコアと、検索結果集合データとして抽出された文書データと対応する各印象語のスコアとのベクトルマッチングを行う（ステップＳ３８０３）。具体的には、スコア分布算出部２６０は、コサイン類似度に基づくベクトルマッチングを行っても良い。尚、検索結果集合データとして、複数の文書データが抽出されているものとする。 Next, the score distribution calculation unit 260 calculates the score of each output axis and the score of each impression word corresponding to the document data extracted as the search result set data in a state where the score of the selected output axis is changed. vector matching is performed (step S3803). Specifically, the score distribution calculator 260 may perform vector matching based on cosine similarity. It is assumed that a plurality of document data are extracted as the search result set data.

続いて、スコア分布算出部２６０は、ベクトルマッチングの結果が所定の閾値以上となる文書ＩＤを特定し、特定された文書ＩＤの数を取得する（ステップＳ３８０４）。 Subsequently, the score distribution calculation unit 260 identifies document IDs for which the result of vector matching is equal to or greater than a predetermined threshold, and acquires the number of identified document IDs (step S3804).

続いて、スコア分布算出部２６０は、選択した出力軸のスコアを最小値から最大値まで変更したか否かを判定する（ステップＳ３８０５）。言い換えれば、スコア分布算出部２６０は、スコアの最小値から最大値までについて、所定の間隔毎に、ベクトルマッチングを行ったか否かを判定している。 Subsequently, the score distribution calculator 260 determines whether or not the score of the selected output axis has been changed from the minimum value to the maximum value (step S3805). In other words, the score distribution calculation unit 260 determines whether or not vector matching has been performed for each predetermined interval from the minimum value to the maximum value of the score.

ステップＳ３８０５において、スコアの最小値から最大値までについて所定の間隔毎にベクトルマッチングを行っていない場合、スコア分布算出部２６０は、ステップＳ３８０２に戻る。 In step S3805, if vector matching has not been performed for each predetermined interval between the minimum and maximum score values, score distribution calculation section 260 returns to step S3802.

ステップＳ３８０５において、スコア最小値から最大値までについて所定の間隔毎にベクトルマッチングを行った場合、スコア分布算出部２６０は、所定の間隔毎の文書ＩＤの数から、ヒストグラムを生成する（ステップＳ３８０６）。 In step S3805, when vector matching is performed for each predetermined interval from the minimum score value to the maximum score value, the score distribution calculation unit 260 generates a histogram from the number of document IDs for each predetermined interval (step S3806). .

続いて、スコア分布算出部２６０は、全ての出力軸について、ステップＳ３８０１からステップＳ３８０６までの処理を行ったか否かを判定する（ステップＳ３８０７）。尚、ここでは、スコア分布算出部２６０は、出力軸のうち、α軸について、ステップＳ３８０１からステップＳ３８０６までの処理を行ったか否かを判定しても良い。 Subsequently, the score distribution calculation unit 260 determines whether or not the processing from step S3801 to step S3806 has been performed for all output axes (step S3807). Here, the score distribution calculation unit 260 may determine whether or not the processing from step S3801 to step S3806 has been performed for the α-axis among the output axes.

ステップＳ３８０７において、全ての出力軸について処理が行われていない場合、スコア分布算出部２６０は、ステップＳ３８０１に戻る。ステップＳ３８０７において、全ての出力軸について処理が行われると、スコア分布算出部２６０は、処理を終了する。 In step S3807, if processing has not been performed for all output axes, score distribution calculation section 260 returns to step S3801. In step S3807, when all the output axes have been processed, the score distribution calculator 260 terminates the process.

次に、図３９を参照して、本実施形態のスケール算出部２６１の処理について説明する。図３９は、第六の実施形態の検索処理部の処理を説明する第三のフローチャートである。図３８では、図３７のステップＳ３７０６の処理の詳細を示している。 Next, the processing of the scale calculator 261 of this embodiment will be described with reference to FIG. FIG. 39 is the third flow chart for explaining the processing of the search processing section of the sixth embodiment. FIG. 38 shows details of the processing in step S3706 of FIG.

本実施形態のスケール算出部２６１は、出力軸毎に、スコアの範囲（最小値から最大値まで）と、ヒストグラムと、ヒストグラムの生成に際して取得された文書ＩＤの数Ｘを取得する（ステップＳ３９０１）。 The scale calculator 261 of this embodiment acquires the score range (from the minimum value to the maximum value), the histogram, and the number X of document IDs acquired when generating the histogram for each output axis (step S3901). .

続いて、スケール算出部２６１は、出力軸毎に、スケール上に付されるスライダ値（目盛り）の数Ｎを取得する（ステップＳ３９０２）。 Subsequently, the scale calculator 261 acquires the number N of slider values (scales) to be applied on the scale for each output axis (step S3902).

続いて、スケール算出部２６１は、出力軸を選択し（ステップＳ３９０３）、スライダ値と対応する文書ＩＤの数が、Ｘ／Ｎとなるように、対応テーブルを作成する（ステップＳ３９０４）。 Subsequently, the scale calculator 261 selects an output axis (step S3903), and creates a correspondence table so that the number of document IDs corresponding to slider values is X/N (step S3904).

続いて、スケール算出部２６１は、全ての出力軸について、対応テーブルを作成したか否かを判定する（ステップＳ３９０５）。ステップＳ３９０５において、全ての出力軸に対して対応テーブルを作成していない場合、スケール算出部２６１は、ステップＳ３９０３に戻る。 Subsequently, the scale calculator 261 determines whether or not correspondence tables have been created for all output axes (step S3905). In step S3905, if correspondence tables have not been created for all output axes, the scale calculator 261 returns to step S3903.

ステップＳ３９０５において、全ての出力軸に対して対応テーブルを作成した場合、スケール算出部２６１は、処理を終了する。 In step S3905, when the correspondence table is created for all the output axes, the scale calculator 261 ends the process.

ここで、図４０を参照して、本実施形態のスコア分布算出部２６０とスケール算出部２６１の処理について具体的に説明する。図４０は、第六の実施形態のスコアとスライダ値との対応付けを説明する図である。図４０（ａ）は、ヒストグラムの一例を示す図であり、図４０（ｂ）はスコアとスライダ値との対応付けの例を示す図である。 Here, with reference to FIG. 40, the processing of the score distribution calculation unit 260 and the scale calculation unit 261 of this embodiment will be specifically described. FIG. 40 is a diagram for explaining correspondence between scores and slider values according to the sixth embodiment. FIG. 40(a) is a diagram showing an example of a histogram, and FIG. 40(b) is a diagram showing an example of correspondence between scores and slider values.

図４０（ａ）では、横軸が出力軸「プレミア」のスコアを示しており、縦軸が、ベクトルマッチングの結果が閾値以上となった文書ＩＤの数を示す。 In FIG. 40A, the horizontal axis indicates the score of the output axis "premium", and the vertical axis indicates the number of document IDs for which the result of vector matching is equal to or greater than the threshold.

出力軸「プレミア」では、図４０（ａ）からわかるように、スコアの範囲が０～１０であり、スコアが２～５の間は、対応する検索結果集合データが多数存在するが、スコアが８以上である場合や２未満である場合等には、対応する検索結果集合データは、ほぼ存在しない。 On the output axis "premium", as can be seen from FIG. When the number is 8 or more, or when the number is less than 2, there is almost no corresponding search result set data.

したがって、本実施形態では、出力軸「プレミア」を、スケールとして表示させる場合、スライダ値（目盛り）と、文書ＩＤの数とを対応付ける。 Therefore, in this embodiment, when the output axis "premium" is displayed as a scale, the slider value (scale) is associated with the number of document IDs.

図４０（ｂ）は、出力軸「プレミア」に関する対応テーブルの一例である。この場合、スライダ値「０」とスコア「０～３．５」、スライダ値「１」と、スコア「３．５～３．８」、スライダ値「２」と、スコア「３．８～４．６」が対応付けられている。 FIG. 40(b) is an example of a correspondence table relating to the output shaft "premium". In this case, a slider value of "0" and a score of "0 to 3.5", a slider value of "1" and a score of "3.5 to 3.8", a slider value of "2" and a score of "3.8 to 4 .6” is associated.

つまり、出力軸「プレミア」のスケールにおいて、スライダがスライダ値「０」の位置とされた場合には、抽出された検索結果集合データのうち、印象語「プレミア」のスコアが０～３．５となる文書ＩＤと対応する文書データが表示される。また、出力軸「プレミア」のスケールにおいて、スライダがスライダ値「１」の位置とされた場合には、抽出された検索結果集合データのうち印象語「プレミア」のスコアが３．５～３．８となる文書ＩＤと対応する文書データが表示される。 That is, on the scale of the output axis "premium", when the slider is set to the position of the slider value "0", the score of the impression word "premium" in the extracted search result set data is 0 to 3.5. , and the corresponding document data are displayed. On the scale of the output axis "premium", when the slider is set to the position of the slider value "1", the score of the impression word "premium" in the extracted search result set data is 3.5 to 3.5. The document ID of 8 and the corresponding document data are displayed.

このように、本実施形態では、出力軸のスケールに付されるスライダ値とスコアの変動幅とを対応づけることで、スライダがスケール上のどのスライダ値で停止されても、出力される検索結果集合データの数を同等とすることができる。本実施形態では、出力軸全てについて、この対応付けを行い、対応テーブル４１０として保持する。 As described above, in the present embodiment, by associating the slider value attached to the scale of the output axis with the fluctuation range of the score, the search result is output regardless of the slider value on the scale when the slider is stopped. The number of aggregated data can be made equivalent. In this embodiment, this association is made for all output shafts and held as a correspondence table 410 .

図４１は、第六の実施形態の対応テーブルの一例を示す図である。図４１に示す対応テーブル４１０では、出力軸「プレミア」、「隠れ家」、「ヘルシー」のそれぞれについて、スライダ値とスコアの変動幅とが対応付けられている。 FIG. 41 is a diagram showing an example of a correspondence table according to the sixth embodiment. In the correspondence table 410 shown in FIG. 41, slider values and score fluctuation ranges are associated with each of the output axes "premium", "retreat", and "healthy".

対応テーブル４１０では、スライダ値「０」に対して、出力軸「プレミア」はスコア「０～３．５」が対応付けられており、出力軸「隠れ家」ではスコア「０～２．８」が対応付けられており、出力軸「ヘルシー」ではスコア「０～２．９」が対応付けられている。 In the correspondence table 410, a slider value of "0" is associated with a score of "0 to 3.5" for the output axis "Premium", and a score of "0 to 2.8" is associated with the output axis "Retreat". A score of "0 to 2.9" is associated with the output axis "healthy".

また、対応テーブル４１０では、スライダ値「１」に対して、出力軸「プレミア」はスコア「３．５～３．８」が対応付けられており、出力軸「隠れ家」ではスコア「２．８～３．６」が対応付けられており、出力軸「ヘルシー」ではスコア「２．９～３．２」が対応付けられている。 Further, in the correspondence table 410, the score "3.5 to 3.8" is associated with the slider value "1" for the output axis "Premium", and the score "2.8" for the output axis "Retreat". to 3.6" are associated with it, and the output axis "healthy" is associated with a score of "2.9-3.2".

図４２は、第六の実施形態の表示の一例を示す図である。図４２に示す画面４２１は、例えば、端末装置３００等に表示される。 FIG. 42 is a diagram showing an example of display according to the sixth embodiment. A screen 421 shown in FIG. 42 is displayed, for example, on the terminal device 300 or the like.

画面４２１には、入力欄４２２と、結果表示欄４２３とが表示される。入力欄４２２は、クエリを入力するためのクエリ入力欄４２４と、検索要求を行う検索ボタン４２５と、各出力軸と対応した複数のスケール４２６とが表示されている。複数のスケール４２６上には、それぞれにスライダ４２６ａが表示されており、スケール上でスライダを移動させることで、対応する出力軸のスコアが変更される。 An input field 422 and a result display field 423 are displayed on the screen 421 . The input field 422 displays a query input field 424 for inputting a query, a search button 425 for requesting a search, and a plurality of scales 426 corresponding to each output axis. A slider 426a is displayed on each of a plurality of scales 426, and the score of the corresponding output axis is changed by moving the slider on the scale.

結果表示欄４２３には、クエリ入力欄４２４に入力されたクエリに基づき特定された文書データが検索結果集合データとして表示される。図４２の例では、検索結果として、検索結果集合データ４２７、４２８、４２９が表示される。 In the result display field 423, document data specified based on the query input in the query input field 424 is displayed as search result set data. In the example of FIG. 42, search result set data 427, 428, and 429 are displayed as search results.

本実施形態では、複数のスケール４２６の少なくとも何れかにおいて、スライダ４２６ａが操作されてスコアが変更されると、スコアの変更に伴って、結果表示欄４２３に表示される検索結果集合データも変更される。 In this embodiment, when the score is changed by operating the slider 426a on at least one of the plurality of scales 426, the search result set data displayed in the result display field 423 is also changed in accordance with the change in score. be.

このように、本実施形態では、クエリに基づく検索語群から推定された印象語のスコアを変更して検索を行う際に、印象語のスコアの変更に応じて適切な数の検索結果集合データを表示させることができる。 As described above, in the present embodiment, when performing a search by changing the score of impression words estimated from a group of search words based on a query, an appropriate number of search result set data is obtained according to the change in the score of the impression words. can be displayed.

尚、本実施形態では、印象語のスコアを変更するための表示形態として、スライダ値が付されたスケール上でスコアを変更する形態を説明したが、表示形態はこれに限定されない。本実施形態では、スコアの変更を受け付け、変更後のスコアと対応する検索結果集合データを表示させる形態であれば、どのような表示態様であっても良い。例えば、スコアを入力する入力欄を設け、この入力欄に対するスコアの入力によって、印象語のスコアの変更を受け付けるものとしても良い。 In this embodiment, as a display form for changing the impression word score, a form in which the score is changed on a scale with a slider value has been described, but the display form is not limited to this. In this embodiment, any display mode may be used as long as it accepts a change in score and displays the search result set data corresponding to the score after change. For example, an input field for inputting a score may be provided, and the impression word score may be changed by inputting the score in this input field.

また、本実施形態では、スコアの分布に偏りがある場合には、スライダで選択される範囲を限定したり、スケールの表示態様をスコアの分布に合わせた色合いで表示させても良い。 In addition, in the present embodiment, if the distribution of scores is biased, the range selected by the slider may be limited, or the display mode of the scale may be displayed in a shade that matches the distribution of scores.

図４３は、第六の実施形態の表示の他の例を示す図である。図４３（ａ）では、スライダで選択される範囲が限定されたスライダの表示例を示しており、図４３（ｂ）は、スコアの分布に合わせた色合いでスケールを表示させた場合の表示例を示している。 FIG. 43 is a diagram showing another example of display according to the sixth embodiment. FIG. 43(a) shows a display example of the slider in which the range selected by the slider is limited, and FIG. 43(b) shows a display example when the scale is displayed in a color matching the score distribution. is shown.

図４３（ａ）では、出力軸「プレミア」と対応するスケール４３１－１、出力軸「隠れ家」と対応するスケール４３１－２、出力軸「ヘルシー」と対応するスケール４３１－３が表示されている。 In FIG. 43(a), a scale 431-1 corresponding to the output axis "Premier", a scale 431-2 corresponding to the output axis "Retreat", and a scale 431-3 corresponding to the output axis "Healthy" are displayed. .

スケール４３１－１では、スライダで選択することができない選択不可領域４３２－１と、スライダで選択することができる選択領域４３３－１とが含まれる。この場合、スケール４３１－１は、選択不可領域４３２－１が示すスコアの範囲には、検索結果集合データが存在せず、選択領域４３３－１が示すスコアの範囲に検索結果集合データが分布していることを示している。 The scale 431-1 includes a non-selectable area 432-1 that cannot be selected with the slider and a selection area 433-1 that can be selected with the slider. In this case, the scale 431-1 indicates that there is no search result set data in the score range indicated by the non-selectable area 432-1, and the search result set data is distributed in the score range indicated by the selection area 433-1. indicates that

この場合、スケール算出部２６１は、選択領域４３３－１のどの位置がスライダに選択されたとしても、出力される検索結果集合データの数が同じか、又はほぼ同数となるようにすれば良い。 In this case, the scale calculation unit 261 should make the number of output search result set data the same or substantially the same no matter which position of the selection area 433-1 is selected by the slider.

同様に、スケール４３１－２には、選択不可領域４３２－２と選択領域４３３－２とが含まれ、スケール４３１－３には、選択不可領域４３２－３と選択領域４３３－３とが含まれる。 Similarly, the scale 431-2 includes a non-selectable area 432-2 and a selective area 433-2, and the scale 431-3 includes a non-selectable area 432-3 and a selective area 433-3. .

尚、スケール４３１－１、４３１－２、４３１－３において、選択不可領域４３２－１、４３２－２、４３２－３は、選択領域４３３－１、４３３－２、４３３－３と異なる態様で表示されも良い。 In the scales 431-1, 431-2 and 431-3, the non-selectable areas 432-1, 432-2 and 432-3 are displayed in a manner different from the selected areas 433-1, 433-2 and 433-3. It's good to be.

本実施形態では、図４３（ａ）のように、スケールの表示態様によって、スライダにより選択されるスコアの範囲が限定されていることをユーザに視認させることができる。 In this embodiment, as shown in FIG. 43A, the display mode of the scale allows the user to visually recognize that the score range selected by the slider is limited.

図４３（ｂ）では、出力軸「プレミア」と対応するスケール４３４－１、出力軸「隠れ家」と対応するスケール４３４－２、出力軸「ヘルシー」と対応するスケール４３４－３が表示されている。 In FIG. 43(b), a scale 434-1 corresponding to the output axis "Premier", a scale 434-2 corresponding to the output axis "Retreat", and a scale 434-3 corresponding to the output axis "Healthy" are displayed. .

図４３（ｂ）の例では、各スケールにおける選択領域４３５－１、４３５－２、４３５－３において、検索結果集合データの数が多い範囲から領域から順に、赤色の濃度を高くして表示させている。 In the example of FIG. 43(b), the selected areas 435-1, 435-2, and 435-3 on each scale are displayed with red density increasing in order from the area with the largest number of search result set data. ing.

本実施形態では、このようにスケールを表示させることで、ユーザに対して、検索結果集合データの数が多く存在しているスコアの範囲を視認させることができる。この場合、ユーザは、他の検索結果集合データを表示させたい場合には、濃い赤色の領域をスライダで選択すれば良い。 In the present embodiment, by displaying the scale in this manner, the user can visually recognize the range of scores in which a large number of sets of search result data exist. In this case, if the user wishes to display other search result set data, the user can select the dark red area with the slider.

このように、本実施形態によれば、検索結果集合データを表示させる際の項目となる印象語と、クエリとの関連の強さを示すスコアを変更した場合でも、適切な数の検索結果集合データを出力することができる。 As described above, according to the present embodiment, even when the impression word, which is an item for displaying the search result set data, and the score indicating the strength of the relationship with the query are changed, an appropriate number of search result sets can be obtained. Data can be output.

（第七の実施形態）
以下に図面を参照して第七の実施形態について説明する。第七の実施形態は、スライダ値と対応づけるスコアの範囲を限定する点が、第六の実施形態と相違する。よって、以下の第七の実施形態の説明では、第六の実施形態との相違点について説明し、第六の実施形態と同様の機能構成を有するものには、第六の実施形態の説明で用いた符号と同様の符号を付与し、その説明を省略する。 (Seventh embodiment)
A seventh embodiment will be described below with reference to the drawings. The seventh embodiment differs from the sixth embodiment in limiting the range of scores associated with slider values. Therefore, in the following description of the seventh embodiment, the differences from the sixth embodiment will be described. The same reference numerals as the used reference numerals are given, and the explanation thereof is omitted.

図４４は、第七の実施形態のスコアとスライダ値との対応付けを説明する図である。 FIG. 44 is a diagram for explaining the association between scores and slider values according to the seventh embodiment.

図４４では、横軸が出力軸「プレミア」のスコアを示しており、縦軸が、ベクトルマッチングの結果が閾値以上となった文書ＩＤの数を示す。また、図４４の例では、出力軸「プレミア」のスコアの初期値は「６」である。初期値は、クエリと最も関連が強いとされた文書ＩＤと対応する印象語「プレミア」のスコアである。 In FIG. 44, the horizontal axis indicates the score of the output axis "premium", and the vertical axis indicates the number of document IDs for which the result of vector matching is equal to or greater than the threshold. In addition, in the example of FIG. 44, the initial value of the score of the output axis "premium" is "6". The initial value is the score of the impression word "premier" corresponding to the document ID considered to be most closely related to the query.

そこで、図４４の例では、初期値を中心とした所定の範囲Ｈをスコアの範囲として、スライダ値と、スコアの変動幅とを対応づける。尚、所定の範囲Ｈは、予め設定されるものであって良い。 Therefore, in the example of FIG. 44, a predetermined range H centered on the initial value is defined as the score range, and the slider value and the score fluctuation range are associated with each other. Note that the predetermined range H may be set in advance.

具体的には、本実施形態では、スコアの最小値を「４」、最大値「８」として、スコア「４～８」の範囲を、スケール上で表現する。 Specifically, in this embodiment, the range of scores "4 to 8" is represented on the scale, with the minimum score being "4" and the maximum score being "8".

図４５は、第七の実施形態の検索処理部の処理を説明するフローチャートである。本実施形態のスケール算出部２６１は、出力軸毎に、スコアの範囲（最小値から最大値まで）と、ヒストグラムと、ヒストグラムの生成に際して取得された文書ＩＤの数Ｘを取得する（ステップＳ４５０１）。 FIG. 45 is a flow chart for explaining the processing of the search processing section of the seventh embodiment. The scale calculation unit 261 of this embodiment acquires the score range (from the minimum value to the maximum value), the histogram, and the number X of document IDs acquired when generating the histogram for each output axis (step S4501). .

続いて、スケール算出部２６１は、出力軸毎に、予め設定された所定の範囲のスコアと対応する検索結果集合データの数Ｙを取得する（ステップＳ４５０２）。 Subsequently, the scale calculator 261 acquires the number Y of search result set data corresponding to the score within a predetermined range set in advance for each output axis (step S4502).

図４５のステップＳ４５０３とステップＳ４５０４の処理は、図３９のステップＳ３９０２とステップＳ３９０３の処理と同様であるから説明を省略する。 The processing in steps S4503 and S4504 in FIG. 45 is the same as the processing in steps S3902 and S3903 in FIG. 39, so description thereof will be omitted.

続いて、スケール算出部２６１は、選択された出力軸について、スライダ値と対応する文書ＩＤの数が、Ｙ／Ｎとなるように、対応テーブルを作成する（ステップＳ４５０５）。 Subsequently, the scale calculation unit 261 creates a correspondence table so that the number of document IDs corresponding to the slider value is Y/N for the selected output axis (step S4505).

図４５のステップＳ４５０６の処理は、図３９のステップＳ３９０５の処理と同様であるから説明を省略する。 Since the processing in step S4506 in FIG. 45 is the same as the processing in step S3905 in FIG. 39, the description is omitted.

図４６は、第七の実施形態の対応テーブルの一例を示す図である。図４６に示す対応テーブル４６０では、スライダ値０～５に対して、スコア４～８が対応づけられている。 FIG. 46 is a diagram showing an example of a correspondence table according to the seventh embodiment. In the correspondence table 460 shown in FIG. 46, scores 4-8 are associated with slider values 0-5.

よって、本実施形態では、出力軸「プレミア」のスケールにおいて、スライダ値０～５まで移動させても、スコアが４未満又はスコアが８より大きい検索結果集合データは出力されない。 Therefore, in this embodiment, even if the slider value is moved from 0 to 5 on the scale of the output axis "premium", search result set data with a score of less than 4 or a score of more than 8 is not output.

本実施形態では、このように、スライダ値と対応づけるスコアの範囲を限定することで、入力されたクエリと関連の低い検索結果集合データが出力されることを抑制できる。 In this embodiment, by limiting the range of scores associated with the slider value in this way, it is possible to suppress the output of search result aggregate data that is less relevant to the input query.

開示の技術では、以下に記載する付記のような形態が考えられる。
（付記１）
コンピュータによる検索方法であって、該コンピュータが、
クエリに基づき、検索語群を特定し、
検索対象データより、前記検索語群を用いて検索結果集合データを取得し、
前記クエリより、第１特徴語群を抽出し、
前記検索結果集合データを、前記第１特徴語群を項目として用いて表示させる、検索方法。
（付記２）
前記検索結果集合データから第２特徴語群を抽出し、
前記検索結果集合データを、前記第１特徴語群及び前記第２特徴語群を項目として用いて表示させる、付記１記載の検索方法。
（付記３）
前記検索語群に含まれる単語毎に、印象語群に含まれる各印象語と前記単語との関連の強さを示す値と、前記関連の強さを示す値のばらつきを示す値とを取得し、
前記第１特徴語群に、前記ばらつきを示す値が閾値より大きい印象語が含まれる場合に、前記第１特徴語群の候補となる第３特徴語群を通知する、付記１記載の検索方法。
（付記４）
前記ばらつきを示す値が閾値より大きい印象語と、前記第３特徴語群とを対応付けて表示させる、付記３記載の検索方法。
（付記５）
前記検索語群に含まれる単語を分散表現によりベクトルに変換し、
前記ベクトルが入力されると、印象語群と、前記印象語群に含まれる各印象語と前記単語との関連の強さを示す値と、を出力する変換モデルにより、前記検索語群に含まれる単語毎に、前記各印象語との関連の強さを示す値を取得し、
前記印象語群から、前記単語毎の前記関連の強さを示す値に応じて、前記第１特徴語群を抽出する、付記１乃至４の何れか一項に記載の検索方法。
（付記６）
前記第１特徴語群は、
前記印象語群に含まれる印象語のうち、前記単語毎の前記関連の強さを示す値の最大値が大きい印象語から順に、所定個数抽出された印象語である、付記５記載の検索方法。
（付記７）
前記印象語群から、
前記第１特徴語群以外の第２特徴語群を抽出し、前記検索結果集合データを、前記第１特徴語群及び前記第２特徴語群を項目として用いて表示させる、付記５又は６記載の検索方法。
（付記８）
前記印象語群に含まれる印象語のうち、前記第１特徴語群として抽出された印象語よりも、前記単語毎の前記関連の強さを示す値の最大値が小さい印象語を抽出し、
抽出された前記印象語のうち、前記単語毎の前記関連の強さを示す値の分散が大きい印象語から順に所定個数抽出された印象語を、前記第２特徴語群とする、付記７記載の検索方法。
（付記９）
前記検索対象データに含まれる文書データ毎に、前記文書データに含まれる単語群を特定し、
前記単語群に含まれる単語から変換されたベクトルと、前記変換モデルと、に基づき、前記単語毎に、前記各印象語との関連の強さを示す値を取得し、
前記単語毎の前記各印象語との関連の強さを示す値から、前記文書データと前記各印象語との関連の強さを示す値を取得し、
前記文書データが前記検索結果集合データとして取得されると、前記文書データと前記各印象語との関連の強さを示す値を参照し、前記文書データと、前記第１特徴語群とされた印象語との関連の強さを示す値を、前記項目の値として表示させる、付記５乃至８の何れか一項に記載の検索方法。
（付記１０）
前記ばらつきを示す値は、前記関連の強さを示す値の標準偏差であり、
前記第３特徴語群は、
前記印象語群のうち、前記第１特徴語群以外の印象語であって、前記関連の強さを示す値が、前記標準偏差が所定の閾値より大きい印象語の前記関連の強さを示す値から前記標準偏差を減算した値より大きくした値となる印象語である、付記９記載の検索方法。
（付記１１）
前記第３特徴語群に含まれる印象語の選択を受け付け、
前記検索結果集合データを、前記第１特徴語群のうち、前記標準偏差が所定の閾値以下の印象語と、前記第３特徴語群から選択された前記印象語と、を項目として用いて表示させる、付記１０記載の検索方法。
（付記１２）
前記第１特徴語群において、表示を維持させる指定がされた印象語が存在する場合、
前記指定された印象語と、前記第３特徴語群から選択された前記印象語と、を前記項目に含める、付記１１記載の検索方法。
（付記１３）
前記表示を維持させる印象語を指定する操作を受け付けて、前記表示を維持させる印象語と、各単語との関連の強さを示す値を補正するための第一の補正値と、を対応付けて記憶部に保持させ、
前記第３特徴語群から印象語を選択する操作を受け付けて、前記第３特徴語群から選択された印象語と、各単語との関連の強さを示す値を補正する第二の補正値と、を対応付けて記憶部に保持させる、付記１２記載の検索方法。
（付記１４）
前記第１特徴語群として抽出された印象語毎に、
複数の前記検索結果集合データに対し、前記検索結果集合データと前記印象語との関連の強さを示す値の分布を算出し、
前記分布に基づき、前記印象語との関連の強さを示す値の変動幅と対応する前記検索結果集合データの数が所定数となるように、前記印象語との関連の強さを示す値を示すスケールを表示させる、付記５乃至１３の何れか一項に記載の検索方法。
（付記１５）
前記スケールには、前記印象語との関連の強さを示す値の変更を受け付けるスライダが共に表示されており、
前記スケールにおいて前記スライダが停止する位置と、前記印象語との関連の強さを示す値の変動幅とを対応付ける、付記１４記載の検索方法。
（付記１６）
前記単語毎の前記関連の強さを示す値の最大値を初期値とし、
前記スケールにおいて、前記初期値と対応する位置に前記スライダを表示させる、付記１５記載の検索方法。
（付記１７）
前記分布に基づき、
前記印象語との関連の強さを示す値の範囲の一部と、前記検索結果集合データの数とを対応付ける、付記１４乃至１６の何れか一項に記載の検索方法。
（付記１８）
クエリに基づき、検索語群を特定し、
検索対象データより、前記検索語群を用いて検索結果集合データを取得し、
前記クエリより、第１特徴語群を抽出し、
前記検索結果集合データを、前記第１特徴語群を項目として用いて表示させる、処理を、コンピュータに実行させる検索プログラム。
（付記１９）
クエリに基づき、検索語群を特定する特定部と、
検索対象データより、前記検索語群を用いて検索結果集合データを取得する検索部と、
前記クエリより、第１特徴語群を抽出する抽出部と、
前記検索結果集合データを、前記第１特徴語群を項目として用いて表示させる出力部と、を有する検索装置。 The technology disclosed herein can take forms such as those described below.
(Appendix 1)
A method of searching by a computer, the computer comprising:
Identify search terms based on the query,
Obtaining search result set data from the search target data using the search term group,
Extracting a first feature word group from the query,
A search method, wherein the search result set data is displayed using the first characteristic word group as items.
(Appendix 2)
Extracting a second feature word group from the search result set data,
The search method according to appendix 1, wherein the search result set data is displayed using the first characteristic word group and the second characteristic word group as items.
(Appendix 3)
Obtaining, for each word included in the search word group, a value indicating the strength of the relationship between each impression word included in the impression word group and the word, and a value indicating the variation in the value indicating the strength of the relationship. death,
The search method according to appendix 1, wherein when the first feature word group includes an impression word whose value indicating the variation is greater than a threshold, a third feature word group that is a candidate for the first feature word group is notified. .
(Appendix 4)
The search method according to appendix 3, wherein the third characteristic word group is displayed in association with the impression word whose value indicating the variation is greater than a threshold value.
(Appendix 5)
Converting the words included in the search term group into a vector by distributed representation,
When the vector is input, a conversion model that outputs an impression word group and a value indicating the strength of the relationship between each impression word included in the impression word group and the word includes obtaining a value indicating the strength of the relationship with each impression word for each word,
5. The search method according to any one of Appendices 1 to 4, wherein the first characteristic word group is extracted from the impression word group in accordance with a value indicating the strength of association for each word.
(Appendix 6)
The first feature word group is
The search method according to appendix 5, wherein a predetermined number of impression words are extracted from among the impression words included in the impression word group in descending order of the maximum value of the value indicating the strength of association for each word. .
(Appendix 7)
From the impression word group,
Supplementary note 5 or 6, wherein a second feature word group other than the first feature word group is extracted, and the search result set data is displayed using the first feature word group and the second feature word group as items. search method.
(Appendix 8)
extracting, from among the impression words included in the impression word group, impression words having a smaller maximum value indicating the strength of association for each word than the impression words extracted as the first characteristic word group;
Supplementary note 7, wherein a predetermined number of impression words extracted from the extracted impression words in descending order of the variance of the value indicating the strength of the relationship for each word are defined as the second characteristic word group. search method.
(Appendix 9)
identifying a word group included in the document data for each document data included in the search target data;
obtaining, for each word, a value indicating the strength of the relationship with each of the impression words, based on a vector converted from the words included in the word group and the conversion model;
obtaining a value indicating the strength of the relationship between the document data and the impression word from the value indicating the strength of the relationship between each word and the impression word;
When the document data is acquired as the search result set data, a value indicating the strength of the relationship between the document data and each of the impression words is referred to, and the document data and the first feature word group are determined. 9. The search method according to any one of Appendices 5 to 8, wherein a value indicating strength of relationship with the impression word is displayed as the value of the item.
(Appendix 10)
The value indicating the variation is the standard deviation of the value indicating the strength of the relationship,
The third feature word group is
Of the impression word group, the impression words other than the first characteristic word group, and the value indicating the strength of association indicates the strength of association of the impression words whose standard deviation is greater than a predetermined threshold. The retrieval method according to appendix 9, wherein the impression word is a value larger than the value obtained by subtracting the standard deviation from the value.
(Appendix 11)
Receiving selection of impression words included in the third characteristic word group,
The search result set data is displayed using, as items, the impression words having the standard deviation equal to or less than a predetermined threshold among the first feature word group and the impression words selected from the third feature word group. The search method according to Supplementary Note 10.
(Appendix 12)
In the first feature word group, if there is an impression word specified to be displayed,
12. The search method according to attachment 11, wherein the designated impression word and the impression word selected from the third characteristic word group are included in the item.
(Appendix 13)
Receiving an operation of designating the impression words whose display is to be maintained, and associating the impression words whose display is to be maintained with a first correction value for correcting the value indicating the strength of the relationship with each word and store it in the memory,
A second correction value for receiving an operation of selecting an impression word from the third characteristic word group and correcting a value indicating the strength of relationship between the impression word selected from the third characteristic word group and each word. and are stored in the storage unit in association with each other.
(Appendix 14)
For each impression word extracted as the first feature word group,
calculating a distribution of values indicating a strength of association between the search result set data and the impression word for a plurality of the search result set data;
A value indicating the strength of the relationship with the impression word, based on the distribution, such that the variation range of the value indicating the strength of the relationship with the impression word and the number of the search result set data corresponding to a predetermined number. 14. The search method according to any one of Appendices 5 to 13, wherein a scale indicating is displayed.
(Appendix 15)
The scale is displayed together with a slider for receiving a change in a value indicating the strength of the relationship with the impression word,
15. The search method according to appendix 14, wherein a position where the slider stops on the scale is associated with a fluctuation range of a value indicating strength of relation to the impression word.
(Appendix 16)
setting the maximum value of the value indicating the strength of the association for each word as an initial value;
16. The search method according to appendix 15, wherein the slider is displayed on the scale at a position corresponding to the initial value.
(Appendix 17)
Based on said distribution,
17. The search method according to any one of attachments 14 to 16, wherein a part of a range of values indicating strength of association with said impression word is associated with the number of said search result set data.
(Appendix 18)
Identify search terms based on the query,
Obtaining search result set data from the search target data using the search term group,
Extracting a first feature word group from the query,
A search program that causes a computer to execute a process of displaying the search result set data using the first characteristic word group as items.
(Appendix 19)
an identification unit that identifies a search term group based on the query;
a search unit that acquires search result set data using the search term group from the search target data;
an extraction unit that extracts a first feature word group from the query;
and an output unit for displaying the search result set data using the first feature word group as items.

本発明は、具体的に開示された実施形態に限定されるものではなく、特許請求の範囲から逸脱することなく、種々の変形や変更が可能である。 The invention is not limited to the specifically disclosed embodiments, and various modifications and changes are possible without departing from the scope of the claims.

１００検索システム
１１１検索語群
１１２意味ベクトル
１１３単語毎印象語スコア
１１４、２３１印象語スコアリスト
２００、２００Ａ検索装置
２１０検索対象インデックス記憶部
２１１インデックス情報
２２０印象語マップ記憶部
２２１印象語マップ
２２２補正リスト
２３０検索対象スコア記憶部
２３１印象語スコアリスト
２４０、２４０Ａ検索処理部
２４１入力受付部
２４２解析部
２４３検索部
２４４印象語スコア出力部
２４５軸決定部
２４６出力部
２４９候補通知部
２５０、２５０Ａ変換モデル
２５１軸変更部
２５２補正リスト生成部
２５３スコア補正部
３００端末装置 100 Search System 111 Search Word Group 112 Semantic Vector 113 Word-by-Word Impression Word Score 114, 231 Impression Word Score List 200, 200A Search Device 210 Search Target Index Storage Unit 211 Index Information 220 Impression Word Map Storage Unit 221 Impression Word Map 222 Correction List 230 Search target score storage unit 231 Impression word score list 240, 240A Search processing unit 241 Input reception unit 242 Analysis unit 243 Search unit 244 Impression word score output unit 245 Axis determination unit 246 Output unit 249 Candidate notification unit 250, 250A Conversion model 251 Axis change unit 252 Correction list generation unit 253 Score correction unit 300 Terminal device

Claims

コンピュータによる検索方法であって、該コンピュータが、
クエリに基づき、検索語群を特定し、
検索対象データより、前記検索語群を用いて検索結果集合データを取得し、
単語と印象語との関連の強さを示す値と、前記関連の強さを示す値のばらつきを示す値とが対応付けられた印象語マップを用いて作成された変換モデルに対し、前記検索語群に含まれる単語を分散表現したベクトルを入力し、前記検索語群に含まれる単語毎に、前記印象語マップに含まれる各印象語との関連の強さを示す値と、前記ばらつきを示す値とを取得し、
前記関連の強さを示す値に応じて、第１特徴語群となる印象語を抽出し、
前記検索結果集合データを、前記第１特徴語群を項目として用いて表示させ、
前記第１特徴語群に、前記ばらつきを示す値が閾値より大きい印象語が含まれる場合に、前記ばらつきを示す値が閾値より大きい印象語を特定し、
特定された印象語の前記ばらつきを示す値を用いて、前記印象語マップに含まれる印象語のうち、前記第１特徴語群に含まれない印象語の中から、前記第１特徴語群の代わりに前記項目として用いることが可能な印象語である第３特徴語群を特定し、
前記第３特徴語群を、前記第１特徴語群と入れ替えることが可能であることを示す通知を表示させる、検索方法。 A method of searching by a computer, the computer comprising:
Identify search terms based on the query,
Obtaining search result set data from the search target data using the search term group,
The conversion model created using an impression word map in which a value indicating the strength of the relationship between the word and the impression word and a value indicating the variation of the value indicating the strength of the relationship are associated with each other; A vector that expresses the words included in the word group in a distributed manner is input, and for each word included in the search word group, a value indicating the strength of the relationship with each impression word included in the impression word map and the variation are calculated. get the indicated value and
extracting impression words that form a first characteristic word group according to the value indicating the strength of the relationship;
displaying the search result set data using the first characteristic word group as items ;
if the first feature word group includes impression words whose value indicating the variation is greater than a threshold, specifying the impression word whose value indicating the variation is greater than the threshold;
Using the value indicating the variation of the identified impression words, among the impression words included in the impression word map, impression words not included in the first feature word group are selected from among the impression words of the first feature word group. Identifying a third characteristic word group, which is an impression word that can be used as the item instead,
A search method comprising: displaying a notification indicating that the third characteristic word group can be replaced with the first characteristic word group .

前記検索結果集合データから抽出された単語を第２特徴語群とし、
前記検索結果集合データを、前記第１特徴語群及び前記第２特徴語群を項目として用いて表示させる、請求項１記載の検索方法。 Words extracted from the search result set data are defined as a second feature word group ,
2. The retrieval method according to claim 1, wherein said retrieval result set data is displayed using said first characteristic word group and said second characteristic word group as items.

前記第１特徴語群と、前記第３特徴語群とを対応付けて表示させる、請求項１又は２記載の検索方法。 3. The retrieval method according to claim 1 , wherein said first characteristic word group and said third characteristic word group are displayed in association with each other.

前記印象語マップから、
前記第１特徴語群以外の印象語であって、予め選択された印象語を抽出して第２特徴語群とし、前記検索結果集合データを、前記第１特徴語群及び前記第２特徴語群を項目として用いて表示させる、請求項１記載の検索方法。 From the impression word map ,
Preselected impression words other than the first feature word group are extracted and used as a second feature word group, and the search result set data is combined with the first feature word group and the second feature word group. 2. The retrieval method according to claim 1 , wherein the word groups are used as items for display.

前記印象語マップに含まれる印象語のうち、前記第１特徴語群として抽出された印象語よりも、前記単語毎の前記関連の強さを示す値の最大値が小さい印象語を抽出し、
抽出された前記印象語と前記検索結果集合データの関連の強さを示す値を、前記検索結果集合データ毎に算出し、
前記検索結果集合データの関連の強さを示す値の分散が大きい印象語から順に所定個数抽出された印象語を、第２特徴語群とし、
前記第１特徴語群及び前記第２特徴語群を項目として用いて表示させる、請求項１記載の検索方法。 extracting, from among the impression words included in the impression word map , impression words having a smaller maximum value indicating the strength of association for each word than the impression words extracted as the first characteristic word group;
calculating a value indicating the strength of the relationship between the extracted impression word and the search result set data for each of the search result set data;
A second feature word group is defined by a predetermined number of impression words extracted in descending order of variance of values indicating strength of association in the search result set data ,
2. The retrieval method according to claim 1 , wherein said first characteristic word group and said second characteristic word group are used as items and displayed .

前記第１特徴語群として抽出された印象語毎に、
複数の前記検索結果集合データに対し、前記検索結果集合データと前記第１特徴語群として抽出された印象語との関連の強さを示す値の分布を算出し、
前記分布に基づき、前記第１特徴語群として抽出された印象語との関連の強さを示す値の変動幅と対応する前記検索結果集合データの数が所定数となるように、前記第１特徴語群として抽出された印象語との関連の強さを示す値を示すスケールを表示させ、
前記スケールにおいて、指定された前記第１特徴語群として抽出された印象語との関連の強さを示す値と対応した検索結果データを表示させる、請求項１乃至５の何れか一項に記載の検索方法。 For each impression word extracted as the first feature word group,
calculating a distribution of values indicating the strength of association between the search result set data and the impression words extracted as the first feature word group, for a plurality of the search result set data;
Based on the distribution, the number of search result set data corresponding to the fluctuation range of the value indicating the strength of the relationship with the impression word extracted as the first characteristic word group is set to a predetermined number . display a scale indicating a value indicating the strength of the relationship with the impression words extracted as the characteristic word group ;
6. The scale according to any one of claims 1 to 5 , wherein search result data corresponding to a value indicating strength of relationship with the impression word extracted as the specified first characteristic word group is displayed on the scale. search method.

前記スケールには、前記第１特徴語群として抽出された印象語との関連の強さを示す値の変更を受け付けるスライダが共に表示されており、
前記スケールにおいて前記スライダが停止する位置と、前記第１特徴語群として抽出された印象語との関連の強さを示す値の変動幅とを対応付け、
前記スライダの位置が変更されると、前記スライダの位置と対応し前記第１特徴語群として抽出された印象語との関連の強さを示す値を変更し、
変更された前記第１特徴語群として抽出された印象語との関連の強さを示す値と対応した検索結果データを表示させる、請求項６記載の検索方法。 The scale is also displayed with a slider for accepting a change in a value indicating a strength of association with the impression word extracted as the first characteristic word group ,
associating a position where the slider stops on the scale with a variation range of a value indicating a strength of association with the impression word extracted as the first characteristic word group ;
when the position of the slider is changed, changing a value indicating the strength of the relationship between the impression word corresponding to the position of the slider and extracted as the first feature word group ;
7. The retrieval method according to claim 6 , wherein retrieval result data corresponding to a value indicating strength of relationship with the impression word extracted as the changed first characteristic word group is displayed.

前記単語毎の前記関連の強さを示す値の最大値を初期値とし、
前記スケールにおいて、前記初期値と対応する位置に前記スライダを表示させる、請求項７記載の検索方法。 setting the maximum value of the value indicating the strength of the association for each word as an initial value;
8. The retrieval method according to claim 7 , wherein said slider is displayed on said scale at a position corresponding to said initial value.

クエリに基づき、検索語群を特定し、
検索対象データより、前記検索語群を用いて検索結果集合データを取得し、
単語と印象語との関連の強さを示す値と、前記関連の強さを示す値のばらつきを示す値とが対応付けられた印象語マップを用いて作成された変換モデルに対し、前記検索語群に含まれる単語を分散表現したベクトルを入力し、前記検索語群に含まれる単語毎に、前記印象語マップに含まれる各印象語との関連の強さを示す値と、前記ばらつきを示す値とを取得し、
前記関連の強さを示す値に応じて、第１特徴語群となる印象語を抽出し、
前記検索結果集合データを、前記第１特徴語群を項目として用いて表示させ、
前記第１特徴語群に、前記ばらつきを示す値が閾値より大きい印象語が含まれる場合に、前記ばらつきを示す値が閾値より大きい印象語を特定し、
特定された印象語の前記ばらつきを示す値を用いて、前記印象語マップに含まれる印象語のうち、前記第１特徴語群に含まれない印象語の中から、前記第１特徴語群の代わりに前記項目として用いることが可能な印象語である第３特徴語群を特定し、
前記第３特徴語群を、前記第１特徴語群と入れ替えることが可能であることを示す通知を表示させる、処理を、コンピュータに実行させる検索プログラム。 Identify search terms based on the query,
Obtaining search result set data from the search target data using the search term group,
The conversion model created using an impression word map in which a value indicating the strength of the relationship between the word and the impression word and a value indicating the variation of the value indicating the strength of the relationship are associated with each other; A vector that expresses the words included in the word group in a distributed manner is input, and for each word included in the search word group, a value indicating the strength of the relationship with each impression word included in the impression word map and the variation are calculated. get the indicated value and
extracting impression words that form a first characteristic word group according to the value indicating the strength of the relationship;
displaying the search result set data using the first characteristic word group as items ;
if the first feature word group includes impression words whose value indicating the variation is greater than a threshold, specifying the impression word whose value indicating the variation is greater than the threshold;
Using the value indicating the variation of the identified impression words, among the impression words included in the impression word map, impression words not included in the first feature word group are selected from among the impression words of the first feature word group. Identifying a third characteristic word group, which is an impression word that can be used as the item instead,
A search program that causes a computer to execute a process of displaying a notification indicating that the third characteristic word group can be replaced with the first characteristic word group .

クエリに基づき、検索語群を特定する特定部と、
検索対象データより、前記検索語群を用いて検索結果集合データを取得し、
単語と印象語との関連の強さを示す値と、前記関連の強さを示す値のばらつきを示す値とが対応付けられた印象語マップを用いて作成された変換モデルに対し、前記検索語群に含まれる単語を分散表現したベクトルを入力し、前記検索語群に含まれる単語毎に、前記印象語マップに含まれる各印象語との関連の強さを示す値と、前記ばらつきを示す値とを取得し、前記関連の強さを示す値に応じて、第１特徴語群となる印象語を抽出する抽出部と、
前記検索結果集合データを、前記第１特徴語群を項目として用いて表示させ、前記第１特徴語群に、前記ばらつきを示す値が閾値より大きい印象語が含まれる場合に、前記ばらつきを示す値が閾値より大きい印象語を特定し、特定された印象語の前記ばらつきを示す値を用いて、前記印象語マップに含まれる印象語のうち、前記第１特徴語群に含まれない印象語の中から、前記第１特徴語群の代わりに前記項目として用いることが可能な印象語である第３特徴語群を特定し、前記第３特徴語群を、前記第１特徴語群と入れ替えることが可能であることを示す通知を表示させる、出力部と、を有する検索装置。 an identification unit that identifies a search term group based on the query;
Obtaining search result set data from the search target data using the search term group,
The conversion model created using an impression word map in which a value indicating the strength of the relationship between the word and the impression word and a value indicating the variation of the value indicating the strength of the relationship are associated with each other; A vector that expresses the words included in the word group in a distributed manner is input, and for each word included in the search word group, a value indicating the strength of the relationship with each impression word included in the impression word map and the variation are calculated. an extracting unit that acquires a value indicating the strength of the association and extracts impression words that form a first characteristic word group according to the value indicating the strength of association;
The search result set data is displayed using the first feature word group as items, and when the first feature word group includes an impression word whose value indicating the variation is larger than a threshold, the variation is determined. Impression words whose indicated values are larger than a threshold are identified, and impression words not included in the first characteristic word group are identified among the impression words included in the impression word map using the values indicating the variation of the identified impression words. A third characteristic word group, which is an impression word that can be used as the item in place of the first characteristic word group, is specified from among the words, and the third characteristic word group is identified as the first characteristic word group. and an output unit for displaying a notification indicating that the replacement is possible .