JP2012256282A

JP2012256282A - Sensitivity analysis system and program

Info

Publication number: JP2012256282A
Application number: JP2011130042A
Authority: JP
Inventors: Osamu Oshima; 修大島
Original assignee: Nomura Research Institute Ltd
Current assignee: Nomura Research Institute Ltd
Priority date: 2011-06-10
Filing date: 2011-06-10
Publication date: 2012-12-27
Anticipated expiration: 2031-06-10
Also published as: JP4796664B1

Abstract

PROBLEM TO BE SOLVED: To implement a technology which presents a huge amount of documents described by each end user by classifying and organizing them so as to be easy to see for a person in charge of analysis.SOLUTION: A sensibility analysis system 10 includes: a negative/positive determination part 22 which gives a negative/positive polarity tag to a sensibility term and gives a negative expression tag to a negative expression of each sentence, and determines whether each sentence is a negative evaluation sentence or a positive evaluation sentence based on the type of polarity tag and the presence or absence of the negative expression tag; a subject recognition part 24 for recognizing a subject of each sentence; an emotion extraction part 28 for associating the corresponding emotion type tag with the emotion expression of each sentence; and an analysis result evaluation part 21 for associating corresponding evaluation axes with sentences when the subject of each sentence falls under the similar expression stored in evaluation axis similar expression storage means 32, categorizing each sentence for every plurality of similar sentence groups provided with common evaluation axis, emotion type, and sensibility term, recognizing a similar sentence having the least number of characters among each similar sentence group as a representative opinion, and generating a representative opinion list 63 listed with a plurality of representative opinions.

Description

この発明は感性分析システム及びプログラムに係り、特に、電子掲示板上の記事データやアンケート記事データなどの電子化された文書データから、特定の商品やサービスに関するエンドユーザの主観的な評価や意見を自動的に収集する技術に関する。 The present invention relates to a sensitivity analysis system and program, and in particular, automatically evaluates end users' subjective evaluations and opinions regarding specific products and services from electronic document data such as article data and questionnaire article data on an electronic bulletin board. Related to collective technology.

具体的な商品やサービス（以下「商品等」）に対するエンドユーザの主観的な評価（感想）は、現行商品等の改良や次世代商品等の開発にとって極めて重要な指針となるため、各企業はエンドユーザから集めたアンケート結果を様々な観点から分析し、あるいはネット上の電子掲示板にアクセスし、自社商品等に対する評価をチェックすることを行っている。 Each company's subjective evaluation (impressions) on specific products and services (hereinafter “products, etc.”) is an extremely important guideline for improving current products and developing next-generation products. The results of questionnaires collected from end users are analyzed from various viewpoints, or an electronic bulletin board on the Internet is accessed to check the evaluation of their products.

また、このような人手による分析作業の効率化を図るため、電子化された文章（アンケート情報）をコンピュータを用いて自動解析することにより、特定の商品等に対するエンドユーザの評価を抽出する技術が既に提案されている。
例えば、特許文献１にあっては、所定の対象に対する情動表現を含む文章に対して形態素処理や構文解析処理を施した後、多数の情動表現が登録されたアフェクトターム辞書を参照して、当該文章から情動表現を抽出すると共に、各情動表現の属性（ネガ／ポジ等）を集計し、その結果を外部に出力する技術が開示されている。
この結果、「口紅」という評価対象に関し、色つや、におい、付け心地、パッケージといった複数の評価軸毎に、否定的評価と肯定的評価の分布状況を提示することが可能となる。
特開２００３−２４８６８１ Also, in order to improve the efficiency of such manual analysis work, there is a technique for extracting end-user evaluations for specific products by automatically analyzing digitized text (questionnaire information) using a computer. It has already been proposed.
For example, in Patent Document 1, after performing morphological processing and syntax analysis processing on a sentence including emotional expressions for a predetermined object, refer to the effect term dictionary in which a large number of emotional expressions are registered, There is disclosed a technique for extracting emotional expressions from sentences, totaling attributes (negative / positive, etc.) of each emotional expression, and outputting the results to the outside.
As a result, regarding the evaluation target “lipstick”, it is possible to present the distribution of negative evaluations and positive evaluations for each of a plurality of evaluation axes such as color, odor, comfort, and package.
JP2003-248861

このように、従来の分析技術を用いることにより、評価対象である商品等に対するエンドユーザの好き嫌いや良し悪しといった、二者択一的な評価を自動集計することが実現できる。
しかしながら、分析担当者にとってエンドユーザの生の声に触れることは避けて通れず、最終的には膨大な量の文書に当たらざるを得ないのが実情であるが、各エンドユーザがフリーワードで気ままに記述した文書にアトランダムに目を通していくことは極めて非効率的であり、文書閲覧プログラムの検索機能を駆使するにしても、多くの労力を要していた。 As described above, by using the conventional analysis technique, it is possible to automatically aggregate alternative evaluations such as end user's likes and dislikes and good or bad for the products to be evaluated.
However, it is unavoidable for analysts to touch the live voices of end users, and in the end it is inevitably necessary to deal with a huge amount of documents. It is extremely inefficient to look at a document written at will at random, and even if the search function of the document browsing program is fully used, much labor is required.

この発明は、従来のこのような問題を解決するために案出されたものであり、各エンドユーザが記述した膨大な量の文書を、分析担当者が見やすいように分類・整理して提示することを可能とする技術の実現を目的としている。 The present invention has been devised to solve such a conventional problem, and presents an enormous amount of documents described by each end user, classified and arranged so that an analyst can easily see them. The purpose is to realize technology that makes this possible.

上記の目的を達成するため、請求項１に記載した感性分析システムは、事物に対する肯定／否定の価値判断を表す感性用語と、肯定／否定の何れであるかを示す属性との組合せを格納しておく感性辞書記憶手段と、文書ファイルに対して形態素処理を施し、当該文書ファイルに含まれる複数のテキスト文を形態素単位に分解する形態素解析手段と、各文に含まれる形態素または形態素の組合せからなる表現が、上記感性辞書記憶手段に格納された感性用語に該当するか否かを判定し、該当する場合には当該表現に対して組み合わされている上記肯定／否定の何れかの属性を示す属性タグを関連付ける手段と、否定文を構成する際に用いられる否定表現を格納しておく否定表現記憶手段と、各文に含まれる形態素または形態素の組合せからなる表現が、上記否定表現記憶手段に格納された否定表現に該当するか否かを判定し、該当する場合には当該表現に対して否定表現であることを示す否定表現タグを関連付ける手段と、各文に関連付けられた属性タグの種類と、否定表現タグの有無との組合せパターンに基づいて、当該文が否定評価文か肯定評価文かを判定し、否定評価文に対してはネガ判定タグを付与すると共に、肯定評価文に対してはポジ判定タグを付与する手段と、各文中の主格を構成するための助詞を探索し、当該助詞が付属している自立語を主題と認定する主題認定手段と、発言者の発言意図を表す意図表現と、その意図の類型との組合せを格納しておく意図表現記憶手段と、上記ネガ判定タグが付与された文について「不満」を意味する意図類型タグを関連付けると共に、上記ポジ判定タグが付与された文について「好評」を意味する意図類型タグを関連付け、さらに、各文に含まれる形態素または形態素の組合せからなる表現が、上記意図表現辞書記憶手段に格納された意図表現に該当するか否かを判定し、該当する場合には当該表現に対して対応の類型を示す意図類型タグを関連付ける手段と、複数種類の評価軸を示す文字列と、各評価軸に包含される類似表現との対応関係を格納しておく評価軸類似表現記憶手段と、各文に認定された主題が、上記評価軸類似表現記憶手段に格納された類似表現に該当するか否かを判定し、該当する場合には文に対して対応の評価軸を関連付ける手段と、各文を、共通の評価軸、意図類型及び感性用語を備えた類似文の集合である類似文グループ毎に分類すると共に、各類似文グループ中で文字数が最も少ない類似文を代表意見と認定する手段と、複数の代表意見が列記された代表意見リストを生成する手段と、この代表意見リストを出力する手段とを備えたことを特徴としている。
上記「属性」としては、例えば「＋」や「−」の極性が該当するが、「肯定」や「否定」、あるいは「Ｐ」や「Ｎ」等であってもよい（以下同様）。
上記「『不満』を意味する意図類型タグ」としては、「不満」の他にネガティブな意味を有する「残念」、「否定的」、「文句」などが該当する。
上記「『好評』を意味する意図類型タグ」としては、「好評」の他にポジティブな意味を有する「満足」、「肯定的」、「讃辞」などが該当する。
上記「出力」とは、ディスプレイに表示すること以外に、プリンタを介してプリントアウトすること、所定形式のファイルを生成して記憶手段に格納すること、画面（htmlファイル等）を生成してクライアント端末に送信することを含む概念である。 In order to achieve the above object, a sensitivity analysis system according to claim 1 stores a combination of a sensitivity term representing an affirmative / negative value judgment for an object and an attribute indicating whether it is affirmative / negative. Sensitivity dictionary storage means, morpheme processing is performed on a document file, a plurality of text sentences included in the document file are decomposed into morpheme units, and morphemes or combinations of morphemes included in each sentence It is determined whether or not the expression corresponds to the sensitivity term stored in the sensitivity dictionary storage means, and if so, indicates either the positive or negative attribute combined with the expression Consists of means for associating attribute tags, negative expression storage means for storing negative expressions used in constructing negative sentences, and morphemes or combinations of morphemes included in each sentence Determining whether or not the current expression corresponds to the negative expression stored in the negative expression storage means, and, if applicable, means for associating a negative expression tag indicating that the expression is a negative expression with each expression; Based on the combination pattern of the type of attribute tag associated with the sentence and the presence or absence of a negative expression tag, it is determined whether the sentence is a negative evaluation sentence or a positive evaluation sentence. At the same time, a means for assigning a positive determination tag to a positive evaluation sentence and a particle for constructing the main case in each sentence are searched, and the subject recognition that recognizes the independent word attached to the particle as the subject. Meaning storage means for storing a combination of a means, an intention expression expressing the intention of the speaker, and a type of the intention, and an intention type meaning “dissatisfaction” with respect to the sentence with the negative determination tag When you associate tags The sentence with the positive determination tag is associated with an intention type tag that means “popular”, and an expression composed of a morpheme or a combination of morphemes included in each sentence is stored in the intention expression dictionary storage unit. Means for associating an intention type tag indicating a corresponding type with the expression, a character string indicating a plurality of types of evaluation axes, and each evaluation axis. The evaluation axis similar expression storage means for storing the correspondence relationship with the similar expressions included in the text, and whether the subject recognized in each sentence corresponds to the similar expression stored in the evaluation axis similar expression storage means Means for associating a corresponding evaluation axis with a sentence if applicable, and for each sentence group that is a set of similar sentences with a common evaluation axis, intention type, and sensitivity term As well as A means for recognizing a similar sentence having the smallest number of characters in each similar sentence group as a representative opinion, a means for generating a representative opinion list in which a plurality of representative opinions are listed, and a means for outputting the representative opinion list It is characterized by that.
As the “attribute”, for example, the polarity of “+” or “−” is applicable, but “positive” or “negative”, “P”, “N”, or the like may be used (the same applies hereinafter).
The “intention type tag meaning“ dissatisfied ”” includes “disappointment”, “negative”, “complaint”, etc. having negative meanings in addition to “dissatisfied”.
The “intention type tag meaning“ favored ”” corresponds to “satisfied”, “positive”, “affirmation”, etc. having a positive meaning in addition to “popular”.
In addition to displaying on the display, the above "output" means printing out through a printer, generating a file in a predetermined format and storing it in a storage means, generating a screen (such as an html file), and a client It is a concept including transmitting to a terminal.

請求項２に記載した感性分析システムは、請求項１に記載のシステムであって、さらに、上記代表意見リスト中の各代表意見には、それぞれの類似文グループに含まれる類似文の件数が付記されていることを特徴としている。 The sensibility analysis system according to claim 2 is the system according to claim 1, and each representative opinion in the representative opinion list is appended with the number of similar sentences included in each similar sentence group. It is characterized by being.

請求項３に記載した感性分析システムは、請求項１または２に記載のシステムであって、さらに、上記代表意見リスト中の一の代表意見が選択された場合に、対応の類似文グループに属する個々の類似文を列記した内訳リストを生成する手段と、この内訳リストを出力する手段とを備えたことを特徴としている。 The sensitivity analysis system according to claim 3 is the system according to claim 1 or 2, and further belongs to a corresponding similar sentence group when one representative opinion in the representative opinion list is selected. The present invention is characterized by comprising means for generating a breakdown list in which individual similar sentences are listed and means for outputting the breakdown list.

請求項４に記載した感性分析システムは、請求項３に記載のシステムであって、さらに、上記内訳リスト中の一の類似文が選択された場合に、当該類似文を含む文書の全文を出力する手段を備えたことを特徴としている。 The sensibility analysis system according to claim 4 is the system according to claim 3, and further outputs a full sentence of a document including the similar sentence when one similar sentence in the breakdown list is selected. It is characterized by having means to do.

請求項５に記載した感性分析プログラムは、コンピュータを、事物に対する肯定／否定の価値判断を表す感性用語と、肯定／否定の何れであるかを示す属性との組合せを格納しておく感性辞書記憶手段、文書ファイルに対して形態素処理を施し、当該文書ファイルに含まれる複数のテキスト文を形態素単位に分解する形態素解析手段、各文に含まれる形態素または形態素の組合せからなる表現が、上記感性辞書記憶手段に格納された感性用語に該当するか否かを判定し、該当する場合には当該表現に対して組み合わされている上記肯定／否定の何れかの属性を示す属性タグを関連付ける手段、否定文を構成する際に用いられる否定表現を格納しておく否定表現記憶手段、各文に含まれる形態素または形態素の組合せからなる表現が、上記否定表現記憶手段に格納された否定表現に該当するか否かを判定し、該当する場合には当該表現に対して否定表現であることを示す否定表現タグを関連付ける手段、各文に関連付けられた属性タグの種類と、否定表現タグの有無との組合せパターンに基づいて、当該文が否定評価文か肯定評価文かを判定し、否定評価文に対してはネガ判定タグを付与すると共に、肯定評価文に対してはポジ判定タグを付与する手段、各文中の主格を構成するための助詞を探索し、当該助詞が付属している自立語を主題と認定する主題認定手段、発言者の発言意図を表す意図表現と、その意図の類型との組合せを格納しておく意図表現記憶手段、上記ネガ判定タグが付与された文について「不満」を意味する意図類型タグを関連付けると共に、上記ポジ判定タグが付与された文について「好評」を意味する意図類型タグを関連付け、さらに、各文に含まれる形態素または形態素の組合せからなる表現が、上記意図表現辞書記憶手段に格納された意図表現に該当するか否かを判定し、該当する場合には当該表現に対して対応の類型を示す意図類型タグを関連付ける手段、複数種類の評価軸を示す文字列と、各評価軸に包含される類似表現との対応関係を格納しておく評価軸類似表現記憶手段、各文に認定された主題が、上記評価軸類似表現記憶手段に格納された類似表現に該当するか否かを判定し、該当する場合には文に対して対応の評価軸を関連付ける手段、各文を、共通の評価軸、意図類型及び感性用語を備えた類似文の集合である類似文グループ毎に分類すると共に、各類似文グループ中で文字数が最も少ない類似文を代表意見と認定する手段、複数の代表意見が列記された代表意見リストを生成する手段、この代表意見リストを出力する手段として機能させることを特徴としている。 The sensitivity analysis program according to claim 5 stores a combination of a sensitivity term representing an affirmative / negative value judgment for an object and an attribute indicating whether the computer is positive / negative. Means for performing morpheme processing on a document file and decomposing a plurality of text sentences included in the document file into morpheme units; an expression comprising a morpheme or a combination of morphemes included in each sentence is the Kansei dictionary Means for associating attribute tags indicating any one of the above-mentioned positive / negative attributes combined with the expression, whether or not it corresponds to a sensitivity term stored in the storage means; Negative expression storage means for storing negative expressions used when composing sentences, expressions composed of morphemes or combinations of morphemes included in each sentence are A means for determining whether or not the negative expression stored in the current storage means corresponds, and in that case, a means for associating a negative expression tag indicating that the expression is a negative expression with the expression, an attribute associated with each sentence Based on the combination pattern of tag type and presence / absence of negative expression tag, it is determined whether the sentence is a negative evaluation sentence or a positive evaluation sentence. Means to attach positive judgment tags to sentences, search for particles that constitute the main character in each sentence, and subject recognition means that recognizes independent words attached with the particles as the subject; An intention expression storage means for storing a combination of an intention expression indicating the type of the intention and a type of the intention, an intention type tag that means “dissatisfied” is associated with the sentence with the negative determination tag, and the positive determination tag But Whether an intention type tag that means “popular” is associated with a given sentence, and further, an expression composed of a morpheme or a combination of morphemes included in each sentence corresponds to an intention expression stored in the intention expression dictionary storage unit. Means for associating an intention type tag indicating a corresponding type with the expression, if applicable, a character string indicating a plurality of types of evaluation axes, and a similar expression included in each evaluation axis Evaluation axis similar expression storage means for storing the correspondence relationship, whether the subject recognized in each sentence corresponds to the similar expression stored in the evaluation axis similar expression storage means, and if so, Is a means of associating a corresponding evaluation axis with a sentence, and each sentence is classified into a similar sentence group that is a set of similar sentences having a common evaluation axis, intention type, and sensitivity term, and in each similar sentence group The number of characters It is characterized by functioning as means for recognizing few similar sentences as representative opinions, means for generating a representative opinion list in which a plurality of representative opinions are listed, and means for outputting the representative opinion list.

この発明に係る感性分析システム及び感性分析プログラムにあっては、多数の分析対象文書から相互に似通った内容の文を類似文グループとして抽出すると共に、各類似文グループから最も文字数の少ない類似文を代表意見として選別し、代表意見リストとして外部に出力することができる。
このため分析担当者は、コンパクトにまとめられた代表意見リストを眺めることにより、分析対象文書の内容を概括することが可能となり、内容チェックの効率化を実現できる。
この代表意見は、機械によって合成された文ではなく、エンドユーザが実際に記述した文そのものであるため、分析担当者はエンドユーザの生の声に触れることができ、行間に潜む微妙なニュアンスを汲み取ることも可能となる。 In the sensibility analysis system and the sensibility analysis program according to the present invention, sentences having similar contents are extracted as a similar sentence group from a large number of analysis target documents, and a similar sentence having the smallest number of characters is extracted from each similar sentence group. A representative opinion can be selected and output as a representative opinion list to the outside.
For this reason, the person in charge of analysis can summarize the contents of the analysis target document by looking at the representative opinion list compiled in a compact manner, and the efficiency of the contents check can be realized.
This representative opinion is not a sentence synthesized by the machine, but a sentence actually written by the end user, so the analyst can touch the live voice of the end user and subtle nuances hidden between the lines. It can also be drawn.

図１は、この発明に係る感性分析システム10の全体構成を示すブロック図であり、文書登録部11と、文書記憶部12と、文書解析部13と、ネガ／ポジ判定部14と、感性辞書記憶部15と、否定表現記憶部16と、主題認定部17と、意図抽出部18と、意図表現記憶部19と、分析結果記憶部20と、分析結果評価部21と、評価軸類似表現記憶部22と、時系列解析部23と、考察ルール記憶部24と、画面生成部25とを備えている。 FIG. 1 is a block diagram showing the overall configuration of a sensibility analysis system 10 according to the present invention. A document registration unit 11, a document storage unit 12, a document analysis unit 13, a negative / positive determination unit 14, and a sensibility dictionary. Storage unit 15, negative expression storage unit 16, subject recognition unit 17, intention extraction unit 18, intention expression storage unit 19, analysis result storage unit 20, analysis result evaluation unit 21, evaluation axis similar expression storage Unit 22, time series analysis unit 23, consideration rule storage unit 24, and screen generation unit 25.

上記の文書登録部11、文書解析部13、ネガ／ポジ判定部14、主題認定部17、意図抽出部18、分析結果評価部21、時系列解析部23及び画面生成部25は、サーバコンピュータのCPUが、OS及びアプリケーションプログラムに従って必要な処理を実行することによって実現される。
また、上記の文書記憶部12、感性辞書記憶部15、否定表現記憶部16、意図表現記憶部19、分析結果記憶部20、評価軸類似表現記憶部22及び考察ルール記憶部24は、サーバコンピュータの外部記憶装置内に設けられている。 The document registration unit 11, the document analysis unit 13, the negative / positive determination unit 14, the subject recognition unit 17, the intention extraction unit 18, the analysis result evaluation unit 21, the time series analysis unit 23, and the screen generation unit 25 are The CPU is realized by executing necessary processes according to the OS and application programs.
In addition, the document storage unit 12, the sensitivity dictionary storage unit 15, the negative expression storage unit 16, the intention expression storage unit 19, the analysis result storage unit 20, the evaluation axis similar expression storage unit 22, and the consideration rule storage unit 24 are server computers. In the external storage device.

この感性分析システム10は、Webサーバ26とネットワーク接続されており、Webサーバ26はインターネット27を介して複数のクライアント端末28と接続される。
各クライアント端末28は、OS及びWebブラウザを搭載したPC等よりなる。 This sensibility analysis system 10 is connected to a Web server 26 via a network, and the Web server 26 is connected to a plurality of client terminals 28 via the Internet 27.
Each client terminal 28 includes a PC or the like equipped with an OS and a web browser.

つぎに、この感性分析システム10の利用方法について説明する。
まずユーザは、クライアント端末28からWebサーバ26にアクセスし、ID及びパスワードを入力してログインした後、サービスメニューから「感性分析サービス」を選択する。
この結果、Webサーバ26からクライアント端末28に対して分析条件指定画面が送信される。 Next, a method of using the sensitivity analysis system 10 will be described.
First, the user accesses the Web server 26 from the client terminal 28, inputs an ID and password, logs in, and then selects “Kansei analysis service” from the service menu.
As a result, an analysis condition designation screen is transmitted from the Web server 26 to the client terminal 28.

図２は、この分析条件指定画面40の一例を示すものであり、分析対象ファイル指定欄41と、評価軸指定欄42とを備えている。 FIG. 2 shows an example of the analysis condition designation screen 40, which includes an analysis target file designation column 41 and an evaluation axis designation column 42.

まず、分析対象ファイル指定欄41は、ユーザ側で事前に準備した文書ファイルを指定する欄である。すなわち、ユーザが参照ボタン43をクリックすると、クライアント端末28のドライブ構造を示すファイル選択ウィンドウがWebブラウザ上に表示される。これに対しユーザは、特定のドライブ名やその配下のフォルダ名をクリックして展開させ、特定の文書ファイルを指定する。
図においては、「03〜11 アンケート結果.csv」という文書ファイルが指定されている。 First, the analysis target file designation column 41 is a column for designating a document file prepared in advance by the user. That is, when the user clicks the browse button 43, a file selection window showing the drive structure of the client terminal 28 is displayed on the Web browser. On the other hand, the user clicks and expands a specific drive name or a folder name under it to specify a specific document file.
In the figure, a document file “03-11 questionnaire result.csv” is designated.

この文書ファイルには、ユーザである「ＡＢＣ観光ホテル」の顧客から集めた電子化済みの文書が、複数件格納されている。
各文書は、個々の顧客が記述したアンケートの回答文書に相当し、それぞれ複数の文が含まれている。また各文書には、回答日時や更新日時等の時間情報と、回答者の属性情報（性別、年代、職業、都道府県、郵便番号等）が付加されている。 In this document file, a plurality of digitized documents collected from customers of the user “ABC Tourist Hotel” are stored.
Each document corresponds to a questionnaire response document described by an individual customer, and each document includes a plurality of sentences. In addition, each document is added with time information such as reply date and time, update date and time, and attribute information of the respondent (gender, age, occupation, prefecture, postal code, etc.).

評価軸指定欄42は、分析対象ファイルに含まれた各文に対する分析項目を指定する欄であり、ユーザの属する業界毎に設定された標準の評価軸が、複数列挙されている。
図においては、ユーザIDに関連付けられた業種コードが「ホテル・旅館業界」であるため、「価格」、「料理」、「部屋」、「風呂」、「立地」、「宴会」、「設備」等が表示されている。
これに対しユーザは、不要な評価軸のチェックボックスに入れられたチェックを外すことにより、当該評価軸を分析項目から除外することができる。 The evaluation axis designation column 42 is a column for designating analysis items for each sentence included in the analysis target file, and lists a plurality of standard evaluation axes set for each industry to which the user belongs.
In the figure, since the industry code associated with the user ID is “Hotel / Ryokan industry”, “Price”, “Cooking”, “Room”, “Bath”, “Location”, “Banquet”, “Equipment” Etc. are displayed.
On the other hand, the user can exclude the evaluation axis from the analysis items by unchecking the check box of the unnecessary evaluation axis.

以上の設定を完了したユーザが、分析開始ボタン44をクリックすると、クライアント端末28からWebサーバ26に対して、感性分析リクエストが送信される。
この際、ユーザが指定した文書ファイルのデータと、ユーザが設定した評価軸の識別コードが、Webサーバ26に送信される。
これを受けたWebサーバ26は、クライアント端末28からアップロードされた文書ファイルや評価軸の識別コード、当該ユーザの業種コード等のデータを感性分析システム10に送信し、分析を依頼する。 When the user who has completed the above settings clicks the analysis start button 44, a sensitivity analysis request is transmitted from the client terminal 28 to the Web server 26.
At this time, the document file data designated by the user and the evaluation axis identification code set by the user are transmitted to the Web server 26.
Receiving this, the Web server 26 transmits data such as the document file uploaded from the client terminal 28, the identification code of the evaluation axis, the industry code of the user, and the like to the sensitivity analysis system 10, and requests analysis.

以下、図３のフローチャートに従い、この感性分析システム10における処理手順を説明する。
まず、感性分析システム10の文書登録部11は、Webサーバ26から送信された文書ファイルを、文書記憶部12に格納する（Ｓ10）。この文書ファイルには、ユーザが設定した評価軸の識別コード及びユーザの業種コードが関連付けられている。 Hereinafter, the processing procedure in the sensitivity analysis system 10 will be described with reference to the flowchart of FIG.
First, the document registration unit 11 of the sensitivity analysis system 10 stores the document file transmitted from the web server 26 in the document storage unit 12 (S10). This document file is associated with the evaluation axis identification code set by the user and the user's business type code.

つぎに、文書解析部13が文書記憶部12に格納された文書ファイルを取り出し、当該文書ファイル中の各文書について文書識別コードを付与すると同時に、各文書に含まれる各文について文識別コードを付与する（Ｓ11）。 Next, the document analysis unit 13 retrieves the document file stored in the document storage unit 12, assigns a document identification code to each document in the document file, and simultaneously assigns a sentence identification code to each sentence included in each document. (S11).

つぎに文書解析部13は、文書ファイル中の各文について、形態素解析処理を施す（Ｓ12）。
ここで「形態素解析」とは、自然言語で記述された文を、意味を有する最小の言語単位である形態素に分解し、それぞれの品詞を特定する処理をいう。この形態素解析処理は公知技術であり、例えば以下のようなフリーソフトを形態素解析エンジンとして用いることができる。
■MeCab（http://mecab.sourceforge.net/）
■ChaSen（http://chasen.naist.jp/hiki/ChaSen/） Next, the document analysis unit 13 performs a morphological analysis process for each sentence in the document file (S12).
Here, “morpheme analysis” refers to a process of decomposing a sentence written in a natural language into morphemes, which are the smallest meaningful language units, and specifying each part of speech. This morpheme analysis process is a known technique. For example, the following free software can be used as the morpheme analysis engine.
■ MeCab (http://mecab.sourceforge.net/)
■ ChaSen (http://chasen.naist.jp/hiki/ChaSen/)

つぎに文書解析部13は、形態素に分解された各文について、構文解析処理を施す（Ｓ14）。
ここで「構文解析」とは、各文に含まれる文節間の係り受け構造を特定する処理をいう。この構文解析自体も公知技術であり、例えば以下のようなフリーソフトを構文解析エンジンとして用いることができる。
■KNP（http://nlp.kuee.kyoto-u.ac.jp/nl-resource/knp.html）
■CaboCha（http://chasen.org/~taku/software/cabocha/） Next, the document analysis unit 13 performs syntax analysis processing for each sentence decomposed into morphemes (S14).
Here, “syntactic analysis” refers to a process of specifying a dependency structure between clauses included in each sentence. This syntax analysis itself is also a known technique. For example, the following free software can be used as the syntax analysis engine.
■ KNP (http://nlp.kuee.kyoto-u.ac.jp/nl-resource/knp.html)
■ CaboCha (http://chasen.org/~taku/software/cabocha/)

文書解析部13による処理が完了すると、ネガ／ポジ判定部14によるネガ／ポジ判定処理が実行される（Ｓ16）。
以下、図４のフローチャートに従い、ネガ／ポジ判定処理の具体的な手順について説明する。 When the processing by the document analysis unit 13 is completed, negative / positive determination processing by the negative / positive determination unit 14 is executed (S16).
Hereinafter, a specific procedure of the negative / positive determination process will be described with reference to the flowchart of FIG.

まずネガ／ポジ判定部14は、感性辞書記憶部15内に格納された各種感性辞書を参照し、各文を構成する形態素または形態素の組合せの中で、感性辞書に登録された感性用語に該当するものを探索する（Ｓ16-01）。 First, the negative / positive determination unit 14 refers to various sensitivity dictionaries stored in the sensitivity dictionary storage unit 15, and corresponds to the sensitivity term registered in the sensitivity dictionary in the morpheme or combination of morphemes constituting each sentence. Search for what to do (S16-01).

感性辞書記憶部15には、図５に示すように、汎用辞書群50と、複数のドメイン別辞書群55が格納されている。また、汎用辞書群50には、単語辞書51、二項関係辞書52、慣用句辞書53、擬態語辞書54の各辞書が含まれている。さらに、各ドメイン別辞書群55には、単語辞書56、二項関係辞書57、擬態語辞書58の各辞書がそれぞれ含まれている。 As shown in FIG. 5, the sensitivity dictionary storage unit 15 stores a general-purpose dictionary group 50 and a plurality of domain-specific dictionary groups 55. The general-purpose dictionary group 50 includes a word dictionary 51, a binary relation dictionary 52, an idiomatic phrase dictionary 53, and a mimetic word dictionary 54. Further, each domain dictionary group 55 includes a word dictionary 56, a binary relation dictionary 57, and a mimetic word dictionary 58, respectively.

図６は、汎用辞書群50に属する単語辞書51の登録データを例示するものであり、「感性用語」、「読みがな」及び「極性」のデータ項目を備えたレコードが多数格納されている。
ここで「感性用語」のデータ項目には、「おいしい」や「うまい」、「まずい」等、事物に対する肯定／否定（良し悪し）に関する人間の主観的な価値判断を表す名詞や形容詞等が格納されている。また、極性のデータ項目には、当該感性用語の意味内容が肯定的な場合には「＋」の符号が、否定的な場合には「−」の符号が設定されている。 FIG. 6 exemplifies registered data of the word dictionary 51 belonging to the general-purpose dictionary group 50, and stores a large number of records having data items of “Kansei term”, “Reading” and “Polarity”. .
Here, the “Kansei term” data item stores nouns, adjectives, etc. that represent human subjective value judgments regarding positive / negative (good / bad) things such as “delicious”, “delicious”, “bad”, etc. Has been. Further, in the polarity data item, a sign “+” is set when the meaning content of the sensitivity term is positive, and a sign “−” is set when it is negative.

汎用辞書群50に属する二項関係辞書52にも、図示は省略したが、「感性用語」、「読みがな」、「極性」のデータ項目を備えたレコードが多数登録されている。
ここで「二項関係」とは、例えば「値段が高い」のように、二つの単語が助詞を介して組み合わされることにより、初めて一つの具体的な価値判断が生じる言葉を意味している。
すなわち、「高い」という形容詞は、一般的には肯定的な意味合いを観念させるものであるが、「値段が高い」や「危険性が高い」、「腐食性が高い」のように、主語によっては否定的な意味合いが生じる場合がある。あるいは逆に、「低い」という形容詞は、一般的には否定的な意味合いを観念させるものであるが、「危険性が低い」や「毒性が低い」、「違法性が低い」のように、主語によって肯定的な意味合いが生じる場合がある。
このため二項関係辞書52には、このように二つの単語の組合せによって具体的な価値判断を表すこととなる言葉について、「＋」または「−」の極性が付与されている。 In the binary relation dictionary 52 belonging to the general-purpose dictionary group 50, a number of records having data items of “sensibility term”, “reading”, and “polarity” are registered, although illustration is omitted.
Here, the “binary relationship” means a word that causes a specific value judgment for the first time when two words are combined through a particle, for example, “high price”.
In other words, the adjective “high” generally has a positive meaning, but it depends on the subject, such as “high price”, “high risk”, and “high corrosiveness”. May have negative implications. Or, conversely, the adjective “low” is generally conceived of a negative meaning, but like “low risk”, “low toxicity”, “low illegality” The subject may have positive implications.
For this reason, the binary relation dictionary 52 is given a polarity of “+” or “−” for a word that expresses a specific value judgment by a combination of two words.

汎用辞書群50に属する慣用句辞書53にも、図示は省略したが、「感性用語」、「読みがな」、「極性」のデータ項目を備えたレコードが多数登録されている。
具体的には、「馬の耳に念仏（感性用語）／−（極性）」、「雨降って地固まる（感性用語）／＋（極性）」、「片腹痛い（感性用語）／−（極性）」、「覆水盆に返らず（感性用語）／−（極性）」、「出藍の誉れ（感性用語）／＋（極性）」のように、諺や格言のように価値判断を伴う慣用句が「感性用語」のデータ項目に充填されると共に、各慣用句の意味合いが肯定的なものである場合には「＋」が、否定的なものである場合には「−」が「極性」のデータ項目に充填されている。 In the common phrase dictionary 53 belonging to the general-purpose dictionary group 50, a number of records having data items of “sensibility term”, “reading”, and “polarity” are registered, although illustration is omitted.
Specifically, “Nenbutsu in the ears of the horse (sensitivity term) / − (polarity)”, “It rains and solidifies (sensitivity term) / + (polarity)”, “Hit stomachache (sensitivity term) / − (polarity) ”,“ Do not return to the basin (sensitivity term) / − (polarity) ”,“ Honor of the indigo (sensitivity term) / + (polarity) ”, and idioms with value judgments such as proverbs and maxims Is filled in the data item of “Kansei term”, and “+” is used when the meaning of each idiom is positive, and “−” is “polarity” when it is negative. The data items are filled.

汎用辞書群50に属する擬態語辞書54にも、図示は省略したが、「感性用語」、「読みがな」、「極性」のデータ項目を備えたレコードが多数登録されている。
具体的には、「わくわく（感性用語）／＋（極性）」、「ほっこり（感性用語）／＋（極性）」、「ざわざわ（感性用語）／−（極性）」、「ぎとぎと（感性用語）／−（極性）」のように、価値判断を伴う擬態語が「感性用語」のデータ項目に充填されると共に、各擬態語の意味合いが肯定的なものである場合には「＋」が、否定的なものである場合には「−」が「極性」のデータ項目に充填されている。 Although not shown in the mimetic word dictionary 54 belonging to the general-purpose dictionary group 50, a large number of records having data items of “sensitive term”, “reading”, and “polarity” are registered.
Specifically, "Waku Waku (Sensitive term) / + (Polarity)", "Hot (Sensitive term) / + (Polarity)", "Zawazawa (Sensitive term) /-(Polarity)", "Gitto (Sensitive term) Like “/-(polarity)”, a mimetic word with a value judgment is filled in the data item of “Kansei term”, and “+” is negative when the meaning of each mimetic word is positive. If it is a negative one, “-” is filled in the “polarity” data item.

各ドメイン別辞書群55に属する単語辞書56、二項関係辞書57、擬態語辞書58にも、図示は省略したが、「感性用語」、「読みがな」、「極性」のデータ項目を備えたレコードがそれぞれ多数登録されている。
ただし、各ドメイン別辞書群は、それぞれ特定のカテゴリ（業界、商品分野、サービス分野）毎に用意されており、当該カテゴリに特有の感性用語や極性が登録されている。 Although not shown, the word dictionary 56, the binary relation dictionary 57, and the mimetic word dictionary 58 that belong to each domain dictionary group 55 are provided with data items of “sense term”, “reading”, and “polarity”. A lot of records are registered.
However, each domain dictionary group is prepared for each specific category (industry, product field, service field), and sensitivity terms and polarities specific to the category are registered.

そして、特定の形態素または形態素の組合せにマッチする感性用語が感性辞書中に存在していた場合、ネガ／ポジ判定部14はその感性用語の極性を表すタグ（＜＋＞または＜−＞）を当該表現に付与する（Ｓ16-02）。 When a sensitivity term matching a specific morpheme or combination of morphemes exists in the sensitivity dictionary, the negative / positive determination unit 14 adds a tag (<+> or <->) indicating the polarity of the sensitivity term. It is given to the expression (S16-02).

この際、例えば文中に「値段が少し高い」という表現があり、二項関係辞書52中に「値段が高い（感性用語）／−（極性）」の登録例が存在した場合に、ネガ／ポジ判定部14は構文解析の処理結果を参照することにより、「値段が少し高い」に対して＜−＞のタグを付与することができる。
すなわち、文書解析部13による構文解析処理により、当該文中の「値段が」の文節が「高い」の文節に係ることを示す解析データが生成されているため、ネガ／ポジ判定部14はこの解析データに基づき、二項関係辞書52中の「値段が高い」の極性を「値段が少し高い」の表現に適用可能となる。 In this case, for example, if there is an expression “price is a little high” in the sentence, and there is a registered example of “high price (sensitivity term) / − (polarity)” in the binary relational dictionary 52, the negative / positive By referring to the processing result of the syntax analysis, the determination unit 14 can add a <-> tag to “a little expensive”.
That is, since the analysis data indicating that the “price” clause in the sentence relates to the “high” clause is generated by the parsing process by the document analysis unit 13, the negative / positive determination unit 14 performs this analysis. Based on the data, the polarity of “high price” in the binary relational dictionary 52 can be applied to the expression “price is a little high”.

つぎにネガ／ポジ判定部14は、否定表現記憶部16を参照し、各文を構成する形態素または形態素の組合せ中で、否定表現記憶部16に登録された否定表現パターンに合致する表現を探索する（Ｓ16-03）。 Next, the negative / positive determination unit 14 refers to the negative expression storage unit 16 to search for an expression that matches the negative expression pattern registered in the negative expression storage unit 16 in the morpheme or combination of morphemes constituting each sentence. (S16-03).

否定表現記憶部16には、図示は省略したが、否定文を作る際に用いられる多数の否定表現パターンが格納されている。例えば、「思えません」、「思えない」、「いえません」、「いえない」、「いえぬ」、「ありません」、「ない」、「感じません」、「感じない」、「考えません」、「考えない」等が該当する。
そして、否定表現記憶部16中に文中の形態素または形態素の組合せにマッチする否定表現パターンが登録されていた場合、ネガ／ポジ判定部14はその表現（形態素または形態素の組合せ）に対して否定表現タグ（＜否定＞）を付与する（Ｓ16-04）。 Although not shown in the figure, the negative expression storage unit 16 stores a large number of negative expression patterns used when creating negative sentences. For example, "I don't think", "I don't think", "I can't say", "I can't say", "I can't say", "I don't", "I don't feel", "I don't feel", "I don't feel" No ”,“ I don't think ”, etc.
If a negative expression pattern that matches the morpheme or combination of morphemes in the sentence is registered in the negative expression storage unit 16, the negative / positive determination unit 14 performs a negative expression for the expression (morpheme or combination of morphemes). A tag (<Negation>) is assigned (S16-04).

つぎにネガ／ポジ判定部14は、各文に付与された極性タグと否定表現タグとの組合せに独自の文法ルールを適用することにより、文全体が否定的か肯定的かの判定を行う（Ｓ16-05）。
そして、「否定的」との判定結果が出た場合、ネガ／ポジ判定部14は当該否定評価文に「＜Ｎ＞(Negative)」のネガ判定タグを付与し、「肯定的」との判定結果が出た場合には、当該肯定評価文に「＜Ｐ＞(Positive)」のポジ判定タグを付与する（Ｓ16-06）。 Next, the negative / positive determination unit 14 determines whether the entire sentence is negative or positive by applying a unique grammar rule to the combination of the polarity tag and the negative expression tag given to each sentence ( S16-05).
When a negative determination result is obtained, the negative / positive determination unit 14 adds a negative determination tag of “<N> (Negative)” to the negative evaluation sentence, and determines “positive”. If a result is obtained, a positive determination tag of “ (Positive)” is assigned to the positive evaluation sentence (S16-06).

例えば、図７(a)に示すように、「料理は、おいしいと思いました。」という文の場合、「おいしい」の形態素に＜＋＞の極性タグが付与されており、＜否定＞タグは付与されていないため、ネガ／ポジ判定部14は文全体を「肯定的」と認定し、＜Ｐ＞のポジ判定タグを関連付ける。 For example, as shown in FIG. 7A, in the case of the sentence “I thought the dish was delicious,” the <+> polarity tag was attached to the “delicious” morpheme, and the <deny> tag Is not assigned, the negative / positive determination unit 14 recognizes the entire sentence as “positive” and associates the positive determination tag of .

これに対し、図７(b)に示すように、「料理は、おいしいとは思いませんでした。」という文の場合、「おいしい」の形態素に＜＋＞の極性タグが付与されているが、それよりも後方に位置する「思いません」の部分に＜否定＞タグが付与されているため、ネガ／ポジ判定部14は文全体を「否定的」と認定し、＜Ｎ＞のネガ判定タグを関連付ける。 On the other hand, as shown in FIG. 7B, in the case of the sentence “I didn't think the dish was delicious,” the “+” polarity tag was attached to the “delicious” morpheme. Since the <No> tag is attached to the “I don't think” part that is located behind it, the negative / positive determination unit 14 recognizes the entire sentence as “Negative” and the <N> negative Associate a judgment tag.

また、図７(c)に示すように、「料理は、まずかったです。」という文の場合、「まずかっ」の形態素に＜−＞の極性が付与されており、＜否定＞タグは付与されていないため、ネガ／ポジ判定部14は文全体を「否定的」と認定し、＜Ｎ＞のネガ判定タグを関連付ける。 In addition, as shown in FIG. 7 (c), in the case of the sentence “Cooking was bad”, the <-> polarity is assigned to the “morphic” morpheme, and the <negative> tag is assigned. Therefore, the negative / positive determination unit 14 recognizes the entire sentence as “negative” and associates the negative determination tag of <N>.

これに対し、図７(d)に示すように、「料理は、まずくはありませんでした。」という文の場合、「まずく」の形態素に＜−＞の極性が付与されているが、それよりも後方に位置する「ありません」の部分に＜否定＞タグが付与されているため、ネガ／ポジ判定部14は文全体を「肯定的」と認定し、＜Ｐ＞のポジ判定タグを関連付ける。 On the other hand, as shown in FIG. 7 (d), in the case of the sentence “Cooking was not bad”, the <-> polarity was given to the “poor” morpheme. Since the <Negated> tag is assigned to the “no” part located at the rear, the negative / positive determining unit 14 recognizes the entire sentence as “positive” and associates the positive determining tag.

ネガ／ポジ判定部14によるネガ／ポジ判定処理が完了すると、主題認定部17による主題認定処理が実行される（図３のＳ18）。
ここで「主題認定処理」とは、ネガ／ポジの評価対象を特定する処理を意味する。 When the negative / positive determination processing by the negative / positive determination unit 14 is completed, the subject recognition processing by the subject recognition unit 17 is executed (S18 in FIG. 3).
Here, the “subject recognition process” means a process for specifying a negative / positive evaluation target.

この主題を特定するため主題認定部17は、まず文中における格要素「〜は」、「〜が」、「〜も」を探索し、これらの助詞が付属している自立語（主格）を主題と認定し、その旨を示す＜主題＞のタグを付与する。図７で示した各例文の場合、「料理」が主題に該当する。 In order to identify this subject, the subject recognition unit 17 first searches for the case elements "~ ha", "~ ga", and "~ mo" in the sentence, and uses the independent words (major case) to which these particles are attached as the subject. And attach a <subject> tag to that effect. In the case of each example sentence shown in FIG. 7, “cooking” corresponds to the subject.

文によっては、主格が省略されている場合がある。例えば、図８に示すように、「(1) 部屋が蒸し暑くて寝苦しかった。」の文に続く「(2) しかもかび臭かった。」では、主格が省略されている。このような場合、主題認定部17は前の文の主題である「部屋」を後の文に継承させ、(2)の文の主題として「部屋」を認定する。 In some sentences, the main character may be omitted. For example, as shown in FIG. 8, the main character is omitted in “(2) It was musty” following the sentence “(1) The room was too hot and hard to sleep”. In such a case, the subject recognition unit 17 passes the “room” that is the subject of the previous sentence to the subsequent sentence, and recognizes “room” as the subject of the sentence (2).

この主題の継承は、連続する文の間に限定されるものではなく、主格の存在しない文が続いた場合には、次々と継承される。
図８においては、「(3) 静かなのがせめてもの救いだった。」の文にも、(1)の文の「部屋」が主格として継承されている。そして、新たな主格である「料理は」を有する(4)の文が登場した時点で、「部屋」の主題としての継承が停止されている。 Inheritance of this subject is not limited between successive sentences, but is inherited one after another when sentences without a main character continue.
In FIG. 8, the “room” in the sentence (1) is inherited as the main character in the sentence “(3) It was a salvation at least for quiet.” Then, when the sentence (4) having a new main character “Cooking” appears, the inheritance as the theme of “room” is stopped.

当該文中に主格が存在せず、かつ、継承すべき主格を備えた先行文が存在しない場合、主題認定部17は当該文中に連体修飾語が含まれているか否かを探索し、連体修飾語を発見した場合にはその修飾先の形態素を主題と認定する。
例えば、「いい香り。」という文の場合、主格が省略されているが、「香り」という名詞（体言）を修飾している連体修飾語「いい（＜＋＞）」が存在しているため、主題認定部17は「香り」を当該文の主題と認定する。 If there is no main character in the sentence and there is no preceding sentence with a main character that should be inherited, the subject recognition unit 17 searches for whether or not a combination modifier is included in the sentence. If it is found, the modified morpheme is recognized as the subject.
For example, in the case of the sentence “good scent.”, The main character is omitted, but there is a combined modifier “good (<+>)” that modifies the noun (body name) “scent”. The subject recognition unit 17 recognizes “scent” as the subject of the sentence.

主題認定部17による主題認定処理が完了すると、意図抽出部18による意図抽出処理が実行される（図３のＳ20）。
ここで「意図」とは、個別の主題に対する発言者（文執筆者）の「良い／悪い」という二元論的な価値判断の他に、当該価値判断の背後に潜む発言目的や思惑、あるいは良し悪しの価値判断から離れた発言者の読み手に対する主張やメッセージなどを含む概念であり、「好評」、「不満」、「意向」、「要望」、「質問」、「予想外」、「興味有り」、「興味無し」等に類型化できる。 When the subject certification process by the subject certification unit 17 is completed, the intention extraction process by the intention extraction unit 18 is executed (S20 in FIG. 3).
Here, “intent” refers to the dualistic value judgment of “good / bad” by a speaker (writer) on an individual subject, as well as the purpose or speculation behind the value judgment, or the good or bad This is a concept that includes an assertion or message to the reader of a speaker who is far from the judgment of the value of, and is "popular", "dissatisfied", "intention", "request", "question", "unexpected", "interesting" , “No interest”, etc.

まず意図抽出部18は、＜Ｐ＞のポジ判定タグが付与された文に対して＜好評＞の意図類型タグを付与すると共に、＜Ｎ＞のネガ判定タグが付与された文に対して＜不満＞の意図類型タグを付与する。 First, the intention extraction unit 18 adds a <Popular> intention type tag to a sentence with a positive determination tag, and <N> for a sentence with a <N> negative determination tag. A dissatisfaction> intention type tag is assigned.

つぎに、意図抽出部18は意図表現記憶部19を参照して、各文中の表現（形態素または形態素の組合せ）とのマッチングを実行し、＜好評＞及び＜不満＞以外の意図類型を抽出する。
すなわち、図９に示すように、意図表現記憶部19に格納された各レコードは、「意図表現」と「意図類型」のデータ項目を備えている。そこで意図抽出部18は、この意図表現にマッチする表現を各文中において探索し、該当する表現を発見した場合には、対応する意図類型のタグを当該表現に関連付ける。 Next, the intention extraction unit 18 refers to the intention expression storage unit 19 and executes matching with expressions (morpheme or combination of morphemes) in each sentence to extract intention types other than <popular> and <dissatisfaction>. .
That is, as shown in FIG. 9, each record stored in the intention expression storage unit 19 includes data items of “intention expression” and “intention type”. Therefore, the intention extraction unit 18 searches each sentence for an expression that matches the intention expression, and if a corresponding expression is found, associates the tag of the corresponding intention type with the expression.

例えば、図１０(a)に示すように、「夕食が粗末なので、もう少し品数を増やして下さい。」という文が与えられた場合、意図抽出部18は「下さい」の文字列が意図表現記憶部19に存在することを探知した後、「下さい」の意図類型である「要望」に対応した＜要望＞の意図類型タグを当該表現に関連付ける。
因みに、この文については文全体が「否定的」であることを示す＜Ｎ＞のネガ判定タグが付与されているため、＜不満＞の意図類型タグが重複的に付与されている。 For example, as shown in FIG. 10 (a), when the sentence “Since dinner is poor, please increase the number of items a little more.” Is given, the intention extraction unit 18 displays the character string “please” as the intention expression storage unit. After detecting that it exists in 19, an intention type tag of <request> corresponding to “request” which is an intention type of “please” is associated with the expression.
Incidentally, since the negative determination tag of <N> indicating that the whole sentence is “negative” is assigned to this sentence, the <unsatisfied> intention type tag is redundantly assigned.

また、図１０(b)に示すように、「値段が良心的なのには驚きました。」という文が与えられた場合、意図抽出部18は「驚き」の文字列が意図表現記憶部19に存在することを探知した後、「驚き」に対応した＜予想外＞の意図類型タグを当該表現に付与する。
この文については、文全体が「肯定的」であることを示す＜Ｐ＞のポジ判定タグが付与されているため、＜好評＞の意図類型タグも付与されている。 Also, as shown in FIG. 10B, when a sentence “I was surprised that the price is conscientious” is given, the intention extraction unit 18 sends a character string “surprise” to the intention expression storage unit 19. After detecting the presence, an <unexpected> intention type tag corresponding to “surprise” is assigned to the expression.
For this sentence, the positive determination tag indicating that the entire sentence is “positive” is assigned, and therefore, the <popular> intention type tag is also assigned.

また、図１０(c)に示すように、「アクセスが悪いので今度は車にしたい。」という文が与えられた場合、意図抽出部18は「したい」の文字列が意図表現記憶部19に存在することを探知した後、「したい」に対応した＜意向＞の意図類型タグを当該文に付与する。
この文については、文全体が「否定的」であることを示す＜Ｎ＞のネガ判定タグが付与されているため、＜不満＞の意図類型タグも付与されている。。 Further, as shown in FIG. 10 (c), when the sentence “I want to drive this time because the access is bad” is given, the intention extraction unit 18 stores the character string “I want” in the intention expression storage unit 19. After detecting that it exists, an intention type tag of <intention> corresponding to “I want to do” is given to the sentence.
For this sentence, since the negative determination tag of <N> indicating that the whole sentence is “negative” is assigned, the intention type tag of <dissatisfaction> is also assigned. .

さらに、図１０(d)に示すように、「当方の頼み方がいけなかったのでしょうか？」という文が与えられた場合、意図抽出部18は「でしょうか」の文字列が意図表現記憶部19に存在することを探知した後、「でしょうか」に対応した＜質問＞の意図類型タグを当該文に付与する。
この文の場合、ネガ／ポジの判定結果を示すタグが付与されていないため、＜好評＞や＜不満＞の意図類型タグは付与されていない。 Furthermore, as shown in FIG. 10 (d), when a sentence “Is it possible to ask us?” Is given, the intention extraction unit 18 stores the intention expression memory string. After detecting that it exists in the part 19, an intention type tag of <question> corresponding to “Is it?” Is given to the sentence.
In the case of this sentence, the tag indicating the negative / positive determination result is not attached, and thus the intention type tag of <favorable> or <not satisfied> is not attached.

分析対象ファイルに含まれる全ての文について意図抽出処理が完了すると、意図抽出部18は、これまでの分析結果を分析結果記憶部20に格納する（図３のＳ22）。
ここで「分析結果」とは、各文に対して主題タグや極性タグ、否定表現タグ、ネガ／ポジ判定タグ、意図類型タグを付与した分析結果データが該当する。 When the intention extraction process is completed for all sentences included in the analysis target file, the intention extraction unit 18 stores the analysis results thus far in the analysis result storage unit 20 (S22 in FIG. 3).
Here, “analysis result” corresponds to analysis result data in which a subject tag, a polarity tag, a negative expression tag, a negative / positive determination tag, and an intention type tag are assigned to each sentence.

つぎに、分析結果評価部21及び画面生成部25によって、分析結果画面が生成される（図３のＳ24）。
この分析結果画面は、Webサーバ26を経由してクライアント端末28に送信される（図３のＳ26）。 Next, the analysis result evaluation unit 21 and the screen generation unit 25 generate an analysis result screen (S24 in FIG. 3).
This analysis result screen is transmitted to the client terminal 28 via the Web server 26 (S26 in FIG. 3).

図１１は、クライアント端末28のWebブラウザ上に表示された分析結果画面60の一例を示すものであり、メイン領域にはネガ／ポジ分析結果を示すネガ／ポジ分布グラフ61が表示されている。
また、サイドバーには、意図抽出結果の件数がツリー状に表示された意図分類チャート62が設けられている。 FIG. 11 shows an example of the analysis result screen 60 displayed on the Web browser of the client terminal 28, and a negative / positive distribution graph 61 showing a negative / positive analysis result is displayed in the main area.
Further, the sidebar is provided with an intention classification chart 62 in which the number of intention extraction results is displayed in a tree shape.

まず、ネガ／ポジ分布グラフ61には、総合、価格、料理、部屋、風呂、立地の評価軸毎に、ポジ、ネガ、中立のパーセンテージが帯グラフによって示されている。
このグラフ61を参照することにより、例えば、「価格」についてはポジ判定の付いた文書の割合がネガ判定の文書の割合よりも多いことから、顧客は価格について概ね満足していることが読み取れる。
これに対し、「部屋」についてはネガ判定の付いた文書の割合が圧倒的に多くなっているため、多くの顧客の不興を買っていることが理解できる。
また、各評価軸の合計値から導かれた「総合」については、ポジとネガの比率が拮抗しているため、全体としては「可もなく不可もなし」という結果を認識することができる。 First, in the negative / positive distribution graph 61, positive, negative, and neutral percentages are shown by band graphs for each evaluation axis of total, price, food, room, bath, and location.
By referring to this graph 61, for example, it can be read that the customer is generally satisfied with the price because the ratio of the document with the positive determination is larger than the ratio of the negative determination document.
On the other hand, regarding the “room”, since the proportion of documents with negative judgments is overwhelmingly large, it can be understood that many customers are buying a bad experience.
In addition, with respect to “total” derived from the total value of each evaluation axis, the ratio of positive and negative is antagonizing, so the overall result can be recognized as “no good or bad”.

ネガ／ポジ分析結果の見せ方は上記の帯グラフに限定されるものではなく、レーダーチャートや円グラフによってネガ／ポジの割合や数を表示することもできる。 The way of showing the negative / positive analysis result is not limited to the above-mentioned band graph, and the negative / positive ratio and number can also be displayed by a radar chart or a pie chart.

意図分類チャート62においては、価格、料理、部屋、風呂、立地の評価軸毎に、該当文の延べ件数が括弧内に表示されている。例えば、「価格」に関しては318件の文が、「料理」に関しては170件の文が関連付けられている。 In the intention classification chart 62, the total number of corresponding sentences is displayed in parentheses for each evaluation axis of price, cooking, room, bath, and location. For example, 318 sentences are associated with “Price” and 170 sentences are associated with “Cooking”.

ここでユーザが各評価軸をクリックすると、好評、不満、意向、要望、質問、予想外、興味有り、興味無し、不明の意図類型がそれぞれ展開する。
また、各意図類型には該当の意図類型タグが付与された文の延べ数が括弧内に表示されている。例えば、「価格」配下の「好評」に関しては74件の文が存在しており、同「不満」に関しては121件の文が存在していることが示されている。
「不明」とは、該当の評価軸に係るものではあるが、何れの意図類型タグをも付与されていない文の数を示している。 When the user clicks each evaluation axis, popular intentions, dissatisfactions, intentions, requests, questions, unexpected, interested, not interested, and unknown intention types are developed.
In addition, the total number of sentences with the corresponding intention type tag is displayed in parentheses for each intention type. For example, it is shown that there are 74 sentences for “popular” under “price” and 121 sentences for “dissatisfied”.
“Unknown” indicates the number of sentences that are related to the relevant evaluation axis but are not assigned any intention type tag.

ここでユーザが何れかの意図類型をクリックすると、当該意図類型に含まれる代表意見が複数列挙された代表意見リストが展開する。
例えば、ユーザが評価軸「価格」配下の意図類型「好評」をクリックすると、図１２に示すように、サイドバー中に「料金の安さが魅力。（17）」、「料金が手ごろだし。（12）」、「安いわりには良し。（15）」等の代表意見を列記した代表意見リスト63が展開表示される。
各代表意見の末尾に付記された括弧付きの数字は、当該代表意見及びその類似文の合計数を示している。 Here, when the user clicks on any intention type, a representative opinion list in which a plurality of representative opinions included in the intention type are listed is developed.
For example, when the user clicks an intention type “popular” under the evaluation axis “price”, as shown in FIG. 12, “low price is attractive. (17)”, “price is reasonable” in the sidebar. The representative opinion list 63 listing representative opinions such as “12)” and “Good for cheap” (15) is displayed.
The numbers in parentheses attached to the end of each representative opinion indicate the total number of the representative opinions and similar sentences.

これに対しユーザが何れかの代表意見をマウスポインタで選択すると、メイン領域に当該代表意見及びその類似文から構成される類似文グループの内訳リスト64が表示される。
この内訳リスト64は、文書番号及び類似文の表示項目を備えており、類似文の表示項目には、各文書の中で代表意見と同一または近似した文が表示されている。この結果ユーザは、個々の類似文の具体的な表現を確認することが可能となる。 On the other hand, when the user selects any representative opinion with the mouse pointer, a breakdown list 64 of similar sentence groups including the representative opinion and similar sentences is displayed in the main area.
The breakdown list 64 includes document number and similar sentence display items, and the similar sentence display item displays a sentence that is the same as or close to the representative opinion in each document. As a result, the user can confirm a specific expression of each similar sentence.

ユーザが任意の類似文の「選択」ボタン65をクリックすると、図示は省略したが、ディスプレイ上に別ウィンドウが起動して、当該代表意見または他の類似文を含む文書の全体が表示される。
表示された文書中の各文には、各種情報（極性タグ、否定表現タグ、ネガ／ポジ判定タグ、主題タグ、評価軸、意図類型タグ、属性情報、時間情報等）が付与されている。 When the user clicks the “select” button 65 of any similar sentence, although not shown, another window is activated on the display, and the entire document including the representative opinion or other similar sentences is displayed.
Various information (polarity tag, negative expression tag, negative / positive determination tag, theme tag, evaluation axis, intention type tag, attribute information, time information, etc.) is given to each sentence in the displayed document.

上記のように、このシステム10においては、まず代表意見リスト63が画面中に表示された後、その中の一の代表意見に係る内訳リスト64が表示され、最後に内訳リスト64中の一の類似文に係る文書全体が表示されというように、分析対象データの内容が段階的に表示される仕組みを備えているため、分析担当者は要チェックの文を容易に探知することができ、作業効率を飛躍的に向上させることができる。 As described above, in this system 10, first, the representative opinion list 63 is displayed on the screen, then the breakdown list 64 related to one representative opinion is displayed, and finally one of the breakdown lists 64 in the breakdown list 64 is displayed. Since the entire document related to similar sentences is displayed, the analysis staff has a mechanism to display the contents of the analysis target data in stages. Efficiency can be improved dramatically.

つぎに、図１３のフローチャートに従い、この分析結果画面60の生成に係る処理手順を説明する。
まず、分析結果評価部21は、当該文書ファイルに関連付けられた評価軸である「価格」、「料理」、「部屋」、「風呂」、「立地」を取得する（Ｓ24-01）。
つぎに分析結果評価部21は、評価軸類似表現記憶部22に格納された業界毎の評価軸類似表現辞書を参照し、各評価軸の類似表現（展開語）を特定する（Ｓ24-02）。 Next, a processing procedure related to generation of the analysis result screen 60 will be described with reference to the flowchart of FIG.
First, the analysis result evaluation unit 21 acquires “price”, “dish”, “room”, “bath”, and “location”, which are evaluation axes associated with the document file (S24-01).
Next, the analysis result evaluation unit 21 refers to the industry-specific evaluation axis similar expression dictionary stored in the evaluation axis similar expression storage unit 22, and specifies the similar expression (expanded word) of each evaluation axis (S24-02). .

図１４は、評価軸類似表現記憶部22に格納されたホテル・旅館業界用の評価軸類似表現辞書の具体例を示している。
例えば、「価格」の評価軸については、「価格」の他に、「料金」、「値段」、「宿泊料」、「宿代」等の類義語や関連語が類似表現として格納されている。これらの類似表現は、一般的な類義語辞書等を参照し、また個々の業界の特性を考慮しつつ、ユーザの属する業界毎に編纂されたものである。
このため、「価格」の評価軸について、「価格」や「料金」、「値段」といった一般的な呼び名の他に、「宿泊料」、「宿代」のようにホテル・旅館業界に特有の呼び名が列記されている。
また、飲食業界に属するユーザに対してサービスを提供する場合には、「宿泊料」や「宿代」の代わりに「飲食代」や「飲み代」等の類似表現が列記された、飲食業界用の評価軸類似表現辞書が適用されることとなる。 FIG. 14 shows a specific example of an evaluation axis similar expression dictionary for the hotel / ryokan industry stored in the evaluation axis similar expression storage unit 22.
For example, for the evaluation axis of “price”, synonyms and related words such as “fee”, “price”, “accommodation fee”, “accommodation fee”, and the like are stored as similar expressions in addition to “price”. These similar expressions are compiled for each industry to which the user belongs while referring to a general synonym dictionary or the like and considering the characteristics of each industry.
For this reason, on the evaluation axis of “price”, in addition to general names such as “price”, “price”, and “price”, “hotel charges” and “price” are unique to the hotel and ryokan industry. Names are listed.
In addition, when providing services to users belonging to the food and beverage industry, similar expressions such as “food and beverage charges” and “beverage charges” are listed instead of “hotel charges” and “hotel charges”. The evaluation axis similarity expression dictionary is applied.

つぎに分析結果評価部21は、文書ファイルに含まれる各文の「主題」として認定された文字列と、上記の類似表現とをマッチングさせ、該当する文に対応の評価軸を関連付ける（Ｓ24-03）。
例えば、図１０(a)の「夕食が粗末なので、もう少し品数を増やして下さい。」という文の場合、主題である「夕食」の文字列が評価軸「料理」の類似表現として登録されているため、分析結果評価部21は「料理」の評価軸識別コードを当該文に関連付ける。 Next, the analysis result evaluation unit 21 matches the character string recognized as the “subject” of each sentence included in the document file with the above similar expression, and associates the corresponding evaluation axis with the corresponding sentence (S24-). 03).
For example, in the case of the sentence “Dinner is poor, please increase the number of items a little more” in FIG. 10 (a), the character string “dinner” which is the subject is registered as a similar expression of the evaluation axis “dish”. Therefore, the analysis result evaluation unit 21 associates the evaluation axis identification code of “cooking” with the sentence.

また、図１０(b)の「値段が良心的なのには驚きました。」という文の場合、主題である「値段」の文字列が評価軸「価格」の類似表現として登録されているため、分析結果評価部21は「価格」の評価軸識別コードを当該文に関連付ける。 Also, in the case of the sentence “I was surprised that the price is conscientious” in FIG. 10 (b), since the character string of “the price” that is the subject is registered as a similar expression of the evaluation axis “price”, The analysis result evaluation unit 21 associates the “price” evaluation axis identification code with the sentence.

また、図１０(c)の「アクセスが悪いので今度は車にしたい。」という文の場合、主題である「アクセス」の文字列が評価軸「立地」の類似表現として登録されているため、分析結果評価部21は「立地」の評価軸識別コードを当該文に関連付ける。 Also, in the case of the sentence “Access is bad and I want to drive this time” in FIG. 10 (c), since the character string “access” that is the subject is registered as a similar expression of the evaluation axis “location”, The analysis result evaluation unit 21 associates the evaluation axis identification code of “location” with the sentence.

これに対し、図１０(d)の「当方の頼み方がいけなかったのでしょうか？」という文の場合、主題である「頼み方」の文字列は何れの評価軸でも類似表現として登録されていないため、分析結果評価部21は「不明」の評価軸識別コードを関連付ける。 On the other hand, in the case of the sentence “Wouldn't it be my request?” In Fig. 10 (d), the character string “How to request” that is the subject is registered as a similar expression on any evaluation axis. Therefore, the analysis result evaluation unit 21 associates an evaluation axis identification code of “unknown”.

つぎに分析結果評価部21は、ユーザが選択した評価軸の識別コードが付与された文を文書ファイルから抽出した後、各文に設定されたネガ／ポジ判定タグ＜Ｐ＞及び＜Ｎ＞の数を評価軸毎に集計する（Ｓ24-04）。 Next, the analysis result evaluation unit 21 extracts the sentence to which the identification code of the evaluation axis selected by the user is added from the document file, and then sets the negative / positive determination tags and <N> set for each sentence. The number is counted for each evaluation axis (S24-04).

例えば、「価格」の評価軸識別コードが付与された文が全部で80個あり、その中で＜Ｐ＞のタグが付された文が45個、＜Ｎ＞のタグが付された文が30個、＜Ｐ＞及び＜Ｎ＞の何れも付与されていない文が５個あった場合、分析結果評価部21は以下の集計結果を生成する。
［評価軸：価格］
＜Ｐ＞（ポジ）：45個
＜Ｎ＞（ネガ）：30個
（中立）：５個 For example, there are a total of 80 sentences with “price” evaluation axis identification codes, of which 45 are tagged with the tag, and those with the <N> tag. When there are five sentences to which none of and <N> is assigned, the analysis result evaluation unit 21 generates the following total result.
[Evaluation axis: Price]
 (Positive): 45 <N> (Negative): 30
(Neutral): 5

また、「料理」の識別コードが付与された文が全部で60個あり、その中で＜Ｐ＞のタグが付された文が18個、＜Ｎ＞のタグが付された文が25個、＜Ｐ＞及び＜Ｎ＞の何れも付与されていない文が17個あった場合、分析結果評価部21は以下の集計結果を生成する。
［評価軸：料理］
＜Ｐ＞（ポジ）：18個
＜Ｎ＞（ネガ）：25個
（中立）：17個 In addition, there are 60 sentences with “Cooking” identification code in total, 18 of which are tagged with and 25 of which are tagged with <N>. When there are 17 sentences to which neither nor <N> is assigned, the analysis result evaluation unit 21 generates the following tabulation results.
[Evaluation axis: Cooking]
 (Positive): 18 <N> (Negative): 25
(Neutral): 17

つぎに分析結果評価部21は、上記の集計結果を反映させた帯グラフ（ネガ・ポジ分布グラフ61）を生成する（Ｓ24-05）。
この際、分析結果評価部21は、各評価軸のポジ数を合計した「総合ポジ数」を算出すると共に、各評価軸のネガ数を合計した「総合ネガ数」、各評価軸の中立数を合計した「総合中立数」を算出し、それぞれの割合を帯グラフの「総合」に反映させる。 Next, the analysis result evaluation unit 21 generates a band graph (negative / positive distribution graph 61) reflecting the above-described total result (S24-05).
At this time, the analysis result evaluation unit 21 calculates the “total number of positives” that is the total number of positives of each evaluation axis, and also calculates the “total number of negatives” that is the total number of negatives of each evaluation axis, and the neutral number of each evaluation axis. Calculate the “total neutral number”, and reflect the percentages in the “total” of the band graph.

つぎに分析結果評価部21は、各文に付与された評価軸及び意図類型毎に文の数を集計し、意図分類チャート76を生成する（Ｓ24-06）。 Next, the analysis result evaluation unit 21 aggregates the number of sentences for each evaluation axis and intention type assigned to each sentence, and generates an intention classification chart 76 (S24-06).

つぎに分析結果評価部21は、「評価軸×意図類型」単位で文書ファイルから代表意見を抽出する（Ｓ24-07）。
このために分析結果評価部21は、まず同じ評価軸及び意図類型を有する文の集合の中から、同一の感性表現を備えた文同士を、類似文グループとして抽出する。 Next, the analysis result evaluation unit 21 extracts representative opinions from the document file in units of “evaluation axis × intention type” (S24-07).
For this purpose, the analysis result evaluation unit 21 first extracts sentences having the same emotional expression as a similar sentence group from a set of sentences having the same evaluation axis and intention type.

例えば図１５(a)に示すように、評価軸「価格」及び意図類型「不満」を共通にする(1)〜(6)の文が与えられた場合、分析結果評価部21は図１５(b)に示すように、感性表現「高い」を共通にする(1)(2)(5)の３件の文を共通の類似文グループとして取り出す。
つぎに分析結果評価部21は、図１５(c)に示すように、評価軸、意図類型、感性表現を共通にする(1)(2)(5)の類似文グループ中で、最も文字数の少ない(2)の文「明らかに価格が高すぎる。」を代表意見と認定する。
つぎに分析結果評価部21は、当該類似文グループに属する類似文の合計数である「３」を、当該代表意見の件数として関連付ける。 For example, as shown in FIG. 15 (a), when the sentences (1) to (6) that share the evaluation axis “price” and the intention type “dissatisfaction” are given, the analysis result evaluation unit 21 performs FIG. As shown in b), the three sentences (1), (2), and (5) that share the sensitivity expression “high” are extracted as a common similar sentence group.
Next, as shown in FIG. 15 (c), the analysis result evaluation unit 21 has the same number of characters among the similar sentence groups (1), (2), and (5) that share the evaluation axis, intention type, and sensitivity expression. A few (2) sentences “obviously too expensive” are recognized as representative opinions.
Next, the analysis result evaluation unit 21 associates “3”, which is the total number of similar sentences belonging to the similar sentence group, as the number of representative opinions.

図示は省略したが、同様の手順に従い、分析結果評価部21は感性表現「ぼったくり」を共通にする(3)及び(6)の文を類似文グループを構成するものとして取り出した後、より文字数の少ない(3)の文「ぼったくりとしか思えません。」を代表意見と認定する。 Although illustration is omitted, following the same procedure, the analysis result evaluation unit 21 takes out the sentences (3) and (6) that share the emotional expression `` bottling '' as constituting a similar sentence group, and then increases the number of characters. The sentence (3) with a small number of words, “I can only think of a rip-off,” is recognized as the representative opinion.

画面生成部25は、分析結果評価部21から渡されたネガ・ポジ分布グラフ及び意図分類チャートを所定のテンプレートに充填することにより、分析結果画面60を生成する（Ｓ24-08）。
この分析結果画面60は、上記の通りWebサーバ26を介してクライアント端末28に送信される。 The screen generation unit 25 generates the analysis result screen 60 by filling a predetermined template with the negative / positive distribution graph and the intention classification chart delivered from the analysis result evaluation unit 21 (S24-08).
The analysis result screen 60 is transmitted to the client terminal 28 via the Web server 26 as described above.

また、Webサーバ26経由でクライアント端末28から特定の評価軸に係る特定の意図類型の選択情報が送信された場合、画面生成部25は分析結果評価部21から渡された代表意見及びその件数（代表意見が属する類似グループ中の類似文数）が列記された画面を生成し、Webサーバ26経由でクライアント端末28に送信する。
この結果、図１２に示したように、クライアント端末28のWebブラウザ上には、サイドバーに特定の評価軸及び意図類型に係る代表意見リスト63が配置された画面60が表示される。 In addition, when selection information of a specific intention type related to a specific evaluation axis is transmitted from the client terminal 28 via the Web server 26, the screen generation unit 25 sends the representative opinion and the number of cases ( A screen listing the number of similar sentences in the similar group to which the representative opinion belongs is generated and transmitted to the client terminal 28 via the Web server 26.
As a result, as shown in FIG. 12, on the web browser of the client terminal 28, a screen 60 in which the representative opinion list 63 relating to a specific evaluation axis and intention type is arranged in the sidebar is displayed.

さらに、Webサーバ26経由でクライアント端末28から特定の代表意見の選択情報が送信された場合、画面生成部25は分析結果評価部21から渡された対応の類似文グループに属する全類似文（代表意見＋他の類似文）の具体的な内容が列記された画面を生成し、Webサーバ26経由でクライアント端末28に送信する。
この結果、図１２に示したように、クライアント端末28のWebブラウザ上には、メイン領域に代表意見を含む類似文グループの内訳リスト64が配置された画面60が表示される。 Furthermore, when selection information of a specific representative opinion is transmitted from the client terminal 28 via the Web server 26, the screen generation unit 25 displays all similar sentences (representatives) belonging to the corresponding similar sentence group passed from the analysis result evaluation unit 21. A screen in which specific contents of (opinion + other similar sentences) are listed is generated and transmitted to the client terminal 28 via the Web server 26.
As a result, as shown in FIG. 12, a screen 60 in which a breakdown list 64 of similar sentence groups including representative opinions is arranged in the main area is displayed on the Web browser of the client terminal 28.

ユーザが画面60中の「時系列解析結果」ボタン66をクリックすると、分析結果評価部21及び時系列解析部23による時系列解析処理が実行されると共に、その解析結果を反映させた画面が画面生成部25によって生成され、Webサーバ26経由でクライアント端末28に送信される。 When the user clicks the “time series analysis result” button 66 in the screen 60, the time series analysis processing by the analysis result evaluation unit 21 and the time series analysis unit 23 is executed, and a screen reflecting the analysis result is displayed on the screen. Generated by the generation unit 25 and transmitted to the client terminal 28 via the Web server 26.

図１６は、この時系列解析結果画面70を示すものであり、画面のメイン領域には、縦軸に件数が設定されると共に、横軸に時間が設定された時系列変化グラフ71が表示されている。
また、時系列変化グラフ71中の変化点Ｐについては、「ネガティブな意見が非常に多く、意見の53％を占めており要注意です。不満意見として『料理の味が落ちた。』が多く出現しています。」という内容の考察文72が付記されている。「変化点」の意義については、後述する。 FIG. 16 shows the time series analysis result screen 70. In the main area of the screen, a time series change graph 71 in which the number of cases is set on the vertical axis and the time is set on the horizontal axis is displayed. ing.
As for the change point P in the time series change graph 71, “There are a lot of negative opinions, and 53% of the opinions are cautions. A consideration sentence 72 with the content “Appears” is added. The significance of the “change point” will be described later.

以下、図１７のフローチャートに従い、この時系列解析に係る処理手順を説明する。
まず、Webサーバ26経由でクライアント端末28から時系列解析のリクエストを受信した分析結果評価部21は（Ｓ30）、分析結果データに含まれる各文書の時間情報（回答日時等）に基づいて日毎の文書件数を集計し（Ｓ32）、この時系列情報に基づいて時系列変化グラフを生成する（Ｓ34）。
つぎに分析結果評価部21は、この日毎の件数情報（時系列情報）を時系列解析部23に渡し、変化点の検出を依頼する。 Hereinafter, a processing procedure related to the time series analysis will be described with reference to the flowchart of FIG.
First, the analysis result evaluation unit 21 that has received a request for time series analysis from the client terminal 28 via the Web server 26 (S30), based on the time information (answer date and time) of each document included in the analysis result data, The number of documents is totaled (S32), and a time series change graph is generated based on the time series information (S34).
Next, the analysis result evaluation unit 21 passes the daily number information (time series information) to the time series analysis unit 23 and requests detection of a change point.

これを受けた時系列解析部23は、所定の時系列解析モデル（アルゴリズム）に上記の時系列情報を投入することにより、変化点を特定する（Ｓ36）。
ここで「変化点」とは、複数のランダムな時系列データの集合において、データの基本的な性質が変化した時点を意味し、データの特性に応じて最適な解析モデルが複数用意されている。図１８において、主な時系列解析モデルの特徴を示す。
時系列解析部23は、デフォルトでは「ARIMAモデル」を適用することによって変化点を特定し、分析結果評価部21に返す。 Receiving this, the time series analysis unit 23 specifies the change point by putting the above time series information into a predetermined time series analysis model (algorithm) (S36).
Here, the “change point” means the point in time when the basic properties of the data change in a set of multiple random time-series data, and multiple optimal analysis models are prepared according to the characteristics of the data. . FIG. 18 shows the characteristics of the main time series analysis model.
The time series analysis unit 23 specifies a change point by applying an “ARIMA model” by default, and returns it to the analysis result evaluation unit 21.

これに対し分析結果評価部21は、この変化点に対応した考察文を生成し、上記の通り時系列変化グラフ中に付記する（Ｓ38）。この考察文の生成方法については、後に詳述する。 On the other hand, the analysis result evaluation unit 21 generates a consideration sentence corresponding to the change point and appends it to the time series change graph as described above (S38). A method for generating the consideration sentence will be described in detail later.

画面生成部25は、分析結果評価部21から渡された時系列変化グラフに基づいて時系列解析結果画面70を生成する（Ｓ40）。 The screen generation unit 25 generates a time series analysis result screen 70 based on the time series change graph passed from the analysis result evaluation unit 21 (S40).

上記の通り、デフォルトでは「全分析結果データ」に対して、「ARIMAモデル」に基づいた時系列解析が実行されるが、ユーザは解析対象期間や評価軸×意図類型を限定し、あるいは異なる時系列解析モデルの適用を求めることもできる。 As described above, by default, time series analysis based on the “ARIMA model” is performed on “all analysis result data”, but the user limits the analysis target period, evaluation axis x intention type, or at different times. Application of a series analysis model can also be obtained.

具体的には、サイドバーに用意された期間指定欄73において始期及び周期を選択することで、ユーザは解析対象期間を絞り込む。
またユーザは、対象データ指定欄74において必要な「評価軸×意図類型」のチェックボックスにチェックを入れることにより、対象文の意図類型を絞り込む。例えば、「総合」配下の「不満」にチェックを入れると、全評価軸に係る＜不満＞のタグが付与された文が時系列解析対象として指定されたことになる。これに対し、「価格」配下の「不満」にチェックを入れると、「価格」の評価軸に係る＜不満＞のタグが付与された文のみが時系列解析対象として指定されたことになる。
さらにユーザは、解析モデル指定欄75において任意の解析モデルを指定することで、適用すべき解析モデルの変更を求めることができる。 Specifically, the user narrows down the analysis target period by selecting a start period and a period in the period designation field 73 prepared in the sidebar.
Further, the user narrows down the intention type of the target sentence by checking a check box of “evaluation axis × intention type” necessary in the target data designation column 74. For example, if “unsatisfied” under “general” is checked, a sentence to which a tag of <dissatisfied> related to all evaluation axes is assigned is designated as a time-series analysis target. On the other hand, when “dissatisfied” under “price” is checked, only sentences with the <dissatisfaction> tag related to the evaluation axis of “price” are designated as time series analysis targets.
Furthermore, the user can obtain a change of the analysis model to be applied by designating an arbitrary analysis model in the analysis model designation field 75.

サイドバーにおける設定を完了したユーザが「再表示」ボタン76をクリックすると、クライアント端末28からWebサーバ26に指定条件データが送信される。
Webサーバ26経由でこの指定条件データを受け取った分析結果評価部21は、指定された条件に合致する分析結果データに基づいて時系列変化グラフを作成すると共に、同データに基づく変化点の特定を時系列解析部23に依頼する。
これを受けた時系列解析部23は、指定された時系列解析モデルに時系列データを投入して新たな変化点を特定し、分析結果評価部21に返す。
分析結果評価部21は、この変化点に係る考察文を生成する。
画面生成部25は、分析結果評価部21から渡された時系列変化グラフ及び考察文に基づいて時系列解析結果画面を生成する。
この時系列解析結果画面は、Webサーバ26経由でクライアント端末28に送信される。
以上の結果、クライアント端末28のWebブラウザ上には、新たな時系列解析結果画面70が表示される（図示省略）。 When the user who has completed the setting in the sidebar clicks the “redisplay” button 76, the specified condition data is transmitted from the client terminal 28 to the Web server 26.
Upon receiving this specified condition data via the Web server 26, the analysis result evaluation unit 21 creates a time-series change graph based on the analysis result data that matches the specified condition, and specifies the change point based on the data. The time series analysis unit 23 is requested.
Receiving this, the time series analysis unit 23 inputs time series data into the designated time series analysis model, identifies a new change point, and returns it to the analysis result evaluation unit 21.
The analysis result evaluation unit 21 generates a consideration sentence related to the change point.
The screen generation unit 25 generates a time series analysis result screen based on the time series change graph and the consideration sentence passed from the analysis result evaluation unit 21.
This time series analysis result screen is transmitted to the client terminal 28 via the Web server 26.
As a result, a new time series analysis result screen 70 is displayed on the web browser of the client terminal 28 (not shown).

ユーザが画面70中の「自動考察結果」ボタン77をクリックすると、分析結果評価部21による考察文生成処理が実行されると共に、その考察文を記述した画面が画面生成部25によって生成され、Webサーバ26経由でクライアント端末28に送信される。 When the user clicks the “automatic consideration result” button 77 in the screen 70, the analysis result evaluation unit 21 executes a consideration sentence generation process, and a screen describing the consideration sentence is generated by the screen generation part 25. It is transmitted to the client terminal 28 via the server 26.

図１９は、この自動考察結果画面80を示すものであり、「総合的な考察」、「回答者の属性別考察(1)」、「回答者の属性別考察(2)」、「価格に関する考察」、「料理に関する考察」、「部屋に関する考察」、「風呂に関する考察」等の考察文表示欄81が設けられており、各考察文表示欄81には分析対象文書における記述内容の傾向を、簡潔な言葉で表現した考察文（コメント）が記述されている。 FIG. 19 shows the automatic consideration result screen 80, which is “Comprehensive consideration”, “Respondent attribute-specific consideration (1)”, “Respondent attribute-specific consideration (2)”, “Price related”. Consideration sentence display fields 81 such as "consideration", "consideration about cooking", "consideration about room", and "consideration about bath" are provided, and each consideration sentence display field 81 shows the tendency of description contents in the analysis target document. , Comments (comments) expressed in simple words are described.

例えば、「総合的な考察」として「全般的にポジティブな意見が多く、意見の38％を占めています。好評意見として『料金が安い』という意見が多くなっています。」が表示されており、これを参照することによってユーザは、文書ファイル全体の傾向を大まかに把握することが可能となる。 For example, “Comprehensive consideration” is displayed as “Generally, there are many positive opinions, accounting for 38% of the opinions. As a popular opinion, there are many opinions that“ the price is cheap ”.” By referring to this, the user can roughly grasp the tendency of the entire document file.

また「回答者の属性別考察(1)」として、「属性『30代男性』では、ポジティブな意見が非常に多く、意見の62％を占めており、要注目です。好評意見として『アクセスが良い』が多く出現しています。」が表示されており、これを参照することによってユーザは、文書を記述した回答者の属性に特有の評価傾向を把握することが可能となる。 Also, as “Consideration by respondent attribute (1),” “Attribute '30's male” has a lot of positive opinions, accounting for 62% of the opinions. "Good" appears a lot "is displayed, and by referring to this, the user can grasp the evaluation tendency peculiar to the attribute of the respondent describing the document.

また「価格に関する考察」として、「価格に関しては、ポジティブな意見が多く、意見の42％を占めています。好評意見として『料金が安い』という意見が多くなっています。」が表示されており、これを参照することによってユーザは、「評価軸：価格」に関する評価傾向を大まかに認識することが可能となる。 Also, “Price consideration” is displayed as “There are many positive opinions regarding the price, accounting for 42% of the opinion. As a popular opinion, there are many opinions that“ the price is cheap ””. By referring to this, the user can roughly recognize the evaluation tendency related to “evaluation axis: price”.

以下において、「価格」に関する考察文の生成に係る処理手順を説明する。
まず分析結果評価部21は、「価格」の評価軸が関連付けられた文の件数と、当該件数中における「不満」、「好評」、「要望」、「予想外」、「質問」、「意向」の意図類型毎の件数を、考察ルール記憶部24に格納された考察ルールに当てはめることにより、考察文を生成する。 Hereinafter, a processing procedure related to generation of a consideration sentence related to “price” will be described.
First, the analysis result evaluation unit 21 determines the number of sentences associated with the “price” evaluation axis, and “dissatisfaction”, “popularity”, “request”, “unexpected”, “question”, “intention” Is applied to the consideration rule stored in the consideration rule storage unit 24 to generate a consideration sentence.

図２０は、考察ルールの一例を示しており、各ルールは「優先度」、「考察名」、「適用条件」、「テンプレート」のデータ項目を備えている。
これに対し分析結果評価部21は、図２１に示すように、優先度の高い順に適用条件と分析対象データとの合致／不合致を調べ、合致した時点で当該考察ルール（好評）に係るテンプレート中の比率挿入欄に具体的な数値（ポジティブの占めるパーセンテージ）を挿入すると共に、意見挿入欄に最多代表意見を挿入し、さらに価格の評価軸に係る枕詞である「価格に関しては、」を文頭に挿入することにより、考察文を完成させる。 FIG. 20 shows an example of the consideration rule, and each rule includes data items of “priority”, “consideration name”, “application condition”, and “template”.
On the other hand, as shown in FIG. 21, the analysis result evaluation unit 21 checks the matching / non-matching of the application condition and the data to be analyzed in descending order of priority, and when it matches, the template related to the consideration rule (popular). Insert a specific numerical value (percentage of positives) in the ratio insertion column, insert the most representative opinion in the opinion insertion column, and also start with “With regard to price”, which is a pillow line related to the price evaluation axis To complete the consideration sentence.

上記「ポジティブの占めるパーセンテージ」とは、「価格」の評価軸が設定された文の集合の中で、＜好評＞の意図類型タグが付与された文の百分率を意味している。
また、上記「最多代表意見」とは、評価軸として「価格」が付与されると共に、＜好評＞の意図類型タグが付与された代表意見の中で、関係する類似文の件数が最も多いものを意味している。
これに対し、適用される考察ルールが仮に「予想外」であった場合、「価格」の評価軸が設定され、かつ＜予想外＞の意図類型タグが付与された代表意見の中で、関係する類似文の件数が最も多いものが、上記の「最多代表意見」として選定される。 The above-mentioned “percentage occupied by positives” means the percentage of sentences with the <popular> intent type tag in the sentence set in which the evaluation axis of “price” is set.
The above “most representative opinion” has “price” as the evaluation axis and has the highest number of related similar sentences among the representative opinions with the <popular> intention type tag. Means.
On the other hand, if the consideration rule to be applied is “unexpected”, the evaluation axis of “price” is set, and in the representative opinion with the <unexpected> intention type tag, The sentence with the largest number of similar sentences is selected as the above “most representative opinion”.

他の評価軸に関する考察文を生成する場合も、分析結果評価部21は上記と同様の手順を踏襲することで、当該評価軸固有の考察文を完成させる。 Even in the case of generating a consideration sentence relating to another evaluation axis, the analysis result evaluation unit 21 follows the same procedure as described above to complete a consideration sentence unique to the evaluation axis.

総合的な考察文を生成する場合も、分析結果評価部21は基本的には上記と同様の手順を踏む。
すなわち、価格〜立地の評価軸に係る全ての文の件数と、その中に含まれる「不満」、「好評」、「要望」、「予想外」、「質問」、「意向」の意図類型毎の合計件数を考察ルールに当てはめて、「好評」の考察ルールに係るテンプレートを特定した後、＜好評＞の意図類型タグが設定された文が占める具体的な数値を比率挿入欄に挿入すると共に、全評価軸を通じて最も頻度の高い＜好評＞の意図類型タグが設定された代表意見を意見挿入欄に挿入し、「全般的に」の枕詞を文頭に挿入することにより、考察文を完成させる。 Even when generating a comprehensive consideration sentence, the analysis result evaluation unit 21 basically follows the same procedure as described above.
In other words, the number of all sentences related to the evaluation axis of price to location and the intention types of “dissatisfied”, “popular”, “request”, “unexpected”, “question”, and “intention” included in it. After applying the total number of cases to the consideration rule and identifying the template related to the “favorite” consideration rule, insert a specific numerical value occupied by the sentence with the <intentional> intention type tag into the ratio insertion field. , The representative opinion with the most popular <popular> intention type tag set through all evaluation axes is inserted into the opinion insertion field, and the general statement is inserted at the beginning of the sentence to complete the consideration sentence. .

つぎに、回答者の属性別の考察文生成処理について説明する。
まず分析結果評価部21は、全分析対象文を各文書に関連付けられた「年代×性別」の属性グループ単位で分類し、それぞれの文集合の件数を集計する。
つぎに分析結果評価部21は、図２２に示すように、「年代×性別」単位で考察ルールの適用条件を各文集合に適用して、判定結果（true／false）を導き出す。
そして、一の属性グループの文集合にのみに該当する考察ルールのテンプレートが、考察文の雛形として抽出され、必要な数値や文言の挿入を経て考察文が完成される。 Next, discussion text generation processing for each attribute of the respondent will be described.
First, the analysis result evaluation unit 21 classifies all the sentences to be analyzed in attribute group units of “age × gender” associated with each document, and totals the number of sentences in each sentence set.
Next, as shown in FIG. 22, the analysis result evaluation unit 21 applies the consideration rule application condition to each sentence set in units of “age × gender” to derive a determination result (true / false).
Then, a consideration rule template corresponding to only the sentence set of one attribute group is extracted as a template of the consideration sentence, and the consideration sentence is completed through insertion of necessary numerical values and words.

例えば、考察名「かなり好評」についてみると、「30代男性」の属性グループのみが「判定結果＝true」となっているため、対応のテンプレートである「ポジティブな意見が非常に多く、意見の［比率］％を占めており要注目です。好評意見として「［意見］」が多く出現しています。」が取り出される。
つぎに、「30代男性」の属性グループに係る文集合の中で、＜好評＞の意図類型タグが付与された文の比率が算出され、比率挿入欄に挿入される。
また、上記文集合の中から＜好評＞の意図類型タグが付与された最多代表意見が抽出され、意見挿入欄に挿入される。
最後に、文頭に「属性『30代男性では、』」が挿入されることにより、考察文が完成される。 For example, in the case of the consideration name “very popular”, only the attribute group of “male in their 30s” has “judgment result = true”, so the corresponding template is “very positive opinion, [Ratio]%, and it is worth paying attention. Is extracted.
Next, the ratio of sentences to which the intention type tag of <popular> is given in the sentence set related to the attribute group of “male in their 30s” is calculated and inserted into the ratio insertion column.
In addition, the most representative opinion with the <popular> intention type tag is extracted from the sentence set and inserted into the opinion insertion column.
Finally, the consideration sentence is completed by inserting “attribute“ in the 30's male ”” at the beginning of the sentence.

また、考察名「質問」についてみると、「30代女性」の属性グループのみが「判定結果＝true」となっているため、対応のテンプレートである「『［意見］』という質問が数多く出現しています。」が取り出される。
つぎに、「30代女性」の属性グループに係る文集合の中から＜質問＞の意図類型タグが付与された最多代表意見が抽出され、意見挿入欄に挿入される。
最後に、文頭に「属性『30代女性では、』」が挿入されることにより、考察文が完成される。 In addition, as for the consideration name “question”, only the attribute group of “female 30s” has “judgment result = true”, so a lot of questions “[Opinion]”, which is the corresponding template, appear. Is taken out.
Next, the most representative opinion with the <question> intent type tag is extracted from the sentence set related to the attribute group of “30's female” and inserted into the opinion insertion column.
Finally, the consideration sentence is completed by inserting “attribute“ in 30's female ”” at the beginning of the sentence.

これに対し考察名「要望」の場合には、「20代男性」のみならず「40代女性」の属性グループについても「判定結果＝true」となっているため、対応のテンプレートに基づいて属性別考察文が生成されることはない。 On the other hand, in the case of the consideration name “request”, since “judgment result = true” is obtained for the attribute group of “male in the 20s” as well as “male in the 40s”, the attribute is based on the corresponding template. Another consideration sentence is not generated.

図２２においては、図示の便宜上、20代女性〜40代男性の属性グループのみが示されているが、実際には全ての属性グループについて上記の判定処理が実行され、一の属性グループについてのみ合致する考察ルールのテンプレートに基づいて考察文が生成される。 In FIG. 22, only the attribute group of women in their 20s to 40s is shown for convenience of illustration, but in reality, the above determination processing is executed for all attribute groups, and only one attribute group is matched. A consideration sentence is generated based on the template of the consideration rule.

上記のように、一の属性グループについてのみ合致する考察ルールのテンプレートに基づいて考察文が生成されるため、ユーザは各属性グループに特有な評価傾向を的確に認識することが可能となる。
ただし、複数の属性グループにマッチする場合であっても、それぞれの属性グループについて考察文を生成するように、条件を緩和してこのシステム10を運用することも当然に可能である。 As described above, since the consideration sentence is generated based on the consideration rule template that matches only one attribute group, the user can accurately recognize the evaluation tendency unique to each attribute group.
However, even when a plurality of attribute groups are matched, it is naturally possible to operate the system 10 with relaxed conditions so that a consideration sentence is generated for each attribute group.

上記においては、「年代×性別」の属性グループ毎に考察文を生成する例を説明したが、他の属性（例えば回答者の職業、居住地域等）単位で考察文を生成することもできる。 In the above description, an example in which a consideration sentence is generated for each attribute group of “age × gender” has been described. However, a consideration sentence can be generated in units of other attributes (for example, the occupation of the respondent, the residential area, etc.).

つぎに、図１６に示した時系列解析に際して表示される考察文72の生成方法について説明する。
まず分析結果評価部21は、図２３(a)に示すように、時系列変化グラフ71中の変化点Ｐを中心にした所定期間（例えば１週間）を、着目期間と認定する。
つぎに分析結果評価部21は、着目期間内の分析対象文について考察ルールを適用する。
同時に分析結果評価部21は、着目期間をも含めた全期間内の分析対象文について考察ルールを適用する。
そして、着目期間のみに該当する考察ルールのテンプレートに基づいて、考察文を生成する。 Next, a method for generating the consideration sentence 72 displayed in the time series analysis shown in FIG. 16 will be described.
First, as shown in FIG. 23A, the analysis result evaluation unit 21 recognizes a predetermined period (for example, one week) centered on the change point P in the time-series change graph 71 as the period of interest.
Next, the analysis result evaluation unit 21 applies the consideration rule to the analysis target sentence within the target period.
At the same time, the analysis result evaluation unit 21 applies the consideration rule to the analysis target sentences within the entire period including the period of interest.
Then, a consideration sentence is generated based on a consideration rule template corresponding only to the period of interest.

例えば図２３(b)に示すように、考察名「かなり不満」の考察ルールについてみると、着目期間のみが「true」で全期間は「false」であるため、「ネガティブな意見が非常に多く、意見の［比率］％を占めており要注意です。不満意見として『［意見］』が多く出現しています。」のテンプレートが引用され、［比率］及び［意見］に具体的な数値や文字列（最多代表意見）が挿入されることによって、考察文が完成される。 For example, as shown in FIG. 23 (b), regarding the consideration rule of the consideration name “substantially dissatisfied”, only the period of interest is “true” and the whole period is “false”. , Which accounts for [ratio]% of opinions, and the template of “[opinion]” appears frequently as a dissatisfied opinion. A consideration sentence is completed by inserting a character string (most representative opinion).

このように、全期間内の分析対象文に基づく判定結果と着目期間内の分析対象文に基づく判定結果とを比較し、後者のみに適合する考察ルールのテンプレートに基づいて考察文を生成することにより、着目期間に特有の考察文を導くことが可能となる。
このためユーザは、変化点付近においける特異な評価傾向を容易に認識することができる。 In this way, the determination result based on the analysis target sentence within the entire period is compared with the determination result based on the analysis target sentence within the target period, and the consideration sentence is generated based on the template of the consideration rule that only suits the latter. This makes it possible to derive a consideration sentence peculiar to the period of interest.
Therefore, the user can easily recognize the unique evaluation tendency in the vicinity of the change point.

なお、考察名「不満」に関しても、着目期間のみが「true」で全期間は「false」という判定結果が示されているが、優先度が劣るためこの考察ルールに係るテンプレートが適用されることはない。 In addition, regarding the consideration name “dissatisfied”, only the period of interest is “true” and the determination result is “false” for all periods. However, because the priority is inferior, the template related to this consideration rule is applied. There is no.

ユーザは、時系列解析結果画面70または自動考察結果画面80の「分析結果」ボタン85をクリックすることにより、分析結果画面60に戻ることができる。 The user can return to the analysis result screen 60 by clicking the “analysis result” button 85 on the time series analysis result screen 70 or the automatic consideration result screen 80.

上記においては、この発明に係る感性分析システム10を、Webブラウザを搭載したクライアント端末28に各種画面（HTMLファイル）がWebサーバ26経由で送信されるWebシステムとして具体化した例を説明したが、このシステム10を実現するためのコンピュータプログラムをPC等のコンピュータにセットアップした、所謂スタンドアロン型のシステムとして具体化することも当然に可能である。 In the above, the sensitivity analysis system 10 according to the present invention has been described as an example in which the various screens (HTML files) are transmitted via the Web server 26 to the client terminal 28 equipped with the Web browser. Of course, it is also possible to implement a so-called stand-alone system in which a computer program for realizing the system 10 is set up in a computer such as a PC.

この発明に係る感性分析システムの機能構成を示すブロック図である。It is a block diagram which shows the function structure of the Kansei analysis system which concerns on this invention. 分析条件指定画面の一例を示す図である。It is a figure which shows an example of an analysis condition designation | designated screen. 感性分析システムにおける処理手順を示すフローチャートである。It is a flowchart which shows the process sequence in a sensitivity analysis system. ネガ／ポジ判定処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of a negative / positive determination process. 感性辞書の構成例を示す模式図である。It is a schematic diagram which shows the structural example of a sensitivity dictionary. 汎用辞書群に属する単語辞書の登録データを示すテーブルである。It is a table which shows the registration data of the word dictionary which belongs to a general purpose dictionary group. 具体的な文に対するネガ／ポジ判定処理の実行例を示す図である。It is a figure which shows the execution example of the negative / positive determination process with respect to a specific sentence. 具体的な文に対する主題継承処理の実行例を示す図である。It is a figure which shows the example of execution of the subject inheritance process with respect to a specific sentence. 意図表現記憶部の登録データを示すテーブルである。It is a table which shows the registration data of an intention expression memory | storage part. 具体的な文に対する意図表現抽出処理の実行例を示す図である。It is a figure which shows the execution example of the intention expression extraction process with respect to a specific sentence. 分析結果画面の一例を示す図である。It is a figure which shows an example of an analysis result screen. 分析結果画面の一例を示す図である。It is a figure which shows an example of an analysis result screen. 分析結果画面の生成に係る処理手順を示すフローチャートである。It is a flowchart which shows the process sequence which concerns on the production | generation of an analysis result screen. 評価軸類似表現辞書の登録データを示すテーブルである。It is a table which shows the registration data of an evaluation axis similarity expression dictionary. 代表意見抽出の手順を示す図である。It is a figure which shows the procedure of representative opinion extraction. 時系列解析結果画面を示す図である。It is a figure which shows a time series analysis result screen. 時系列解析結果画面の生成に係る処理手順を示すフローチャートである。It is a flowchart which shows the process sequence which concerns on the production | generation of a time series analysis result screen. 各種時系列解析モデルの特徴を示す図である。It is a figure which shows the characteristic of various time series analysis models. 自動考察結果画面を示す図である。It is a figure which shows an automatic consideration result screen. 考察ルールの一例を示す図である。It is a figure which shows an example of a consideration rule. 考察文の生成手順を示す図である。It is a figure which shows the production | generation procedure of a consideration sentence. 属性別考察文の生成手順を示す図である。It is a figure which shows the production | generation procedure of the attribute-specific consideration sentence. 時系列解析時の変化点に係る考察文の生成手順を示す図である。It is a figure which shows the production | generation procedure of the consideration sentence which concerns on the change point at the time series analysis.

10 感性分析システム
11 文書登録部
12 文書記憶部
13 文書解析部
14 ネガ／ポジ判定部
15 感性辞書記憶部
16 否定表現記憶部
17 主題認定部
18 意図抽出部
19 意図表現記憶部
20 分析結果記憶部
21 分析結果評価部
22 評価軸類似表現記憶部
23 時系列解析部
24 考察ルール記憶部
25 画面生成部
26 Webサーバ
27 インターネット
28 クライアント端末
40 分析条件指定画面
41 分析対象ファイル指定欄
42 評価軸指定欄
43 参照ボタン
44 分析開始ボタン
50 汎用辞書群
51 単語辞書
52 二項関係辞書
53 慣用句辞書
54 擬態語辞書
55 ドメイン別辞書群
56 単語辞書
57 二項関係辞書
58 擬態語辞書
60 分析結果画面
61 ネガ・ポジ分布グラフ
62 意図分類チャート
63 代表意見リスト
64 代表意見の内訳リスト
65 「選択」ボタン
66 「時系列解析結果」ボタン
70 時系列解析結果画面
71 時系列変化グラフ
72 考察文
73 期間指定欄
74 対象データ指定欄
75 解析モデル指定欄
76 「再表示」ボタン
77 「自動考察結果」ボタン
80 自動考察結果画面
81 考察文表示欄
85 「分析結果」ボタン 10 Kansei analysis system
11 Document Registration Department
12 Document storage
13 Document Analysis Department
14 Negative / Positive judgment part
15 Sensitivity dictionary storage
16 Negative expression storage
17 Subject Certification Department
18 Intention extraction unit
19 Intention expression storage
20 Analysis result storage
21 Analysis result evaluation department
22 Evaluation axis similarity expression storage
23 Time Series Analysis Department
24 Consideration rule memory
25 Screen generator
26 Web server
27 Internet
28 Client terminal
40 Analysis condition specification screen
41 Analysis target file specification column
42 Evaluation axis specification field
43 Browse button
44 Start analysis button
50 general dictionaries
51 word dictionary
52 Binary relational dictionary
53 Phrasebook Dictionary
54 Mimicry Dictionary
55 Domain-specific dictionaries
56 word dictionary
57 Binary relational dictionary
58 Mimicry Dictionary
60 Analysis result screen
61 Negative / Positive Distribution Graph
62 Intention classification chart
63 Representative Opinion List
64 Breakdown list of representative opinions
65 Select button
66 Time Series Analysis Results button
70 Time Series Analysis Result Screen
71 Time series change graph
72 Thoughts
73 Period designation field
74 Target data specification column
75 Analysis model specification field
76 Refresh button
77 “Automatic Result” button
80 Automatic consideration result screen
81 Discussion text display column
85 “Result” button

Claims

事物に対する肯定／否定の価値判断を表す感性用語と、肯定／否定の何れであるかを示す属性との組合せを格納しておく感性辞書記憶手段と、
文書ファイルに対して形態素処理を施し、当該文書ファイルに含まれる複数のテキスト文を形態素単位に分解する形態素解析手段と、
各文に含まれる形態素または形態素の組合せからなる表現が、上記感性辞書記憶手段に格納された感性用語に該当するか否かを判定し、該当する場合には当該表現に対して組み合わされている上記肯定／否定の何れかの属性を示す属性タグを関連付ける手段と、
否定文を構成する際に用いられる否定表現を格納しておく否定表現記憶手段と、
各文に含まれる形態素または形態素の組合せからなる表現が、上記否定表現記憶手段に格納された否定表現に該当するか否かを判定し、該当する場合には当該表現に対して否定表現であることを示す否定表現タグを関連付ける手段と、
各文に関連付けられた属性タグの種類と、否定表現タグの有無との組合せパターンに基づいて、当該文が否定評価文か肯定評価文かを判定し、否定評価文に対してはネガ判定タグを付与すると共に、肯定評価文に対してはポジ判定タグを付与する手段と、
各文中の主格を構成するための助詞を探索し、当該助詞が付属している自立語を主題と認定する主題認定手段と、
発言者の発言意図を表す意図表現と、その意図の類型との組合せを格納しておく意図表現記憶手段と、
上記ネガ判定タグが付与された文について「不満」を意味する意図類型タグを関連付けると共に、上記ポジ判定タグが付与された文について「好評」を意味する意図類型タグを関連付け、さらに、各文に含まれる形態素または形態素の組合せからなる表現が、上記意図表現辞書記憶手段に格納された意図表現に該当するか否かを判定し、該当する場合には当該表現に対して対応の類型を示す意図類型タグを関連付ける手段と、
複数種類の評価軸を示す文字列と、各評価軸に包含される類似表現との対応関係を格納しておく評価軸類似表現記憶手段と、
各文に認定された主題が、上記評価軸類似表現記憶手段に格納された類似表現に該当するか否かを判定し、該当する場合には文に対して対応の評価軸を関連付ける手段と、
各文を、共通の評価軸、意図類型及び感性用語を備えた類似文の集合である類似文グループ毎に分類すると共に、各類似文グループ中で文字数が最も少ない類似文を代表意見と認定する手段と、
複数の代表意見が列記された代表意見リストを生成する手段と、
この代表意見リストを出力する手段と、
を備えたことを特徴とする感性分析システム。 A sensitivity dictionary storage means for storing a combination of a sensitivity term representing an affirmative / negative value judgment for an object and an attribute indicating whether it is affirmative / negative;
A morpheme analyzing unit that performs morpheme processing on a document file and decomposes a plurality of text sentences included in the document file into morpheme units;
It is determined whether or not an expression composed of a morpheme or a combination of morphemes included in each sentence corresponds to a sensitivity term stored in the sensitivity dictionary storage unit, and if applicable, is combined with the expression. Means for associating an attribute tag indicating any of the positive / negative attributes;
Negative expression storage means for storing negative expressions used when constructing negative sentences;
It is determined whether or not an expression composed of a morpheme or a combination of morphemes included in each sentence corresponds to a negative expression stored in the negative expression storage means, and if so, is a negative expression for the expression. Means for associating a negative expression tag indicating
Based on the combination pattern of the attribute tag type associated with each sentence and the presence / absence of a negative expression tag, it is determined whether the sentence is a negative evaluation sentence or a positive evaluation sentence, and for negative evaluation sentences, a negative determination tag And a means for adding a positive determination tag to a positive evaluation sentence,
Searching for a particle for constructing the main character in each sentence, and subject recognition means that recognizes an independent word attached to the particle as a subject;
An intention expression storage means for storing a combination of an intention expression representing a speaker's intention and a type of the intention;
The sentence with the negative determination tag is associated with an intention type tag that means “dissatisfied”, and the sentence with the positive determination tag is associated with an intention type tag that means “popular”. It is determined whether or not an expression composed of a morpheme or a combination of morphemes corresponds to an intention expression stored in the intention expression dictionary storage means, and if so, an intention indicating a corresponding type for the expression Means for associating a type tag;
An evaluation axis similar expression storage means for storing a correspondence relationship between a character string indicating a plurality of types of evaluation axes and a similar expression included in each evaluation axis;
A means for determining whether the subject recognized in each sentence corresponds to the similar expression stored in the evaluation axis similar expression storage means, and, if applicable, associating a corresponding evaluation axis with the sentence;
Each sentence is classified into a similar sentence group that is a set of similar sentences with a common evaluation axis, intention type, and sentiment terms, and the similar sentence with the smallest number of characters in each similar sentence group is recognized as a representative opinion. Means,
Means for generating a representative opinion list in which a plurality of representative opinions are listed;
Means for outputting this representative opinion list;
Kansei analysis system characterized by having.

上記代表意見リスト中の各代表意見には、それぞれの類似文グループに含まれる類似文の件数が付記されていることを特徴とする請求項１に記載の感性分析システム。 The sensitivity analysis system according to claim 1, wherein the number of similar sentences included in each similar sentence group is appended to each representative opinion in the representative opinion list.

上記代表意見リスト中の一の代表意見が選択された場合に、対応の類似文グループに属する個々の類似文を列記した内訳リストを生成する手段と、
この内訳リストを出力する手段とを備えたことを特徴とする請求項１または２に記載の感性分析システム。 Means for generating a breakdown list listing individual similar sentences belonging to a corresponding similar sentence group when one representative opinion in the representative opinion list is selected;
The sensibility analysis system according to claim 1 or 2, further comprising means for outputting the breakdown list.

上記内訳リスト中の一の類似文が選択された場合に、当該類似文を含む文書の全文を出力する手段を備えたことを特徴とする請求項３に記載の感性分析システム。 4. The sensibility analysis system according to claim 3, further comprising means for outputting a full sentence of a document including the similar sentence when one similar sentence in the breakdown list is selected.

コンピュータを、
事物に対する肯定／否定の価値判断を表す感性用語と、肯定／否定の何れであるかを示す属性との組合せを格納しておく感性辞書記憶手段、
文書ファイルに対して形態素処理を施し、当該文書ファイルに含まれる複数のテキスト文を形態素単位に分解する形態素解析手段、
各文に含まれる形態素または形態素の組合せからなる表現が、上記感性辞書記憶手段に格納された感性用語に該当するか否かを判定し、該当する場合には当該表現に対して組み合わされている上記肯定／否定の何れかの属性を示す属性タグを関連付ける手段、
否定文を構成する際に用いられる否定表現を格納しておく否定表現記憶手段、
各文に含まれる形態素または形態素の組合せからなる表現が、上記否定表現記憶手段に格納された否定表現に該当するか否かを判定し、該当する場合には当該表現に対して否定表現であることを示す否定表現タグを関連付ける手段、
各文に関連付けられた属性タグの種類と、否定表現タグの有無との組合せパターンに基づいて、当該文が否定評価文か肯定評価文かを判定し、否定評価文に対してはネガ判定タグを付与すると共に、肯定評価文に対してはポジ判定タグを付与する手段、
各文中の主格を構成するための助詞を探索し、当該助詞が付属している自立語を主題と認定する主題認定手段、
発言者の発言意図を表す意図表現と、その意図の類型との組合せを格納しておく意図表現記憶手段、
上記ネガ判定タグが付与された文について「不満」を意味する意図類型タグを関連付けると共に、上記ポジ判定タグが付与された文について「好評」を意味する意図類型タグを関連付け、さらに、各文に含まれる形態素または形態素の組合せからなる表現が、上記意図表現辞書記憶手段に格納された意図表現に該当するか否かを判定し、該当する場合には当該表現に対して対応の類型を示す意図類型タグを関連付ける手段、
複数種類の評価軸を示す文字列と、各評価軸に包含される類似表現との対応関係を格納しておく評価軸類似表現記憶手段、
各文に認定された主題が、上記評価軸類似表現記憶手段に格納された類似表現に該当するか否かを判定し、該当する場合には文に対して対応の評価軸を関連付ける手段、
各文を、共通の評価軸、意図類型及び感性用語を備えた類似文の集合である類似文グループ毎に分類すると共に、各類似文グループ中で文字数が最も少ない類似文を代表意見と認定する手段、
複数の代表意見が列記された代表意見リストを生成する手段、
この代表意見リストを出力する手段、
として機能させることを特徴とする感性分析プログラム Computer
A sensitivity dictionary storage means for storing a combination of a sensitivity term representing an affirmative / negative value judgment for an object and an attribute indicating whether it is positive / negative;
Morpheme analysis means for performing morpheme processing on a document file and decomposing a plurality of text sentences included in the document file into morpheme units;
It is determined whether or not an expression composed of a morpheme or a combination of morphemes included in each sentence corresponds to a sensitivity term stored in the sensitivity dictionary storage unit, and if applicable, is combined with the expression. Means for associating an attribute tag indicating any of the positive / negative attributes;
Negative expression storage means for storing negative expressions used when constructing negative sentences,
It is determined whether or not an expression composed of a morpheme or a combination of morphemes included in each sentence corresponds to a negative expression stored in the negative expression storage means, and if so, is a negative expression for the expression. Means for associating a negative expression tag indicating
Based on the combination pattern of the attribute tag type associated with each sentence and the presence / absence of a negative expression tag, it is determined whether the sentence is a negative evaluation sentence or a positive evaluation sentence, and for negative evaluation sentences, a negative determination tag And a means for adding a positive determination tag to a positive evaluation sentence,
Search for a particle for composing the main character in each sentence, and determine the subject recognition means to recognize the independent word attached with the particle as the subject,
An intention expression storage means for storing a combination of an intention expression representing a speaker's intention and a type of the intention;
The sentence with the negative determination tag is associated with an intention type tag that means “dissatisfied”, and the sentence with the positive determination tag is associated with an intention type tag that means “favored”. It is determined whether or not an expression composed of a morpheme or a combination of morphemes corresponds to an intention expression stored in the intention expression dictionary storage means, and if so, an intention indicating a corresponding type for the expression Means for associating type tags,
Evaluation axis similar expression storage means for storing a correspondence relationship between a character string indicating a plurality of types of evaluation axes and a similar expression included in each evaluation axis;
Means for determining whether the subject recognized in each sentence corresponds to the similar expression stored in the evaluation axis similar expression storage means, and in this case, associating a corresponding evaluation axis with the sentence;
Each sentence is classified into a similar sentence group that is a set of similar sentences with a common evaluation axis, intention type, and sentiment terms, and the similar sentence with the smallest number of characters in each similar sentence group is recognized as a representative opinion. means,
Means for generating a representative opinion list in which a plurality of representative opinions are listed;
A means for outputting this representative opinion list,
Kansei analysis program characterized by functioning as