JP2016139229A

JP2016139229A - Device and program for generating personal profile, and content recommendation device

Info

Publication number: JP2016139229A
Application number: JP2015013057A
Authority: JP
Inventors: 苗村　昌秀; Masahide Naemura; 昌秀苗村; 山内　結子; Yuiko Yamauchi; 結子山内
Original assignee: Nippon Hoso Kyokai NHK
Current assignee: Japan Broadcasting Corp
Priority date: 2015-01-27
Filing date: 2015-01-27
Publication date: 2016-08-04
Anticipated expiration: 2035-01-27
Also published as: JP6450203B2

Abstract

PROBLEM TO BE SOLVED: To provide a content recommendation device capable of generating a personal profile on which a wide personal preference is reflected.SOLUTION: A content recommendation device 2 comprises initial graph generation means 10 for generating a first initial graph, grouping means 20 for recursively grouping the first initial graph, tree structure graph generation means 30 for generating a first group tree structure graph from a first grouped graph on the basis of the position information of the first grouped graph, profile generation means 50 for generating a personal profile by transmitting weight expressing personal preference to a node of the first group tree structure graph, and content recommendation means 70 for determining whether or not to recommend content of a recommendation object to an individual person.SELECTED DRAWING: Figure 2

Description

本願発明は、キーワード同士がリンクされたリンク付きコーパスから、個人の嗜好に適応した個人プロファイルを生成する個人プロファイル生成装置及びそのプログラム、並びに、コンテンツ推薦装置に関する。 The present invention relates to a personal profile generation device that generates a personal profile adapted to individual preferences from a linked corpus in which keywords are linked to each other, a program thereof, and a content recommendation device.

従来より、番組やＷｅｂページなどのコンテンツへのアクセス履歴から、個人の嗜好をモデル化した個人プロファイルを生成し、個人の嗜好に応じたコンテンツを推薦する技術が提案されている。 2. Description of the Related Art Conventionally, a technique has been proposed in which a personal profile that models personal preferences is generated from access histories to content such as programs and Web pages, and content according to personal preferences is recommended.

例えば、特許文献１に記載の発明は、コンテンツに関するキーワードについての嗜好度を個別に計算し、視聴中に特化した番組推薦を行うものである。このとき、特許文献１に記載の発明では、視聴中のコンテンツだけではなく、視聴していない裏番組から抽出したキーワードとの相対関係から、キーワードについての嗜好度の精度を向上させている。 For example, the invention described in Patent Document 1 individually calculates a degree of preference for a keyword related to content, and performs program recommendation specialized during viewing. At this time, in the invention described in Patent Document 1, the accuracy of the preference degree of the keyword is improved not only from the content being viewed but also from the relative relationship with the keyword extracted from the back program that is not viewed.

また、特許文献２に記載の発明は、視聴者のニッチな嗜好を反映したコンテンツを推薦するため、番組の出演者情報を解析して出演者間の関係度合いを算出し、登場が少ない出演者にも視聴者の嗜好を反映させている。 In addition, the invention described in Patent Document 2 recommends content reflecting the niche preferences of viewers, and therefore analyzes the performer information of the program to calculate the degree of relationship between performers and performers with few appearances. It also reflects viewer preferences.

また、特許文献３に記載の発明は、視聴者の嗜好を反映したコンテンツ推薦を行うため、アクセスしたコンテンツから抽出したキーワード毎にヒストグラムを算出し、そのヒストグラムに基づいてコンテンツを推薦するものである。 The invention described in Patent Document 3 calculates a histogram for each keyword extracted from accessed content and recommends the content based on the histogram in order to recommend content reflecting the viewer's preference. .

特開２００６−２５９８９７号公報JP 2006-259897 A 特開２００８−１９９４０６号公報JP 2008-199406 A 特開２０１１−１０７８６９号公報JP 2011-107869 A

前記した従来技術は、視聴者がアクセスしたコンテンツに関するキーワードを抽出して、それらキーワードに基づいて推薦を行っている。つまり、従来技術では、抽出したキーワードの意味を解釈せずに単なるシンボルとして処理するため、概念空間へのマッピングが伴っていない。
なお、概念空間とは、物事や事象の概念が付与された疑似的な空間である。 In the above-described conventional technology, keywords related to content accessed by the viewer are extracted, and recommendation is performed based on the keywords. That is, in the prior art, since the meaning of the extracted keyword is not interpreted, it is processed as a mere symbol, and therefore, mapping to the concept space is not accompanied.
The concept space is a pseudo space to which the concept of things and events is given.

このため、従来技術では、抽出されたキーワードと概念的に同一であるが、抽出されなかった未知のキーワードを扱うことができない。その結果、従来技術では、広範な視聴者の嗜好が反映された個人プロファイルを生成できず、視聴者の嗜好に適したコンテンツを推薦できていないという問題がある。 For this reason, in the prior art, unknown keywords that are conceptually identical to the extracted keywords but not extracted cannot be handled. As a result, the conventional technology has a problem that a personal profile reflecting a wide range of viewer preferences cannot be generated, and content suitable for the viewer's preferences cannot be recommended.

そこで、本願発明は、広範な個人の嗜好が反映された個人プロファイルを生成できる個人プロファイル生成装置及びそのプログラム、並びに、コンテンツ推薦装置を提供することを課題とする。 SUMMARY OF THE INVENTION An object of the present invention is to provide a personal profile generation device capable of generating a personal profile reflecting a wide range of personal preferences, a program thereof, and a content recommendation device.

前記した課題に鑑みて、本願発明に係る個人プロファイル生成装置は、第１キーワード同士がリンクされたリンク付きコーパスから、個人の嗜好に適応した個人プロファイルを生成する個人プロファイル生成装置であって、初期グラフ生成手段と、グループ化手段と、グループ木構造グラフ生成手段と、プロファイル生成手段と、を備える構成とした。 In view of the above-described problems, the personal profile generation device according to the present invention is a personal profile generation device that generates a personal profile adapted to personal preference from a linked corpus in which first keywords are linked to each other. The configuration includes a graph generation unit, a grouping unit, a group tree structure graph generation unit, and a profile generation unit.

かかる構成によれば、個人プロファイル生成装置は、初期グラフ生成手段によって、リンク付きコーパスの第１キーワードをノードとして、リンクされた第１キーワード同士をエッジとして接続した第１初期グラフを生成する。
個人プロファイル生成装置は、グループ化手段によって、予め設定された終了条件を満たすまで第１初期グラフを再帰的にグループ化することで、木構造におけるノードの位置を表した位置情報が付加された第１グループ化グラフを生成する。
個人プロファイル生成装置は、グループ木構造グラフ生成手段によって、位置情報に基づいて第１グループ化グラフの各ノードを配置することで、第１グループ化グラフから、上位概念及び下位概念となるノード同士が木構造で階層化された第１グループ木構造グラフを生成する。 According to this configuration, the personal profile generation apparatus generates a first initial graph in which the first keyword of the linked corpus is connected as a node and the linked first keywords are connected as an edge by the initial graph generation unit.
The personal profile generation device regroups the first initial graph recursively until a predetermined termination condition is satisfied by the grouping means, so that the position information representing the position of the node in the tree structure is added. One grouped graph is generated.
The personal profile generation device arranges each node of the first grouping graph based on the position information by the group tree structure graph generation unit, so that the nodes that are the higher concept and the lower concept are found from the first grouping graph. A first group tree structure graph hierarchized by a tree structure is generated.

このように、個人プロファイル生成装置は、リンク付きコーパスの第１キーワードが表す概念を階層的にグループ化することで、第１グループ木構造グラフを生成する。言い換えるなら、個人プロファイル生成装置は、キーワードの意味するところをシンボル化して、擬似的な概念空間にキーワードをマッピングする。このようにして、個人プロファイル生成装置は、従来技術で困難であった概念空間までの拡張を実現している。 In this way, the personal profile generation device generates the first group tree structure graph by hierarchically grouping the concepts represented by the first keywords of the linked corpus. In other words, the personal profile generation apparatus symbolizes the meaning of the keyword and maps the keyword to the pseudo concept space. In this way, the personal profile generation device realizes expansion to a concept space that has been difficult in the prior art.

個人プロファイル生成装置は、プロファイル生成手段によって、個人が興味を持つコンテンツの内容を表した個人興味情報に基づいて、個人の嗜好を表す重みを第１グループ木構造グラフのノードに伝搬させることで、個人プロファイルを生成する。 The personal profile generation device propagates the weight representing the personal preference to the nodes of the first group tree structure graph based on the personal interest information representing the content of the content that the individual is interested in by the profile generation unit. Generate a personal profile.

また、本願発明に係るコンテンツ推薦装置は、本願発明の個人プロファイル生成装置と、個人プロファイルと推薦対象プロファイルとの一致度を算出し、算出した一致度の閾値判定によって、推薦対象のコンテンツを個人に推薦するか否かを判定するコンテンツ推薦手段と、を備える構成とした。 The content recommendation device according to the present invention calculates the degree of coincidence between the personal profile generation device according to the present invention and the personal profile and the recommendation target profile, and determines the recommended content to the individual by determining the threshold value of the calculated degree of coincidence. And a content recommendation means for determining whether or not to recommend.

本願発明によれば、以下のような優れた効果を奏する。
本願発明に係る個人プロファイル生成装置は、擬似的な概念空間にキーワードをマッピングし、概念空間まで個人プロファイルを拡張できるので、広範な個人の嗜好が反映された個人プロファイルを生成することができる。
本願発明に係るコンテンツ推薦装置は、個人プロファイル生成装置が生成した個人プロファイルを用いて、視聴者の嗜好に適したコンテンツを推薦することができる。 According to the present invention, the following excellent effects can be obtained.
The personal profile generation apparatus according to the present invention can map a keyword to a pseudo concept space and expand the personal profile to the concept space, so that a personal profile reflecting a wide range of individual preferences can be generated.
The content recommendation device according to the present invention can recommend content suitable for the viewer's preference using the personal profile generated by the personal profile generation device.

本願発明の実施形態において、リンク付きコーパスを説明する説明図である。In embodiment of this invention, it is explanatory drawing explaining a corpus with a link. 本願発明の実施形態に係るコンテンツ推薦装置の構成を示すブロック図である。It is a block diagram which shows the structure of the content recommendation apparatus which concerns on embodiment of this invention. 本願発明の実施形態において、初期グラフの生成を説明する説明図である。In embodiment of this invention, it is explanatory drawing explaining the production | generation of an initial graph. 本願発明の実施形態において、初期グラフの初期化を説明する説明図である。In embodiment of this invention, it is explanatory drawing explaining initialization of an initial graph. 本願発明の実施形態において、１回目のグループ化処理を説明する説明図である。In embodiment of this invention, it is explanatory drawing explaining the grouping process of the 1st time. 本願発明の実施形態において、２回目のグループ化処理を説明する説明図である。In embodiment of this invention, it is explanatory drawing explaining the grouping process of the 2nd time. 本願発明の実施形態において、グループ木構造グラフの生成を説明する説明図である。In embodiment of this invention, it is explanatory drawing explaining the production | generation of a group tree structure graph. 本願発明の実施形態において、個人プロファイルの生成を説明する説明図である。In an embodiment of the invention in this application, it is an explanatory view explaining generation of a personal profile. 図２の個人プロファイル生成装置によるグループ木構造グラフ生成処理を示すフローチャートである。It is a flowchart which shows the group tree structure graph production | generation process by the personal profile production | generation apparatus of FIG. 図２の個人プロファイル生成装置によるプロファイル生成処理を示すフローチャートである。It is a flowchart which shows the profile production | generation process by the personal profile production | generation apparatus of FIG. 図２のコンテンツ推薦装置によるコンテンツ推薦処理を示すフローチャートである。It is a flowchart which shows the content recommendation process by the content recommendation apparatus of FIG.

以下、本願発明の実施形態について、適宜図面を参照しながら詳細に説明する。
まず、本願発明の実施形態に係るコンテンツ推薦装置２で利用されるリンク付きコーパスを説明した後、コンテンツ推薦装置２の構成を説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings as appropriate.
First, after explaining the corpus with a link used in the content recommendation device 2 according to the embodiment of the present invention, the configuration of the content recommendation device 2 will be described.

［リンク付きコーパス］
リンク付きコーパスとは、キーワード同士がリンク付けられた辞書コーパスのことである。リンク付きコーパスの代表例としては、ウィキペディアがあげられる。ウィキペディアは、キーワード間のリンク、及び、カテゴリに属する上位概念や下位概念のキーワードが管理されているので、リンク付きコーパスとして簡単に利用できる。 [Corpus with link]
A linked corpus is a dictionary corpus in which keywords are linked to each other. A typical example of a corpus with links is Wikipedia. Since Wikipedia manages links between keywords and keywords of higher concepts and lower concepts belonging to a category, it can be easily used as a corpus with links.

例えば、ウィキペディアには、俳優“○○○○”に関するページがある。図１のように、俳優“○○○○”のページの下部には、この俳優が属するカテゴリのキーワードとして、“日本の俳優”、“日本の歌手”、“△△市出身の人物”などが表示されている。 For example, Wikipedia has a page related to the actor “XXX”. As shown in Fig. 1, at the bottom of the page for the actor “XXX”, keywords of the category to which the actor belongs include “Japanese actor”, “Japanese singer”, “Person from △△ city”, etc. Is displayed.

ここで、各カテゴリでは、そのカテゴリに含まれる、上位概念又は下位概念となるキーワード同士がリンクされている。例えば、カテゴリのキーワード“日本の俳優”をクリックすると、“主要カテゴリ＞…＞俳優＞各国の俳優＞日本の俳優”と、“主要カテゴリ＞…＞日本の人物＞日本の芸能人＞日本の俳優”とが表示される。 Here, in each category, keywords that are included in the category and that are higher concepts or lower concepts are linked. For example, if you click on the keyword “Japanese actors” in the category, “Main categories>…> Actors> International actors> Japanese actors” and “Main categories>…> Japanese characters> Japanese entertainers> Japanese actors” Is displayed.

主要カテゴリの不等号“＞”は、左側のキーワードが右側のキーワードの上位概念であることを表す。１つ目のカテゴリでは、キーワード“俳優”及び“各国の俳優”が上位概念及び下位概念の関係でリンクされ、キーワード“各国の俳優”及び“日本の俳優”も上位概念及び下位概念の関係でリンクされている。また、２つ目のカテゴリでは、キーワード“日本の人物”及び“日本の芸能人”が上位概念及び下位概念の関係でリンクされ、キーワード“日本の芸能人”及び“日本の俳優”も上位概念及び下位概念の関係でリンクされている。 The inequality sign “>” in the main category indicates that the left keyword is a superordinate concept of the right keyword. In the first category, the keywords “actors” and “actors of each country” are linked in relation to the superordinate concepts and subordinate concepts, and the keywords “actors in each country” and “Japanese actors” are also related to the superordinate concepts and subordinate concepts. Linked. Also, in the second category, the keywords “Japanese characters” and “Japanese entertainers” are linked in the relationship between the superordinate concepts and subordinate concepts, and the keywords “Japanese entertainers” and “Japanese actors” are also superordinate concepts and subordinates. Linked in a conceptual relationship.

また、例えば、カテゴリのキーワード“△△市出身の人物”をクリックすると、“主要カテゴリ＞…＞△△県出身の人物＞△△市出身の人物＞△△市出身の力士”が表示される。つまり、キーワード“△△市出身の人物”は、“△△県出身の人物”という上位概念のキーワードにリンクされている。さらに、キーワード“△△市出身の人物”は、“△△市出身の力士”という下位概念のキーワードにリンクされている。 Further, for example, when the category keyword “a person from a city” is clicked, “main category>... . That is, the keyword “person from △△ city” is linked to the keyword of the superordinate concept “person from △△ prefecture”. Furthermore, the keyword “a person from ΔΔ city” is linked to a subordinate concept keyword “ΔΔ city-born wrestler”.

なお、ある主要カテゴリでは、上位概念及び下位概念となるキーワード同士が、別の主要カテゴリでは反対にリンクされることもある。このように、ウィキペディアでは、上位概念及び下位概念の関係に曖昧さが残り、擬似的な概念空間にキーワードをマッピングする際の妨げとなる。 In some main categories, keywords that are a superordinate concept and a subordinate concept may be linked oppositely in another main category. Thus, in Wikipedia, ambiguity remains in the relationship between the superordinate concept and the subordinate concept, which hinders the mapping of keywords to the pseudo concept space.

［コンテンツ推薦装置の概略］
図２を参照し、コンテンツ推薦装置２の概略について説明する。
コンテンツ推薦装置２は、個人プロファイル生成装置１がリンク付きコーパスから個人プロファイルを生成し、生成した個人プロファイルに基づいて、コンテンツの推薦を行うものである。 [Outline of content recommendation device]
The outline of the content recommendation device 2 will be described with reference to FIG.
The content recommendation device 2 is a device in which the personal profile generation device 1 generates a personal profile from a linked corpus and recommends content based on the generated personal profile.

コンテンツ推薦装置２では、大きく分けて、３つの処理が行われる。まず、個人プロファイル生成装置１は、リンク付きコーパスをグラフ化し、それを木構造グラフに変換する（グループ木構造グラフ生成処理）。このとき、個人プロファイル生成装置１は、グラフ解析における連結程度の粗密具合を判定するグループ化処理（コミュニティ検出）を再帰的に施す。 In the content recommendation device 2, three processes are roughly divided. First, the personal profile generation device 1 graphs a linked corpus and converts it into a tree structure graph (group tree structure graph generation processing). At this time, the personal profile generation device 1 recursively performs a grouping process (community detection) for determining the degree of connection in the graph analysis.

次に、個人プロファイル生成装置１は、各視聴者の嗜好を木構造グラフに反映した個人プロファイルを生成する（プロファイル生成処理）。このとき、コンテンツ推薦装置２は、木構造グラフを構成するノードに一致するキーワードを抽出し、そのノードに重みを加算すると同時に、木構造に沿ってその重みを伝搬する。これによって、各視聴者に適応した重み付き木構造グラフが生成され、個人プロファイルとして扱うことができる。 Next, the personal profile generation device 1 generates a personal profile that reflects each viewer's preference in the tree structure graph (profile generation processing). At this time, the content recommendation device 2 extracts a keyword that matches a node constituting the tree structure graph, adds a weight to the node, and simultaneously propagates the weight along the tree structure. Thus, a weighted tree structure graph adapted to each viewer is generated and can be handled as a personal profile.

最後に、コンテンツ推薦装置２は、個人プロファイル生成装置１が生成した個人プロファイルを用いて、コンテンツの推薦を行う（コンテンツ推薦処理）。このとき、コンテンツ推薦装置２は、新たに生成した推薦対象プロファイルがどの程度個人プロファイルに一致しているかどうかで、推薦の可否を判定する。 Finally, the content recommendation device 2 recommends content using the personal profile generated by the personal profile generation device 1 (content recommendation processing). At this time, the content recommendation device 2 determines whether or not the recommendation is possible depending on how much the newly generated recommendation target profile matches the personal profile.

［コンテンツ推薦装置の構成］
続いて、コンテンツ推薦装置２の構成について説明する。
図２のように、コンテンツ推薦装置２は、個人プロファイル生成装置１と、コンテンツ推薦手段７０とを備える。
個人プロファイル生成装置１は、初期グラフ生成手段１０と、グループ化手段２０と、グループ木構造グラフ生成手段３０と、グループ木構造グラフＤＢ４０と、プロファイル生成手段５０と、プロファイルＤＢ６０とを備える。 [Configuration of content recommendation device]
Next, the configuration of the content recommendation device 2 will be described.
As shown in FIG. 2, the content recommendation device 2 includes a personal profile generation device 1 and content recommendation means 70.
The personal profile generation device 1 includes an initial graph generation unit 10, a grouping unit 20, a group tree structure graph generation unit 30, a group tree structure graph DB 40, a profile generation unit 50, and a profile DB 60.

初期グラフ生成手段１０は、リンク付きコーパスが入力され、入力されたリンク付きコーパスのキーワード（第１キーワード）をノードとし、リンクされたキーワード同士をエッジとして接続した初期グラフ（第１初期グラフ、g_cat）を生成するものである。 The initial graph generation means 10 receives a linked corpus, an initial graph (first initial graph, g_cat) in which the keyword (first keyword) of the input linked corpus is used as a node and the linked keywords are connected as edges. ).

＜初期グラフの生成＞
図１，図３を参照し、初期グラフの生成について、詳細に説明する（適宜図２参照）。
この例では、ウィキペディアにおける俳優“○○○○”のページから、初期グラフＡを生成することとする。例えば、初期グラフ生成手段１０は、俳優“○○○○”のページのＨＴＭＬ（Hyper Text Markup Language）を解析し、このページの下部に表示されているカテゴリのキーワードを取得する。ここでは、初期グラフ生成手段１０は、カテゴリのキーワードとして、“日本の俳優”、“△△市出身の人物”を取得したこととし、“日本の歌手”については説明を省略する。 <Generation of initial graph>
The initial graph generation will be described in detail with reference to FIGS. 1 and 3 (see FIG. 2 as appropriate).
In this example, the initial graph A is generated from the page of the actor “XXX” on Wikipedia. For example, the initial graph generation means 10 analyzes HTML (Hyper Text Markup Language) of the page of the actor “XXX”, and acquires the keyword of the category displayed at the bottom of this page. Here, it is assumed that the initial graph generation means 10 has acquired “Japanese actors” and “persons from ΔΔ city” as category keywords, and the description of “Japanese singers” is omitted.

図１のように、初期グラフ生成手段１０は、カテゴリのキーワード“日本の俳優”のリンクを辿り、２つのカテゴリから、キーワード“俳優”、“各国の俳優”、“日本の人物”、“日本の芸能人”を取得する。さらに、初期グラフ生成手段１０は、カテゴリのキーワード“△△市出身の人物”のリンクを辿り、キーワード“△△県出身の人物”、“△△市出身の力士”を取得する。 As shown in FIG. 1, the initial graph generation means 10 follows the link of the keyword “Japanese actor” in the category, and from the two categories, the keywords “actor”, “actors in various countries”, “Japanese characters”, “Japan” The “celebrity”. Further, the initial graph generation means 10 follows the link of the keyword “Δ △ from the city” of the category, and acquires the keywords “person from Δ △ prefecture” and “wrestler from ΔΔ city”.

この場合、図３のように、キーワード“俳優”、“各国の俳優”、“日本の人物”、“日本の芸能人”、“日本の俳優”、“△△県出身の人物”、“△△市出身の人物”、“△△市出身の力士”のそれぞれがノードとなる。 In this case, as shown in FIG. 3, the keywords “actors”, “actors from various countries”, “Japanese characters”, “Japanese entertainers”, “Japanese actors”, “△△ characters from prefectures”, “△△ Each of “a person from the city” and “△△ a wrestler from the city” is a node.

また、キーワード“俳優”と“各国の俳優”とのリンク、及び、キーワード“各国の俳優”と“日本の俳優”とのリンクがエッジとなる。また、キーワード“日本の人物”と“日本の芸能人”とのリンク、及び、キーワード“日本の芸能人”と“日本の俳優”とのリンクがエッジとなる。さらに、キーワード“△△県出身の人物”と“△△市出身の人物” とのリンク、及び、キーワード“△△市出身の人物”と“△△市出身の力士”とのリンクがエッジとなる。 Also, the link between the keyword “actor” and “actor of each country” and the link between the keyword “actor of each country” and “Japanese actor” are the edges. In addition, the links between the keywords “Japanese characters” and “Japanese entertainers” and the links between the keywords “Japanese entertainers” and “Japanese actors” are the edges. Furthermore, the links between the keywords “△△ person from prefecture” and “△△ person from city” and the link between keyword “△△ person from city” and “△△ city-born wrestler” Become.

この他、別の主要カテゴリにおいて、キーワード“各国の俳優”と“日本の人物”とのリンク、キーワード“日本の人物”と“△△県出身の人物”のリンクが存在していたので、これらリンクもエッジとなる。
従って、初期グラフ生成手段１０は、前記したノードのそれぞれをエッジで接続し、図３のような初期グラフＡを生成する。 In addition, in another major category, there was a link between the keyword “actors from each country” and “Japanese people”, and a keyword “Japanese people” and “△ △ Prefecture-born people”. Links are also edges.
Therefore, the initial graph generation means 10 connects each of the nodes described above with an edge, and generates an initial graph A as shown in FIG.

図２に戻り、初期グラフ生成手段１０について、説明を続ける。
また、初期グラフ生成手段１０は、推薦対象情報が入力され、入力された推薦対象情報にキーワード抽出処理（例えば、形態素解析処理）を施し、推薦対象情報からキーワード（第３キーワード）を抽出する。そして、初期グラフ生成手段１０は、推薦対象情報の第３キーワードのうち、リンク付きコーパスのキーワードに一致するものを抽出する（名寄せ処理）。 Returning to FIG. 2, the description of the initial graph generation means 10 will be continued.
Further, the initial graph generation means 10 receives recommendation target information, performs keyword extraction processing (for example, morpheme analysis processing) on the input recommendation target information, and extracts a keyword (third keyword) from the recommendation target information. Then, the initial graph generation means 10 extracts the third keyword of the recommendation target information that matches the keyword of the linked corpus (name identification process).

なお、推薦対象情報とは、推薦対象となるコンテンツの内容を表す情報である。例えば、推薦対象情報としては、推薦対象となる放送番組について、ＥＰＧの番組内容情報又は字幕情報があげられる。 Note that the recommendation target information is information representing the content of the content to be recommended. For example, the recommendation target information includes EPG program content information or caption information for a broadcast program to be recommended.

さらに、初期グラフ生成手段１０は、抽出された第３キーワードをノードとして、リンク付きコーパスにおけるキーワード同士のリンクにより、第３キーワード同士をエッジとして接続した第２初期グラフを生成する。このとき、初期グラフ生成手段１０は、リンク付きコーパスのキーワードに一致する第３キーワードのみを抽出したため、リンク付きコーパスにおけるキーワード同士のリンクを利用して、第３キーワード同士を接続できる。
その後、初期グラフ生成手段１０は、生成した初期グラフＡ及び第２初期グラフをグループ化手段２０に出力する。 Furthermore, the initial graph generation means 10 generates a second initial graph in which the third keywords are connected as edges by using the extracted third keywords as nodes and by linking the keywords in the linked corpus. At this time, since the initial graph generation means 10 extracts only the third keyword that matches the keyword of the linked corpus, the third keyword can be connected using the links of the keywords in the linked corpus.
Thereafter, the initial graph generation unit 10 outputs the generated initial graph A and the second initial graph to the grouping unit 20.

グループ化手段２０は、初期グラフ生成手段１０から入力された初期グラフＡを再帰的にグループ化することで、位置情報が付加されたグループ化グラフ（第１グループ化グラフ、g_cat_tree）を生成するものである。 The grouping unit 20 generates a grouped graph (first grouped graph, g_cat_tree) to which position information is added by recursively grouping the initial graph A input from the initial graph generating unit 10. It is.

具体的には、グループ化手段２０は、ノード間の連結粗密情報による再帰的グループ化処理を初期グラフＡに施す。このとき、グループ化手段２０は、再帰的グループ化処理によって、後記する位置情報を各ノードに付加する。この再帰的グループ化処理は、下記の参考文献に記載されている。
参考文献：Vincent D Blondel et.al,“Fast unfolding of communities in large networks”,“Journal of Statistical Mechanics:Theory and Experiment 2008(10),P10008(12pp)” Specifically, the grouping means 20 performs a recursive grouping process on the initial graph A based on connection density information between nodes. At this time, the grouping means 20 adds position information described later to each node by recursive grouping processing. This recursive grouping process is described in the following references.
References: Vincent D Blondel et.al, “Fast unfolding of communities in large networks”, “Journal of Statistical Mechanics: Theory and Experiment 2008 (10), P10008 (12pp)”

なお、連結粗密情報とは、対象となるノードが属するグループを表す指標であり、ノード同士の類似度と言うこともできる。前記参考文献に記載の手法では、連結粗密情報として、モジュラリティ（modularity）と呼ばれる指標を用いて、コミュニティ検出を行っている。 Note that the connection density information is an index representing a group to which the target node belongs, and can also be referred to as the similarity between nodes. In the method described in the above-mentioned reference, community detection is performed using an index called modularity as the connection density information.

＜再帰的グループ化処理＞
図４〜図６を参照し、再帰的グループ化処理について、詳細に説明する（適宜図２参照）。
図４のように、グループ化手段２０は、図３の初期グラフＡのグループを初期化する。初期化された初期グラフＡでは、全てのノードが同一のグループＧ_１に属する。 <Recursive grouping>
The recursive grouping process will be described in detail with reference to FIGS. 4 to 6 (see FIG. 2 as appropriate).
As shown in FIG. 4, the grouping means 20 initializes the group of the initial graph A of FIG. In initialized initial graph A, all nodes belong to the same group G _1.

次に、グループ化手段２０は、図４の初期グラフＡに対し、グループＧ_１を親グループとして、１回目のグループ化処理を施す。このとき、グループ化手段２０は、各グループの親子関係（従属関係）を抽出し、連結部分が密なノード同士を子グループとして纏める。 Then, the grouping unit 20, with respect to initial graph A in FIG. 4, the group G ₁ as the parent group, subjected to a first grouping process. At this time, the grouping means 20 extracts the parent-child relationship (subordinate relationship) of each group, and collects nodes with densely connected portions as child groups.

図５のように、グループ化手段２０は、親グループＧ_１を、ノード“俳優”、“各国の俳優”、“日本の人物”、“日本の芸能人”、“日本の俳優”が纏まった子グループＧ_２と、ノード“△△県出身の人物”、“△△市出身の人物”、“△△市出身の力士”が纏まった子グループＧ_３とに分割する。 As shown in FIG. 5, the grouping means 20 is a child in which the parent group G ₁ is composed of nodes “actors”, “actors from various countries”, “Japanese characters”, “Japanese entertainers”, and “Japanese actors”. The group G ₂ is divided into the child group G _{3 in} which the nodes “a person from ΔΔ prefecture”, “a person from ΔΔ city”, and “wrestlers from ΔΔ city” are gathered.

次に、グループ化手段２０は、図５の初期グラフＡに対し、子グループであったグループＧ_２，Ｇ_３をそれぞれ親グループとして更新し、２回目のグループ化処理を施す。図６のように、グループ化手段２０は、親グループＧ_２を、ノード“俳優”、“各国の俳優”が纏まった子グループＧ_２１と、ノード“日本の人物”、“日本の芸能人”、“日本の俳優”が纏まった子グループＧ_２２とに分割する。
なお、グループＧ_３は、分割されなかったこととする。 Next, the grouping means 20 updates the groups G ₂ and G ₃ that are child groups as parent groups to the initial graph A in FIG. 5 and performs the second grouping process. As shown in FIG. 6, the grouping means 20 divides the parent group G ₂ into a node “actor”, a child group G _{21 in} which “actors of each country” are grouped, nodes “Japanese characters”, “Japanese entertainers”, "Japan of the actor" is divided into a child group G _22, which collectively.
The group G ₃ are, and that has not been divided.

このように、グループ化手段２０は、予め設定された終了条件を満たすまでグループ化処理を再帰的に施し、グループ化グラフＢを生成する。このとき、グループ化手段２０は、グループ化グラフＢのグループ数を、後記するグループ木構造グラフの階層数以下にしたい。そこで、終了条件は、グループ化処理の実行回数が予め設定された回数（例えば、グループ木構造グラフの最大階層数）を超えた場合、グループ化処理を終了するという条件で設定される。 In this way, the grouping means 20 recursively performs the grouping process until a preset end condition is satisfied, thereby generating a grouped graph B. At this time, the grouping means 20 wants the number of groups in the grouping graph B to be less than or equal to the number of layers in the group tree structure graph described later. Therefore, the termination condition is set under the condition that the grouping process is terminated when the number of execution times of the grouping process exceeds a preset number of times (for example, the maximum number of hierarchies of the group tree structure graph).

図２に戻り、グループ化手段２０について、説明を続ける。
また、グループ化手段２０は、初期グラフ生成手段１０から入力された第２初期グラフを再帰的にグループ化することで、位置情報が付加された第２グループ化グラフを生成する。ここで、グループ化手段２０は、グループ化グラフＢと同様の手順で、第２グループ化グラフを生成できるため、説明を省略する。
その後、グループ化手段２０は、生成したグループ化グラフＢ及び第２グループ化グラフをグループ木構造グラフ生成手段３０に出力する。 Returning to FIG. 2, the description of the grouping means 20 will be continued.
The grouping unit 20 recursively groups the second initial graph input from the initial graph generation unit 10 to generate a second grouped graph to which position information is added. Here, since the grouping means 20 can generate the second grouped graph in the same procedure as the grouped graph B, the description is omitted.
Thereafter, the grouping unit 20 outputs the generated grouped graph B and the second grouped graph to the group tree structure graph generating unit 30.

グループ木構造グラフ生成手段３０は、グループ化手段２０から入力されたグループ化グラフＢの各ノードに付加された位置情報に基づいて、各視聴者（各個人）に共通するグループ木構造グラフ（第１グループ木構造グラフ）を１つ生成するものである。
なお、グループ木構造グラフは、上位概念及び下位概念となるノード同士が木構造で階層化されたグラフである。 The group tree structure graph generation means 30 is based on the position information added to each node of the grouping graph B input from the grouping means 20 and the group tree structure graph (first) common to each viewer (each individual). One group tree structure graph) is generated.
Note that the group tree structure graph is a graph in which the nodes that are the upper concept and the lower concept are hierarchized in a tree structure.

＜グループ木構造グラフの生成＞
以下、図７を参照し、グループ木構造グラフの生成について、説明する（適宜図２参照）。 <Generation of group tree structure graph>
Hereinafter, generation of a group tree structure graph will be described with reference to FIG. 7 (see FIG. 2 as appropriate).

前記したように、グループ化グラフＢの各ノードには、再帰的グループ化処理により、位置情報が付加されている。この位置情報は、グループ木構造グラフにおけるノードの位置を表す情報であり、例えば、タプル形式で表される。タプル形式の位置情報は、左から順にカッコ内の各数値が、グループ木構造グラフの階層毎に辿るノードの位置を表している。 As described above, position information is added to each node of the grouping graph B by recursive grouping processing. This position information is information indicating the position of the node in the group tree structure graph, and is expressed in a tuple format, for example. The position information in the tuple format represents the position of the node in which each numerical value in parentheses traces for each hierarchy of the group tree structure graph in order from the left.

例えば、あるノードの位置情報が（０，２，２５，１１，２）であることとする。この場合、位置情報の先頭は、常にグループ木構造グラフのルートノードを表すため、‘０’になる。また、位置情報は、２番目の数値が‘２’であることから、グループ木構造グラフでルートノードから１つ下となる第１階層において、左から‘２’個目のノードを表す。また、位置情報は、３番目の数値が‘２５’であることから、グループ木構造グラフの第２階層において、左から‘２５’個目のノードを表す。また、位置情報は、４番目の数値が‘１１’であることから、グループ木構造グラフの第３階層において、左から‘１１’個目のノードを表す。また、位置情報は、５番目の数値が‘２’であることから、グループ木構造グラフの第４階層において、左から‘２’個目のノードを表す。 For example, assume that the position information of a certain node is (0, 2, 25, 11, 2). In this case, since the head of the position information always represents the root node of the group tree structure graph, it becomes “0”. Further, since the second numerical value is “2”, the position information represents the “2” -th node from the left in the first hierarchy that is one level lower than the root node in the group tree structure graph. Further, since the third numerical value is “25”, the position information represents the “25th” node from the left in the second layer of the group tree structure graph. In addition, since the fourth numerical value is “11”, the position information represents the “11” -th node from the left in the third layer of the group tree structure graph. Further, since the fifth numerical value is “2”, the position information represents the “2” -th node from the left in the fourth layer of the group tree structure graph.

以上より、位置情報は、このノードが、第４階層に位置することを表す。そして、位置情報は、このノードが、ルートノード、第１階層で左から‘２’個目のノード、第２階層で左から‘２５’個目のノード、第３階層で左から‘１１’個目のノードを順に経由して、第４階層で左から‘２’個目のノードであることを表す。
なお、各ノードの位置情報は、グループ木構造グラフ内で異なる位置を表し、重複しない。 As described above, the position information indicates that this node is located in the fourth hierarchy. The position information includes the root node, the '2' node from the left in the first layer, the '25' node from the left in the second layer, and the '11' from the left in the third layer. This indicates that the node is the '2' node from the left in the fourth layer through the nodes in order.
Note that the position information of each node represents a different position in the group tree structure graph and does not overlap.

そこで、グループ木構造グラフ生成手段３０は、グループ化グラフＢの各ノードに付加された位置情報が示す位置にそのノードを配置する。このようにして、グループ木構造グラフ生成手段３０は、図６のグループ化グラフＢを、図７のようなグループ木構造グラフＣに変換できる。 Therefore, the group tree structure graph generating unit 30 arranges the node at the position indicated by the position information added to each node of the grouped graph B. In this way, the group tree structure graph generating means 30 can convert the grouped graph B of FIG. 6 into a group tree structure graph C as shown in FIG.

前記したように、ウィキペディアでは、上位概念及び下位概念の関係に曖昧さが残る。そこで、個人プロファイル生成装置１は、ウィキペディアの各キーワードを概念的に類似するキーワード同士でグループ化し、グループ木構造グラフＣを生成する。このグループ木構造グラフＣは、全ノードで木構造が保たれているため、上位概念及び下位概念の関係に曖昧さが残っていない。 As described above, in Wikipedia, ambiguity remains in the relationship between the superordinate concept and the subordinate concept. Therefore, the personal profile generation device 1 groups the keywords of Wikipedia with keywords that are conceptually similar to each other, and generates a group tree structure graph C. In this group tree structure graph C, the tree structure is maintained at all nodes, so that there is no ambiguity in the relationship between the superordinate concept and the subordinate concept.

図２に戻り、グループ木構造グラフ生成手段３０について、説明を続ける。
また、グループ木構造グラフ生成手段３０は、グループ化手段２０から入力された第２グループ化グラフの各ノードに付加された位置情報に基づいて、第２グループ木構造グラフを生成する。ここで、第２グループ木構造グラフは、グループ木構造グラフＣと同様の手順で生成できるため、説明を省略する。
その後、グループ木構造グラフ生成手段３０は、グループ木構造グラフＣ及び第２グループ木構造グラフをグループ木構造グラフＤＢ４０に書き込む。 Returning to FIG. 2, the description of the group tree structure graph generating means 30 will be continued.
The group tree structure graph generation unit 30 generates a second group tree structure graph based on the positional information added to each node of the second grouping graph input from the grouping unit 20. Here, since the second group tree structure graph can be generated in the same procedure as the group tree structure graph C, description thereof is omitted.
Thereafter, the group tree structure graph generation means 30 writes the group tree structure graph C and the second group tree structure graph in the group tree structure graph DB 40.

グループ木構造グラフＤＢ４０は、グループ木構造グラフＣ及び第２グループ木構造グラフを記憶するデータベースである。このグループ木構造グラフＤＢ４０は、後記するプロファイル生成手段５０により、参照される。 The group tree structure graph DB 40 is a database that stores the group tree structure graph C and the second group tree structure graph. The group tree structure graph DB 40 is referred to by profile generation means 50 described later.

プロファイル生成手段５０は、アクセス履歴情報（個人興味情報）が入力され、入力されたアクセス履歴情報に基づいて、視聴者の嗜好を表した重みをグループ木構造グラフＣのノードに伝搬させることで、視聴者毎に個人プロファイル（base_tree）を生成するものである。 The profile generation means 50 receives access history information (individual interest information) and propagates a weight representing the viewer's preference to the nodes of the group tree structure graph C based on the input access history information. A personal profile (base_tree) is generated for each viewer.

ここで、アクセス履歴情報とは、視聴者が興味を持つコンテンツの内容を表す情報であり、各視聴者がコンテンツにアクセスした履歴である。例えば、アクセス履歴情報としては、各視聴者が視聴した放送番組について、ＥＰＧ（Electronic Program Guide）の番組内容情報又は字幕情報があげられる。
なお、個人プロファイルは、グループ木構造グラフＣを各視聴者に適応させたものであるから、個人適応型グループ木構造グラフと呼ぶこともできる。 Here, the access history information is information representing the content of the content that the viewer is interested in, and is a history that each viewer has accessed the content. For example, the access history information includes EPG (Electronic Program Guide) program content information or subtitle information for a broadcast program viewed by each viewer.
Since the personal profile is obtained by adapting the group tree structure graph C to each viewer, it can also be called an individual adaptive group tree structure graph.

＜個人プロファイルの生成＞
図８を参照し、個人プロファイルの生成について、詳細に説明する（適宜図２参照）。
まず、プロファイル生成手段５０は、アクセス履歴情報にキーワード抽出処理（例えば、形態素解析処理）を施し、アクセス履歴情報からキーワード（第２キーワード）を抽出する。次に、プロファイル生成手段５０は、抽出した各キーワードに一致するグループ木構造グラフＣのノードを探索する。そして、プロファイル生成手段５０は、この探索されたノード（基準ノード）に予め設定された重みを加える。
なお、この重みは、任意の値に設定することができる。 <Generate personal profile>
The personal profile generation will be described in detail with reference to FIG. 8 (see FIG. 2 as appropriate).
First, the profile generation means 50 performs keyword extraction processing (for example, morpheme analysis processing) on the access history information, and extracts a keyword (second keyword) from the access history information. Next, the profile generation means 50 searches for a node of the group tree structure graph C that matches each extracted keyword. Then, the profile generation means 50 adds a preset weight to the searched node (reference node).
This weight can be set to an arbitrary value.

続いて、プロファイル生成手段５０は、基準ノードから各ノードに到るまで階層が１つ上がる毎に上位係数αと階層が１つ下がる毎に下位係数βとが乗じられた重みを算出する。ここで、上位係数α及び下位係数βは、１以下の値で予め設定される。そして、プロファイル生成手段５０は、算出した重みが予め設定された閾値以下になるまで、算出した重みを各ノードに加える。すなわち、プロファイル生成手段５０は、上位係数α及び下位係数βが１の場合を除き、基準ノードから階層が離れる程に重みを小さくしながら、重みを伝搬させる。 Subsequently, the profile generation means 50 calculates a weight obtained by multiplying the higher coefficient α every time the hierarchy is increased from the reference node to each node and the lower coefficient β every time the hierarchy is decreased. Here, the higher coefficient α and the lower coefficient β are preset with values of 1 or less. Then, the profile generation unit 50 adds the calculated weight to each node until the calculated weight is equal to or less than a preset threshold value. That is, the profile generation unit 50 propagates the weight while reducing the weight as the layer is separated from the reference node, except when the higher coefficient α and the lower coefficient β are 1.

例えば、アクセス履歴情報から“日本の俳優”というキーワードが抽出された場合を考える。この場合、プロファイル生成手段５０は、図８のように、グループ木構造グラフＣのノード“日本の俳優”を基準ノードとして探索する。そして、プロファイル生成手段５０は、基準ノード“日本の俳優”に重みを加える。
なお、図８では、ａｃｃｕｍが重みを表す。 For example, consider a case where the keyword “Japanese actor” is extracted from the access history information. In this case, the profile generation means 50 searches for the node “Japanese actor” of the group tree structure graph C as a reference node, as shown in FIG. Then, the profile generation means 50 adds weight to the reference node “Japanese actor”.
In FIG. 8, accum represents a weight.

次に、プロファイル生成手段５０は、基準ノード“日本の俳優”からノード“日本の芸能人”まで階層が１つ上がるので、ノード“日本の芸能人”に重み×上位係数αを加算する。そして、プロファイル生成手段５０は、基準ノード“日本の俳優”からノード“日本の人物”まで階層が２つ上がるので、ノード“日本の人物”に重み×上位係数α^２を加算する。さらに、プロファイル生成手段５０は、基準ノード“日本の俳優”からノード“△△県出身の人物”まで階層が２つ上がった後に１つ下がるので、ノード“△△県出身の人物”に重み×上位係数α^２×下位係数βを加算する。 Next, the profile generation means 50 adds a weight × upper coefficient α to the node “Japanese entertainer” because the hierarchy goes up from the reference node “Japanese actor” to the node “Japanese entertainer”. The profile generating unit 50, since the hierarchy from the reference node "Japanese actor" node to the "Japanese person" two up, adds weight × higher coefficient alpha ² to node "Japanese person". Furthermore, since the profile generation means 50 goes up two levels from the reference node “Japanese actor” to the node “person from △△ prefecture” and then goes down, the weight “x” is assigned to the node “person from △△ prefecture”. Add upper coefficient α ² × lower coefficient β.

ここで、プロファイル生成手段５０は、重みが予め設定された閾値以下になるまで、算出した重みを各ノードに加算する。言い換えるなら、プロファイル生成手段５０は、重みが閾値を超える範囲のノードには重みを加算し、重みが閾値以下になる範囲のノードには重みを加算しない。例えば、プロファイル生成手段５０は、ノード“△△市出身の人物”では、重み×上位係数α^２×下位係数β^２の値が閾値以下になるので、ノード“△△市出身の人物”に重みを加算しない。 Here, the profile generation means 50 adds the calculated weight to each node until the weight falls below a preset threshold value. In other words, the profile generation means 50 adds weights to nodes in the range where the weight exceeds the threshold, and does not add weights to nodes in the range where the weight is less than or equal to the threshold. For example, since the value of weight × higher coefficient α ² × lower coefficient β ² is less than or equal to the threshold value at the node “ΔΔ person from the city”, the profile generation means 50 gives weight to the node “ΔΔ city person”. Is not added.

このようにして、プロファイル生成手段５０は、各視聴者に最適化された個人プロファイルを生成する。これにより、個人プロファイル生成装置１は、キーワードの数が少なくても、視聴者の嗜好を表すキーワードの上位概念を見つけることが可能となる。 In this way, the profile generation means 50 generates a personal profile optimized for each viewer. Thereby, even if the number of keywords is small, the personal profile generation device 1 can find a high-level concept of keywords representing the viewer's preference.

なお、プロファイル生成手段５０は、ある視聴者がコンテンツを視聴する都度、視聴されたコンテンツのアクセス履歴情報が入力される。従って、プロファイル生成手段５０は、アクセス履歴情報が入力される都度、前記手順でプロファイルＤＢ６０の個人プロファイルに反映させてもよい。 Note that the profile generation means 50 receives the access history information of the viewed content every time a certain viewer views the content. Accordingly, the profile generation means 50 may reflect the access history information in the personal profile of the profile DB 60 by the above procedure every time the access history information is input.

図２に戻り、プロファイル生成手段５０について、説明を続ける。
また、プロファイル生成手段５０は、第２グループ木構造グラフのノードに重み付けを行うことで、推薦対象プロファイル(target_tree)を生成する。ここで、プロファイル生成手段５０は、第２グループ木構造グラフのノードに、任意の手法で重み付けを行うことができる。例えば、プロファイル生成手段５０は、ｔｆ−ｉｄｆ（Term Frequency-Inverse Document Frequency）法で重み付けを行ってもよい。また、プロファイル生成手段５０は、第２グループ木構造グラフのノードが固有名詞のときは一般名詞よりも大きく重み付けを行い、そのノードが一般名詞のときは固有名詞よりも小さく重み付けを行ってもよい。 Returning to FIG. 2, the description of the profile generation means 50 will be continued.
Further, the profile generation means 50 generates a recommendation target profile (target_tree) by weighting the nodes of the second group tree structure graph. Here, the profile generation means 50 can weight the nodes of the second group tree structure graph by any method. For example, the profile generation means 50 may perform weighting by a tf-idf (Term Frequency-Inverse Document Frequency) method. Further, the profile generation means 50 may weight more heavily than a general noun when the node of the second group tree structure graph is a proper noun, and may weight less than the proper noun when the node is a general noun. .

その後、プロファイル生成手段５０は、生成した個人プロファイル及び推薦対象プロファイルをプロファイルＤＢ６０に書き込む。
なお、推薦対象プロファイルは、個人プロファイルと同一の木構造を有するが、各ノードの重みが個人プロファイルで同一位置のノードと異なる。 Thereafter, the profile generation unit 50 writes the generated personal profile and recommendation target profile in the profile DB 60.
The recommendation target profile has the same tree structure as the personal profile, but the weight of each node is different from the node at the same position in the personal profile.

プロファイルＤＢ６０は、個人プロファイル及び推薦対象プロファイルを記憶するデータベースである。このプロファイルＤＢ６０は、後記するコンテンツ推薦手段７０により、参照される。 The profile DB 60 is a database that stores personal profiles and recommendation target profiles. This profile DB 60 is referred to by content recommendation means 70 described later.

コンテンツ推薦手段７０は、個人プロファイルと推薦対象プロファイルとの一致度を算出し、算出した一致度の閾値判定によって、推薦対象のコンテンツを視聴者に推薦するか否かを判定するものである。 The content recommendation means 70 calculates the degree of coincidence between the personal profile and the recommendation target profile, and determines whether or not to recommend the content to be recommended to the viewer by determining the threshold value of the calculated degree of coincidence.

＜コンテンツの推薦＞
以下、コンテンツの推薦について、詳細に説明する（適宜図２参照）。
本実施形態では、コンテンツ推薦手段７０は、個人プロファイルと推薦対象プロファイルとの一致度として、推薦対象プロファイルが個人プロファイルにどの程度含まれているかを表す包含係数を算出する。 <Content recommendation>
Hereinafter, the content recommendation will be described in detail (see FIG. 2 as appropriate).
In the present embodiment, the content recommendation unit 70 calculates an inclusion coefficient indicating how much the recommendation target profile is included in the personal profile as the degree of coincidence between the personal profile and the recommendation target profile.

ここで、コンテンツ推薦手段７０は、推薦対象プロファイル（target_tree）において、重みが‘０’を超えるリーフノードを全て抽出する。なお、リーフノードとは、推薦対象プロファイル及び個人プロファイルで最下層に位置するノードのことである。 Here, the content recommendation unit 70 extracts all leaf nodes whose weights exceed “0” in the recommendation target profile (target_tree). In addition, a leaf node is a node located in the lowest layer in a recommendation object profile and a personal profile.

また、コンテンツ推薦手段７０は、抽出された各リーフノードを起点として、ルートノードまでの重みを表したベクトルｔａｒｇｅｔ＿ｖｅｃ[ｉ]を算出する。ここで、ｉは、リーフノードの番号を表す１以上の整数である。例えば、ｉ番目のリーフノードの位置情報が（０，１，２，３，４）であることとする。この場合、コンテンツ推薦手段７０は、ベクトルｔａｒｇｅｔ＿ｖｅｃ[ｉ]として、[（０，１，２，３，４）＿ａｃｃｕｍ，（０，１，２，３）＿ａｃｃｕｍ，（０，１，２）＿ａｃｃｕｍ，（０，１）＿ａｃｃｕｍ，（０）＿ａｃｃｕｍ]という５次元のベクトルを算出する。
なお、ａｃｃｕｍの前に各ノードの位置情報を記載した。例えば、（０，１，２，３，４）＿ａｃｃｕｍがｉ番目のリーフノードの重みを表し、（０）＿ａｃｃｕｍがルートノードの重みを表す。 In addition, the content recommendation unit 70 calculates a vector target_vec [i] representing the weight to the root node, starting from each extracted leaf node. Here, i is an integer of 1 or more that represents the number of a leaf node. For example, it is assumed that the position information of the i-th leaf node is (0, 1, 2, 3, 4). In this case, the content recommendation unit 70 uses [(0, 1, 2, 3, 4) _accum, (0, 1, 2, 3) _accum, (0, 1, 2) _accum, as the vector target_vec [i]. A five-dimensional vector (0,1) _accum, (0) _accum] is calculated.
In addition, the position information of each node was described before accum. For example, (0, 1, 2, 3, 4) _accum represents the weight of the i-th leaf node, and (0) _accum represents the weight of the root node.

続いて、コンテンツ推薦手段７０は、個人プロファイル（base_tree）において、重みが‘０’を超えるリーフノードを全て抽出し、抽出した各リーフノードを起点として、ルートノードまでの重みを表したベクトルｂａｓｅ＿ｖｅｃ[ｉ]を算出する。ここで、コンテンツ推薦手段７０は、ベクトルｂａｓｅ＿ｖｅｃ[ｉ]を、ベクトルｔａｒｇｅｔ＿ｖｅｃ[ｉ]と同様に算出するため、説明を省略する。 Subsequently, the content recommendation unit 70 extracts all leaf nodes having a weight exceeding “0” in the personal profile (base_tree), and a vector base_vec [ i] is calculated. Here, the content recommendation unit 70 calculates the vector base_vec [i] in the same manner as the vector target_vec [i], and thus the description thereof is omitted.

そして、コンテンツ推薦手段７０は、ベクトルｂａｓｅ＿ｖｅｃ[ｉ]の各要素を、個人プロファイルを生成したときの補正係数を用いて平準化する。例えば、コンテンツ推薦手段７０は、補正係数が個人プロファイルのキーワード総数の場合、ベクトルｂａｓｅ＿ｖｅｃ[ｉ]の各要素をキーワード総数で除算する。
なお、コンテンツ推薦手段７０は、ベクトルｔａｒｇｅｔ＿ｖｅｃ[ｉ]の各要素を平準化してもよいし、平準化しなくともよい。 Then, the content recommendation unit 70 equalizes each element of the vector base_vec [i] using the correction coefficient when the personal profile is generated. For example, when the correction coefficient is the total number of keywords in the personal profile, the content recommendation unit 70 divides each element of the vector base_vec [i] by the total number of keywords.
Note that the content recommendation unit 70 may or may not level the elements of the vector target_vec [i].

続いて、コンテンツ推薦手段７０は、ベクトルｔａｒｇｅｔ＿ｖｅｃ[ｉ]とベクトルｂａｓｅ＿ｖｅｃ[ｉ]との内積をベクトル類似度として算出する。さらに、コンテンツ推薦手段７０は、包含係数として、ｉ個のベクトル類似度の平均値を算出する。つまり、コンテンツ推薦手段７０は、同じリーフノードからルートノードまでの経路で生成したベクトルの内積により、各経路の重なり具合を表した指標を算出する。 Subsequently, the content recommendation unit 70 calculates the inner product of the vector target_vec [i] and the vector base_vec [i] as the vector similarity. Further, the content recommendation unit 70 calculates an average value of i vector similarities as an inclusion coefficient. That is, the content recommendation unit 70 calculates an index that represents the degree of overlap of each path based on the inner product of vectors generated from the same leaf node to the root node.

続いて、コンテンツ推薦手段７０は、包含係数（一致度）が予め設定された閾値以上であるか否かを判定する。
包含係数が閾値以上の場合、コンテンツ推薦手段７０は、推薦対象となるコンテンツを視聴者に推薦するという判定結果を出力する。
包含係数が閾値未満の場合、コンテンツ推薦手段７０は、推薦対象となるコンテンツを視聴者に推薦しないという判定結果を出力する。 Subsequently, the content recommendation unit 70 determines whether or not the inclusion coefficient (matching degree) is equal to or greater than a preset threshold value.
When the inclusion coefficient is equal to or greater than the threshold, the content recommendation unit 70 outputs a determination result that recommends the content to be recommended to the viewer.
If the inclusion coefficient is less than the threshold, the content recommendation unit 70 outputs a determination result that the content to be recommended is not recommended to the viewer.

［個人プロファイル生成装置の動作：グループ木構造グラフ生成処理］
図９を参照し、個人プロファイル生成装置１のグループ木構造グラフ生成処理について、詳細に説明する（適宜図２参照）。 [Personal Profile Generation Device Operation: Group Tree Structure Graph Generation Processing]
With reference to FIG. 9, the group tree structure graph generation processing of the personal profile generation device 1 will be described in detail (see FIG. 2 as appropriate).

個人プロファイル生成装置１は、初期グラフ生成手段１０によって、リンク付きコーパスのキーワードをノードとし、リンクされたキーワード同士をエッジとして接続した初期グラフＡを生成する（ステップＳ１）。 The personal profile generation device 1 uses the initial graph generation means 10 to generate an initial graph A in which the keywords of the linked corpus are used as nodes and the linked keywords are connected as edges (step S1).

個人プロファイル生成装置１は、グループ化手段２０によって、全てのノードが同一の親グループに属するように、初期グラフＡのグループを初期化する（ステップＳ２）。
個人プロファイル生成装置１は、グループ化手段２０によって、ノード間の連結粗密情報に基づいて、親グループを２以上の子グループに分割する（ステップＳ３）。 The personal profile generation device 1 initializes the group of the initial graph A by the grouping means 20 so that all the nodes belong to the same parent group (step S2).
In the personal profile generation device 1, the grouping means 20 divides the parent group into two or more child groups based on the connection density information between nodes (step S3).

個人プロファイル生成装置１は、グループ化手段２０によって、各グループの親子関係を抽出する（ステップＳ４）。
個人プロファイル生成装置１は、グループ化手段２０によって、終了条件を満たすか否かを判定する（ステップＳ５）。 The personal profile generation device 1 extracts the parent-child relationship of each group by the grouping means 20 (step S4).
The personal profile generation device 1 determines whether or not the end condition is satisfied by the grouping unit 20 (step S5).

終了条件を満たさない場合（ステップＳ５でＮｏ）、個人プロファイル生成装置１は、グループ化手段２０によって、ステップＳ４で抽出した子グループを親グループに更新し（ステップＳ６）、ステップＳ３の処理に戻る。 When the end condition is not satisfied (No in step S5), the personal profile generation device 1 updates the child group extracted in step S4 to the parent group by the grouping unit 20 (step S6), and returns to the process of step S3. .

終了条件を満たす場合（ステップＳ５でＹｅｓ）、個人プロファイル生成装置１は、グループ木構造グラフ生成手段３０によって、ステップＳ２〜Ｓ４の処理で生成されたグループ化グラフＢにおいて、位置情報が示す位置に各ノードを配置することで、グループ木構造グラフＣを生成する（ステップＳ７）。
以上の手順により、個人プロファイル生成装置１は、グループ木構造グラフＣを生成できる。 If the end condition is satisfied (Yes in step S5), the personal profile generation device 1 uses the group tree structure graph generation unit 30 to position the position indicated by the position information in the grouped graph B generated in the processes of steps S2 to S4. The group tree structure graph C is generated by arranging each node (step S7).
Through the above procedure, the personal profile generation device 1 can generate the group tree structure graph C.

なお、個人プロファイル生成装置１は、推薦対象情報にキーワード抽出処理を施し、リンク付きコーパスにおけるキーワード同士のリンクを利用することで、第２初期グラフを生成できる（ステップＳ１Ａ）。
以後、個人プロファイル生成装置１は、ステップＳ２〜Ｓ４と同様の処理で第２グループ木構造グラフを生成できるため、説明を省略する。 Note that the personal profile generation device 1 can generate a second initial graph by performing keyword extraction processing on the recommendation target information and using links between keywords in the linked corpus (step S1A).
Thereafter, since the personal profile generation device 1 can generate the second group tree structure graph by the same processing as steps S2 to S4, description thereof is omitted.

［個人プロファイル生成装置の動作：プロファイル生成処理］
図１０を参照し、個人プロファイル生成装置１のプロファイル生成処理について、詳細に説明する（適宜図２参照）。
個人プロファイル生成装置１は、プロファイル生成手段５０によって、アクセス履歴情報からキーワードを抽出する（ステップＳ１１）。 [Personal Profile Generation Device Operation: Profile Generation Processing]
With reference to FIG. 10, the profile generation processing of the personal profile generation device 1 will be described in detail (see FIG. 2 as appropriate).
The personal profile generation device 1 extracts keywords from the access history information by the profile generation means 50 (step S11).

個人プロファイル生成装置１は、プロファイル生成手段５０によって、抽出したキーワードに一致するグループ木構造グラフＣのノードを探索し、探索された基準ノードに重みを加える。
個人プロファイル生成装置１は、プロファイル生成手段５０によって、上位係数αと下位係数βとが乗じられた重みが閾値以下になるまで、その重みをグループ木構造グラフＣのノードに加える（ステップＳ１２）。 The personal profile generation device 1 uses the profile generation means 50 to search for a node of the group tree structure graph C that matches the extracted keyword, and adds a weight to the searched reference node.
The personal profile generation device 1 adds the weight to the node of the group tree structure graph C until the weight generated by multiplying the high-order coefficient α and the low-order coefficient β by the profile generation means 50 becomes equal to or less than the threshold (step S12).

ここで、個人プロファイル生成装置１は、アクセス履歴情報から抽出した全てのキーワードについて、ステップＳ１１，Ｓ１２の処理を行う。
以上の手順により、個人プロファイル生成装置１は、個人プロファイルを生成できる。 Here, the personal profile generation device 1 performs the processing of steps S11 and S12 for all keywords extracted from the access history information.
Through the above procedure, the personal profile generation apparatus 1 can generate a personal profile.

なお、個人プロファイル生成装置１は、ステップＳ１１の処理を行わずに、第２グループ木構造グラフの各ノードに任意の手法で重み付けを行うことで、推薦対象プロファイルを生成できる（ステップＳ１２Ａ）。 The personal profile generation device 1 can generate a recommendation target profile by weighting each node of the second group tree structure graph by an arbitrary method without performing the process of step S11 (step S12A).

［コンテンツ推薦装置の動作：コンテンツ推薦処理］
図１１を参照し、コンテンツ推薦装置２のコンテンツ推薦処理について、詳細に説明する（適宜図２参照）。 [Operation of content recommendation device: content recommendation processing]
The content recommendation process of the content recommendation device 2 will be described in detail with reference to FIG. 11 (see FIG. 2 as appropriate).

コンテンツ推薦装置２は、コンテンツ推薦手段７０によって、個人プロファイルと推薦対象プロファイルとをベクトル化する（ステップＳ２１）。
コンテンツ推薦装置２は、コンテンツ推薦手段７０によって、ステップＳ２１でベクトル化された個人プロファイルと推薦対象プロファイルとの一致度を算出する（ステップＳ２２）。
コンテンツ推薦装置２は、コンテンツ推薦手段７０によって、ステップＳ２２で算出した一致度が閾値以上であるか否かを判定する（ステップＳ２３）。 The content recommendation device 2 vectorizes the personal profile and the recommendation target profile by the content recommendation means 70 (step S21).
The content recommendation device 2 calculates the degree of coincidence between the personal profile vectorized in step S21 and the recommendation target profile by the content recommendation means 70 (step S22).
The content recommendation device 2 determines whether or not the degree of coincidence calculated in step S22 is greater than or equal to a threshold value by the content recommendation unit 70 (step S23).

一致度が閾値以上の場合（ステップＳ２３でＹｅｓ）、コンテンツ推薦手段７０は、推薦対象となるコンテンツを推薦する（ステップＳ２４）。
一致度が閾値未満の場合（ステップＳ２３でＮｏ）、又は、ステップＳ２４の処理後、コンテンツ推薦手段７０は、処理を終了する。
以上の手順により、コンテンツ推薦装置２は、コンテンツを推薦できる。 If the degree of coincidence is equal to or greater than the threshold (Yes in step S23), the content recommendation unit 70 recommends the content to be recommended (step S24).
If the degree of coincidence is less than the threshold (No in step S23) or after the processing in step S24, the content recommendation unit 70 ends the processing.
Through the above procedure, the content recommendation device 2 can recommend content.

本願発明の実施形態に係るコンテンツ推薦装置２は、擬似的な概念空間にキーワードをマッピングするので、概念空間まで個人プロファイルを拡張可能とし、広範な視聴者の嗜好が反映された個人プロファイルを生成することができる。 Since the content recommendation device 2 according to the embodiment of the present invention maps keywords to a pseudo concept space, the personal profile can be expanded to the concept space, and a personal profile reflecting a wide range of viewer preferences is generated. be able to.

すなわち、コンテンツ推薦装置２は、概念的な領域まで踏み込んで、視聴者の嗜好を個人プロファイル化することができる。その結果、コンテンツ推薦装置２は、表層的なキーワードでのマッチング処理で困難であったグループ単位での概念を基にした情報推薦が可能となり、未知のキーワードが出現しても、的確に視聴者の嗜好に応じたコンテンツを推薦できる。 That is, the content recommendation device 2 can step into a conceptual area and personalize the viewer's preference. As a result, the content recommendation device 2 can recommend information based on the concept of the group unit, which is difficult in the matching process with the surface keyword, and even if an unknown keyword appears, the viewer can accurately You can recommend content according to your preferences.

以上、本願発明の各実施形態を詳述してきたが、本願発明は前記した実施形態に限られるものではなく、本願発明の要旨を逸脱しない範囲の設計変更なども含まれる。 As mentioned above, although each embodiment of this invention was explained in full detail, this invention is not restricted to above-described embodiment, The design change etc. of the range which does not deviate from the summary of this invention are included.

前記した実施形態では、キーワードを扱って説明したが、本願発明は、キーワードに限られず画像特徴量などの情報リソースにも適用することができる。
また、前記した実施形態では、番組に関するコンテンツを推薦することとして説明したが、本願発明による個人プロファイルを用いて、番組以外のコンテンツ、例えばイベント情報、商品情報などを推薦することもできる。 In the above-described embodiment, the keyword has been described. However, the present invention is not limited to the keyword, and can be applied to information resources such as image feature amounts.
In the above-described embodiment, the content related to the program is recommended. However, content other than the program, such as event information and product information, can be recommended using the personal profile according to the present invention.

前記した実施形態では、リンク付きコーパスがウィキペディアであることとして説明したが、本願発明は、これに限定されない。例えば、本願発明は、リンク付きコーパスとして、リンクが張られたホームページや、学術論文の引用情報も利用することができる。 In the above-described embodiment, the linked corpus is described as being Wikipedia, but the present invention is not limited to this. For example, the present invention can also use a linked home page or academic paper citation information as a linked corpus.

前記した実施形態では、一致度の閾値判定により推薦の可否を判定したが、本願発明は、これに限定されない。例えば、本願発明は、推薦対象となるコンテンツ毎に一致度を算出し、一致度が高い順に視聴者に推薦してもよい。 In the above-described embodiment, whether or not the recommendation is possible is determined by the threshold value determination of the degree of coincidence, but the present invention is not limited to this. For example, the present invention may calculate the degree of coincidence for each content to be recommended and recommend it to the viewer in descending order of the degree of coincidence.

前記した実施形態では、個人プロファイル生成装置１を独立したハードウェアとして説明したが、本願発明は、これに限定されない。例えば、個人プロファイル生成装置１は、コンピュータが備えるＣＰＵ、メモリ、ハードディスクなどのハードウェア資源を、前記した各手段として協調動作させる個人プロファイル生成プログラムで実現することもできる。このプログラムは、通信回線を介して配布してもよく、ＣＤ−ＲＯＭやフラッシュメモリなどの記録媒体に書き込んで配布してもよい。 In the above-described embodiment, the personal profile generation device 1 has been described as independent hardware, but the present invention is not limited to this. For example, the personal profile generation apparatus 1 can also be realized by a personal profile generation program that causes hardware resources such as a CPU, a memory, and a hard disk included in a computer to operate cooperatively as the above-described units. This program may be distributed via a communication line, or may be distributed by writing in a recording medium such as a CD-ROM or a flash memory.

１個人プロファイル生成装置
２コンテンツ推薦装置
１０初期グラフ生成手段
２０グループ化手段
３０グループ木構造グラフ生成手段
４０グループ木構造グラフＤＢ
５０プロファイル生成手段
６０プロファイルＤＢ
７０コンテンツ推薦手段 1 Personal profile generation device 2 Content recommendation device 10 Initial graph generation means 20 Grouping means 30 Group tree structure graph generation means 40 Group tree structure graph DB
50 Profile generation means 60 Profile DB
70 Content recommendation means

Claims

第１キーワード同士がリンクされたリンク付きコーパスから、個人の嗜好に適応した個人プロファイルを生成する個人プロファイル生成装置であって、
前記リンク付きコーパスの第１キーワードをノードとして、リンクされた前記第１キーワード同士をエッジとして接続した第１初期グラフを生成する初期グラフ生成手段と、
予め設定された終了条件を満たすまで前記第１初期グラフを再帰的にグループ化することで、木構造における前記ノードの位置を表した位置情報が付加された第１グループ化グラフを生成するグループ化手段と、
前記位置情報に基づいて前記第１グループ化グラフの各ノードを配置することで、前記第１グループ化グラフから、上位概念及び下位概念となる前記ノード同士が木構造で階層化された第１グループ木構造グラフを生成するグループ木構造グラフ生成手段と、
前記個人が興味を持つコンテンツの内容を表す個人興味情報に基づいて、前記個人の嗜好を表した重みを前記第１グループ木構造グラフのノードに伝搬させることで、前記個人プロファイルを生成するプロファイル生成手段と、を備えることを特徴とする個人プロファイル生成装置。 A personal profile generation device that generates a personal profile adapted to personal preferences from a linked corpus in which first keywords are linked,
Initial graph generation means for generating a first initial graph in which the first keywords of the linked corpus are connected as nodes and the linked first keywords are connected as edges;
Grouping to generate a first grouped graph to which position information representing the position of the node in the tree structure is added by recursively grouping the first initial graph until a preset termination condition is satisfied Means,
By arranging each node of the first grouped graph based on the position information, a first group in which the nodes that are the higher and lower concepts are hierarchized in a tree structure from the first grouped graph A group tree structure graph generating means for generating a tree structure graph;
Profile generation for generating the personal profile by propagating a weight representing the personal preference to a node of the first group tree structure graph based on personal interest information representing the content of the content that the individual is interested in Means for generating a personal profile.

前記グループ化手段は、前記ノード間の連結粗密情報による再帰的グループ化処理を前記第１初期グラフに施すことを特徴とする請求項１に記載の個人プロファイル生成装置。 2. The personal profile generation apparatus according to claim 1, wherein the grouping unit performs a recursive grouping process on the first initial graph based on connection density information between the nodes.

前記プロファイル生成手段は、
前記個人興味情報に含まれる第２キーワードを抽出し、
抽出した前記第２キーワードに一致する前記第１グループ木構造グラフのノードである基準ノードに所定の重みを加え、
前記基準ノードから前記第１グループ木構造グラフの各ノードに到るまで階層が１つ上がる毎に予め設定された１以下の上位係数と前記階層が１つ下がる毎に予め設定された１以下の下位係数とが乗じられた重みを算出し、
算出した当該重みが予め設定された閾値以下になるまで、算出した当該重みを前記第１グループ木構造グラフの各ノードに加えることを特徴とする請求項１又は請求項２に記載の個人プロファイル生成装置。 The profile generation means includes
Extracting a second keyword contained in the personal interest information;
A predetermined weight is added to a reference node that is a node of the first group tree structure graph that matches the extracted second keyword,
Every time the hierarchy goes up from the reference node to each node of the first group tree structure graph, a higher coefficient of 1 or less set in advance and 1 or less set in advance every time the hierarchy goes down Calculate the weight multiplied by the lower coefficient,
3. The personal profile generation according to claim 1 or 2, wherein the calculated weight is added to each node of the first group tree structure graph until the calculated weight is equal to or less than a preset threshold value. apparatus.

前記初期グラフ生成手段は、推薦対象となるコンテンツの内容を表す推薦対象情報に含まれる第３キーワードのうち、前記第１キーワードに一致するものを抽出し、抽出された前記第３キーワードをノードとして、前記第１キーワード同士のリンクにより前記第３キーワード同士をエッジとして接続した第２初期グラフを生成し、
前記グループ化手段は、前記終了条件を満たすまで前記第２初期グラフを再帰的にグループ化することで、前記位置情報が付加された第２グループ化グラフを生成し、
前記グループ木構造グラフ生成手段は、前記位置情報に基づいて前記第２グループ化グラフの各ノードを配置することで、前記第２グループ化グラフから、上位概念及び下位概念となる前記ノード同士が木構造で階層化された第２グループ木構造グラフを生成し、
前記プロファイル生成手段は、前記第２グループ木構造グラフのノードに重み付けを行うことで、推薦対象プロファイルを生成することを特徴とする請求項１から請求項３の何れか一項に記載の個人プロファイル生成装置。 The initial graph generation unit extracts a third keyword included in the recommendation target information representing the content of the content to be recommended, and matches the first keyword, and uses the extracted third keyword as a node. , Generating a second initial graph in which the third keywords are connected as edges by links between the first keywords,
The grouping means generates a second grouped graph to which the position information is added by recursively grouping the second initial graph until the termination condition is satisfied,
The group tree structure graph generating means arranges each node of the second grouped graph based on the position information, so that the nodes that are the higher concept and the lower concept are arranged in the tree from the second grouped graph. Generate a second group tree structure graph hierarchized by structure,
4. The personal profile according to claim 1, wherein the profile generation unit generates a recommendation target profile by weighting nodes of the second group tree structure graph. 5. Generator.

請求項４に記載の個人プロファイル生成装置と、
前記個人プロファイルと前記推薦対象プロファイルとの一致度を算出し、算出した前記一致度の閾値判定によって、前記推薦対象のコンテンツを前記個人に推薦するか否かを判定するコンテンツ推薦手段と、を備えることを特徴とするコンテンツ推薦装置。 The personal profile generation device according to claim 4,
Content recommendation means for calculating a degree of coincidence between the personal profile and the recommendation target profile, and determining whether or not to recommend the content to be recommended to the individual by determining the threshold value of the calculated degree of coincidence. A content recommendation device.

コンピュータを、請求項１から請求項４の何れか一項に記載の個人プロファイル生成装置として機能させるための個人プロファイル生成プログラム。 The personal profile production | generation program for functioning a computer as a personal profile production | generation apparatus as described in any one of Claims 1-4.