JP6040137B2

JP6040137B2 - Item recommendation device, item recommendation method, and item recommendation program

Info

Publication number: JP6040137B2
Application number: JP2013216573A
Authority: JP
Inventors: 結城遠藤; 浩之戸田; 佐藤　隆; 隆佐藤; 鷲崎　誠司; 誠司鷲崎
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2013-10-17
Filing date: 2013-10-17
Publication date: 2016-12-07
Anticipated expiration: 2033-10-17
Also published as: JP2015079381A

Description

本発明は、ある情報に関連するアイテムを推薦する装置、特に世の中のトレンドに基づいて、ＴＶやＶＯＤ(ビデオオンデマンド)等の番組と関連するアイテムを推定し、推薦するためのアイテム推薦装置、方法、プログラムに関する。 The present invention is an apparatus for recommending items related to certain information, in particular, an item recommendation apparatus for estimating and recommending items related to programs such as TV and VOD (video on demand) based on the trend of the world, Methods and programs.

ＴＶのデジタル化やインターネットの普及に伴い、単に番組を視聴するだけでなく、ＴＶを媒介とした様々なサービスを利用できるようになった。具体的には、番組関連情報の閲覧やショッピング等のサービスが提供されている(例えば特許文献１、２、３参照)。 With the digitization of TV and the spread of the Internet, it has become possible not only to watch programs but also to use various services via TV. Specifically, services such as browsing of program related information and shopping are provided (see, for example, Patent Documents 1, 2, and 3).

特許文献１においては、推薦するアイテムについて、予め番組とアイテムが関連付けられていることが前提である。しかし、膨大な数のアイテムから、人手で関連するアイテムを選ぶのは、多大なコストを要する作業である。 In Patent Document 1, it is premised that a program and an item are associated in advance with respect to an item to be recommended. However, it is a costly operation to manually select a related item from a large number of items.

これに対し、特許文献２によれば、電子番組表（ＥＰＧ）やクローズドキャプション（ＣＣ）のメタデータを基に、映像データの所定のタイミングで関連情報を提供することができる。 On the other hand, according to Patent Document 2, related information can be provided at a predetermined timing of video data based on metadata of an electronic program guide (EPG) or closed caption (CC).

また、特許文献３によれば、ユーザの番組視聴履歴に基づいて、過去に視聴していた番組情報を解析し、ユーザの嗜好を把握することで、ユーザの好みに合った番組を推薦する方法が提案されている。 According to Patent Document 3, a method of recommending a program that matches a user's preference by analyzing program information that has been viewed in the past based on the user's program viewing history and grasping the user's preference. Has been proposed.

尚、本発明においてトピック抽出の際に利用する、トピックモデルであるＬａｔｅｎｔＤｉｒｉｃｈｌｅｔＡｌｌｏｃａｔｉｏｎ（ＬＤＡ）は非特許文献１に記載され、また本発明の番組トピック関連度計算部が、隣接行列に基づいて番組およびトピックの各特徴量を求める際に利用するＰｏｗｅｒＩｔｅｒａｔｉｏｎの技術は非特許文献２に記載されている。 Note that the topic direc- tion allocation (LDA) used in topic extraction in the present invention is described in Non-Patent Document 1, and the program topic relevance calculation unit of the present invention is based on the adjacency matrix. Further, Non-Patent Document 2 describes a power iteration technique used when obtaining each feature amount of a topic.

特開２００２−３３０４２２号公報JP 2002-330422 A 特開２０１３−１６４７７０号公報JP 2013-164770 A 特開２００８−１９９４０６号公報JP 2008-199406 A

Ｄ．Ｍ．Ｂｌｅｉ，Ａ．Ｙ．Ｎｇ，ａｎｄＭ．Ｉ．Ｊｏｒｄａｎ．ＬａｔｅｎｔＤｉｒｉｃｈｌｅｔＡｌｌｏｃａｔｉｏｎ，ｉｎＪｏｕｒｎａｌｏｆＭａｃｈｉｎｅＬｅａｒｎｉｎｇＲｅｓｅａｒｃｈ（ＪＭＬＲ），ｖｏｌ．３，ｐｐ．９９３−１０２２（２００３）．D. M.M. Blei, A .; Y. Ng, and M.M. I. Jordan. Latent Dirichlet Allocation, in Journal of Machine Learning Research (JMLR), vol. 3, pp. 993-1022 (2003). Ｌｉｎ，Ｆ．ａｎｄＣｏｈｅｎ，Ｗ．Ｗ．：Ｐｏｗｅｒｉｔｅｒａｔｉｏｎｃｌｕｓｔｅｒｉｎｇ，ＩｎＰｒｏｃ．ＯｆＩＣＭＬ，ｐｐ．６５５−６６２，２０１０．Lin, F.M. and Cohen, W.M. W. : Power iteration clustering, In Proc. Of ICML, pp. 655-662, 2010.

しかし、従来の番組情報や視聴履歴を用いた技術は、ユーザの好みに基づいてアイテムを推薦できる一方で、世の中のトレンドに基づいた推薦はできなかった。例えばスポーツ番組を視聴しているユーザに対して、スポーツ映像や用品等のアイテムを推薦することはできるが、今何のスポーツ映像や用品に関する内容が話題になっているのかといった、今話題のトピックに関連するアイテムを推薦することができないという問題があった。 However, conventional techniques using program information and viewing history can recommend items based on user preferences, but cannot make recommendations based on trends in the world. For example, you can recommend items such as sports videos and equipment to users who are watching sports programs, but topics that are currently discussed, such as what sports videos and equipment are currently being discussed There was a problem that items related to could not be recommended.

また、特許文献２をはじめとする従来技術の多くは、推薦するアイテムを番組情報に基づいて検索しているが、用いる番組の情報量が不十分な場合が多く、番組と関連する多くのアイテムを推薦できないという問題がある。一般的な番組情報としては、放送波と共に配信されるＥＰＧやクローズドキャプションＣＣがある。 In addition, many of the conventional techniques including Patent Document 2 search for recommended items based on program information, but there are many cases where the amount of information of programs to be used is insufficient, and many items related to the programs. There is a problem that cannot be recommended. General program information includes EPG distributed with broadcast waves and closed caption CC.

しかし、ＥＰＧは番組の概要であるため具体的な情報に乏しく、ＣＣを含まない番組も多く存在する。例えば、箱根駅伝の視聴者にシューズやウェアを推薦したい場合、箱根駅伝の番組情報にはシューズという単語が含まれていることはなく、逆にシューズのメタデータにおいては箱根駅伝という単語が含まれていないため、従来技術では推薦することができない。これに対し、人手により番組情報を増やすサービスも存在するが、これは非常にコストがかかるという問題がある。 However, since the EPG is an outline of a program, there is a lack of specific information, and there are many programs that do not include a CC. For example, if you want to recommend shoes and wear to viewers of Hakone Ekiden, the program information for Hakone Ekiden will not include the word shoes, but conversely the shoe metadata will include the word Hakone Ekiden. Therefore, it cannot be recommended by the conventional technology. On the other hand, there is a service for manually increasing program information, but this has a problem that it is very expensive.

本発明は上記課題を解決するものであり、その目的は、世の中のトレンドに基づいた話題のトピックと関連し、かつ番組と関連するアイテムを推薦することができるアイテム推薦装置、方法、プログラムを提供することにある。 SUMMARY OF THE INVENTION The present invention solves the above-mentioned problems, and an object thereof is to provide an item recommendation device, method, and program capable of recommending an item related to a topic based on a trend in the world and related to a program. There is to do.

上記課題を解決するための本発明のアイテム推薦装置は、番組と関連するアイテムを推薦するアイテム推薦装置であって、テキストの集合が格納されたテキストストリーム記憶部から特徴語を含むトピックを抽出するトピック抽出手段と、前記トピック抽出手段により抽出されたトピックのトレンド度合いを表すトレンドスコアを計算し、前記トピックとともに記憶するトレンドスコア計算手段と、Ｗｅｂ情報が格納されたＷｅｂ情報データベースから番組情報と関連するＷｅｂ情報を抽出し、該抽出した番組関連Ｗｅｂ情報を記憶する番組関連Ｗｅｂ情報抽出手段と、番組情報、前記番組関連Ｗｅｂ情報抽出手段により抽出された番組関連Ｗｅｂ情報および前記トレンドスコア計算手段により記憶されたトピックに基づいて、番組とトピックの関連度を計算する番組・トピック関連度計算手段と、アイテムとそれに関する情報が格納されたアイテム情報データベースと、前記アイテム情報データベース内のアイテム、前記番組・トピック関連度計算手段により計算された番組とトピックの関連度および前記トレンドスコア計算手段により計算し記憶されたトピックのトレンドスコアに基づいて、番組とアイテムの関連度を計算する番組・アイテム関連度計算手段と、前記番組・アイテム関連度計算手段によって計算された番組とアイテムの関連度に基づいて、当該番組において推薦するアイテムを出力する出力手段と、を備えたことを特徴としている。 An item recommendation device according to the present invention for solving the above-described problem is an item recommendation device for recommending an item related to a program, and extracts a topic including a feature word from a text stream storage unit in which a set of texts is stored. Topic extraction means, trend score calculation means for calculating the trend score of the topic extracted by the topic extraction means, and storing the trend score together with the topic; and related to program information from the Web information database storing the Web information Program-related Web information extracting means for extracting Web information to be stored and storing the extracted program-related Web information; program information; program-related Web information extracted by the program-related Web information extracting means; and the trend score calculating means Programs and topics based on memorized topics Program / topic relevance calculating means for calculating the relevance of items, item information database storing items and information related thereto, items in the item information database, programs calculated by the program / topic relevance calculating means Program / item relevance calculating means for calculating the relevance of a program and an item based on the topic relevance and the topic trend score calculated and stored by the trend score calculating means, and the program / item relevance calculation Output means for outputting an item recommended in the program based on the degree of association between the program and the item calculated by the means.

なお、テキストストリームとはマイクロブログ(例えばTwitter（登録商標）)等のリアルタイム性の高いテキストを想定している。 The text stream is assumed to be text with high real-time properties such as a microblog (for example, Twitter (registered trademark)).

本発明によれば、世の中のトレンドに基づいた話題のトピックと関連し、かつ番組と関連するアイテムを推薦することができる。また、番組情報がＥＰＧおよびＣＣのみの少ない情報量であっても、番組関連Ｗｅｂ情報を抽出し利用しているので、幅広く番組とアイテムとを関連付け、関連アイテムを推薦することができる。 ADVANTAGE OF THE INVENTION According to this invention, the item relevant to the topic of a topic based on the trend of the world and relevant to a program can be recommended. Further, even if the program information has a small amount of information only of EPG and CC, since the program related Web information is extracted and used, it is possible to widely associate programs with items and recommend related items.

本発明の一実施形態例を示す構成図。The block diagram which shows one embodiment of this invention. 本発明の一実施形態例における、テキストストリームからトレンドトピックを計算するまでのフローチャート。The flowchart until it calculates a trend topic from a text stream in the example of one embodiment of the present invention. 本発明の一実施形態例における、テキストストリーム記憶部のテキストストリームの例を示す説明図。Explanatory drawing which shows the example of the text stream of the text stream memory | storage part in one example of embodiment of this invention. 本発明の一実施形態例における、抽出されたトピックと計算されたトレンドスコアの例を示す説明図。Explanatory drawing which shows the example of the extracted topic and the calculated trend score in one example embodiment of this invention. 本発明の一実施形態例における、番組と関連するトレンドトピックに基づくアイテム推薦のフローチャート。The flowchart of the item recommendation based on the trend topic relevant to the program in the example embodiment of the present invention. 本発明の一実施形態例における、番組関連情報とトピックの関連度を計算する際に用意する隣接行列の一例を示す説明図。Explanatory drawing which shows an example of the adjacency matrix prepared when calculating the relevance degree of program relevant information and a topic in one example of this invention. 本発明の一実施形態例におけるアイテム情報ＤＢ（データベース）の一例を示す説明図。Explanatory drawing which shows an example of item information DB (database) in one embodiment of this invention. 本発明の一実施形態例における出力部が出力する、番組に関連する推薦アイテムの一例を示す説明図。Explanatory drawing which shows an example of the recommendation item relevant to the program which the output part in one embodiment of this invention outputs.

以下、図面を参照しながら本発明の実施の形態を説明するが、本発明は下記の実施形態例に限定されるものではない。図１は本発明の一実施形態例によるアイテム推薦装置の構成を示している。 Hereinafter, embodiments of the present invention will be described with reference to the drawings, but the present invention is not limited to the following embodiments. FIG. 1 shows the configuration of an item recommendation device according to an embodiment of the present invention.

図１に示すアイテム推薦装置は、テキストストリーム記憶部１０、トピック抽出手段としてのトピック抽出部２０、トレンドスコア計算手段を構成するトレンドスコア計算部３０およびトピック・トレンドスコア記憶部４０、Ｗｅｂ情報ＤＢ５０、番組関連Ｗｅｂ情報抽出手段を構成する番組関連Ｗｅｂ情報抽出部６０および番組関連Ｗｅｂ情報記憶部７０、番組・トピック関連度計算手段としての番組・トピック関連度計算部８０、アイテム情報ＤＢ９０、番組・アイテム関連度計算手段としての番組・アイテム関連度計算部１００、出力手段としての出力部１１０を備えている。 The item recommendation device shown in FIG. 1 includes a text stream storage unit 10, a topic extraction unit 20 as a topic extraction unit, a trend score calculation unit 30 and a topic / trend score storage unit 40 constituting a trend score calculation unit, a Web information DB 50, Program-related Web information extracting unit 60 and program-related Web information storage unit 70 constituting program-related Web information extracting unit, program / topic related level calculating unit 80 as program / topic related level calculating unit, item information DB 90, program / item A program / item relevance calculation unit 100 as relevance calculation means and an output unit 110 as output means are provided.

トピック抽出部２０は、マイクロブログ(例えばTwitter)等のテキストの集合が格納されたテキストストリーム記憶部１０から特徴語を含むトピックを抽出する。 The topic extraction unit 20 extracts a topic including a feature word from the text stream storage unit 10 in which a set of text such as a microblog (for example, Twitter) is stored.

トレンドスコア計算部３０は、トピック抽出部２０により抽出されたトピックのトレンド度合いを表すトレンドスコアを計算し、トピックとともにトピック・トレンドスコア記憶部４０に記憶する。 The trend score calculation unit 30 calculates a trend score representing the trend degree of the topic extracted by the topic extraction unit 20, and stores it in the topic / trend score storage unit 40 together with the topic.

番組関連Ｗｅｂ情報抽出部６０は、Ｗｅｂ情報が格納されたＷｅｂ情報ＤＢ５０から番組情報（ＥＰＧ（電子番組表）、ＣＣ（クローズドキャプション）のデータ）と関連するＷｅｂ情報を抽出し、該抽出した番組関連Ｗｅｂ情報を番組関連Ｗｅｂ情報記憶部７０に記憶する。 The program-related Web information extraction unit 60 extracts Web information related to program information (EPG (electronic program guide), CC (closed caption) data) from the Web information DB 50 in which Web information is stored, and the extracted program The related Web information is stored in the program related Web information storage unit 70.

番組・トピック関連度計算部８０は、番組情報、番組関連Ｗｅｂ情報記憶部７０に記憶された番組関連Ｗｅｂ情報およびトピック・トレンドスコア記憶部４０に記憶されたトピックに基づいて、番組とトピックの関連度を計算する。 The program / topic relevance calculation unit 80 associates a program with a topic based on program information, program-related Web information stored in the program-related Web information storage unit 70, and topics stored in the topic / trend score storage unit 40. Calculate the degree.

アイテム情報ＤＢ９０には、アイテムとそれに関する情報が格納されている。 The item information DB 90 stores items and information related thereto.

番組・アイテム関連度計算部１００は、アイテム情報ＤＢ９０内のアイテム、番組・トピック関連度計算部８０により計算された番組とトピックの関連度およびトピック・トレンドスコア記憶部４０に記憶されたトピックのトレンドスコアに基づいて、番組とアイテムの関連度を計算する。 The program / item relevance calculation unit 100 includes the item in the item information DB 90, the relevance between the program and the topic calculated by the program / topic relevance calculation unit 80, and the topic trend stored in the topic / trend score storage unit 40. Based on the score, the degree of association between the program and the item is calculated.

出力部１１０は、番組・アイテム関連度計算部１００によって計算された番組とアイテムの関連度に基づいて、当該番組において推薦するアイテムを出力する。 The output unit 110 outputs an item recommended in the program based on the degree of association between the program and the item calculated by the program / item relevance calculation unit 100.

図１のアイテム推薦装置は、例えばコンピュータにより構成され、通常のコンピュータのハードウェアリソース、例えばＲＯＭ、ＲＡＭ、ＣＰＵ、入力装置、出力装置、通信インターフェース、ハードディスク、記録媒体およびその駆動装置を備えている。 The item recommendation device in FIG. 1 is configured by a computer, for example, and includes hardware resources of a normal computer, such as a ROM, a RAM, a CPU, an input device, an output device, a communication interface, a hard disk, a recording medium, and a driving device thereof. .

このハードウェアリソースとソフトウェアリソース（ＯＳ、アプリケーションなど）との協働の結果、アイテム推薦装置は、図１に示すように、テキストストリーム記憶部１０、トピック抽出部２０、トレンドスコア計算部３０、トピック・トレンドスコア記憶部４０、Ｗｅｂ情報ＤＢ５０、番組関連Ｗｅｂ情報抽出部６０、番組関連Ｗｅｂ情報記憶部７０、番組・トピック関連度計算部８０、アイテム情報ＤＢ９０、番組・アイテム関連度計算部１００、出力部１１０を実装する。 As a result of the cooperation between the hardware resource and the software resource (OS, application, etc.), as shown in FIG. 1, the item recommendation device has a text stream storage unit 10, a topic extraction unit 20, a trend score calculation unit 30, and a topic. Trend score storage unit 40, Web information DB 50, program related web information extraction unit 60, program related web information storage unit 70, program / topic relevance calculation unit 80, item information DB 90, program / item relevance calculation unit 100, output Part 110 is implemented.

前記テキストストリーム記憶部１０、トピック・トレンドスコア記憶部４０、Ｗｅｂ情報ＤＢ５０、番組関連Ｗｅｂ情報記憶部７０、アイテム情報ＤＢ９０は、ハードディスクあるいはＲＡＭなどの保存手段・記憶手段に構築されているものとする。 The text stream storage unit 10, the topic / trend score storage unit 40, the Web information DB 50, the program-related Web information storage unit 70, and the item information DB 90 are constructed in storage means / storage means such as a hard disk or RAM. .

次に、上記のように構成された装置の動作を具体的に説明する。本実施形態例における処理は、テキストストリームからのトレンドトピックの計算、番組と関連するトレンドトピックに基づくアイテム推薦の二つに分かれる。 Next, the operation of the apparatus configured as described above will be specifically described. The processing in the present embodiment is divided into two types: calculation of a trend topic from a text stream, and item recommendation based on a trend topic associated with a program.

最初に、テキストストリームからのトレンドトピックの計算について説明する。図２は、本発明の一実施の形態におけるテキストストリームからのトレンドトピックを計算する処理のフローチャートを示している。 First, calculation of trend topics from a text stream will be described. FIG. 2 shows a flowchart of processing for calculating a trend topic from a text stream according to an embodiment of the present invention.

＜ステップＳ２１０＞コンピュータは、一定時間分のテキストストリームに関して、発生時間、テキスト情報とテキストＩＤをテキストストリーム記憶部１０に記憶する。記憶例を図３に示す。記憶されたテキストストリームは一定時間後削除され、再度新たに一定時間分のテキストストリームを記憶する。ここで、テキストストリームとはマイクロブログ(例えばTwitter)等のリアルタイム性の高いテキストを想定している。 <Step S210> The computer stores the generation time, text information, and text ID in the text stream storage unit 10 for the text stream for a predetermined time. An example of storage is shown in FIG. The stored text stream is deleted after a predetermined time, and a new text stream for a predetermined time is stored again. Here, the text stream is assumed to be text with high real-time properties such as a microblog (for example, Twitter).

＜ステップＳ２２０＞トピック抽出部２０が、テキストストリーム記憶部１０に記憶されたテキストｄ∈Ｄからトピックｚ∈Ｚを抽出する。ここでＤはテキスト集合、Ｚはトピック集合であり、トピックｚは特徴語ｗ∈Ｗの集合で表される（Ｗは語彙全体の集合）。トピック抽出のために、トピック抽出部２０は形態素解析器を用いて、テキストを名詞・動詞・形容詞等の単語集合に分割する。トピックの抽出には、代表的なトピックモデルであるＬａｔｅｎｔＤｉｒｉｃｈｌｅｔＡｌｌｏｃａｔｉｏｎ（ＬＤＡ）（非特許文献１）等、公知のものを用いることができる。図４に抽出されたトピックの例を示す。各トピックはトピックＩＤおよびトレンドスコアを持つ。トレンドスコアの算出方法は後述する。 <Step S220> The topic extraction unit 20 extracts the topic zεZ from the text dεD stored in the text stream storage unit 10. Here, D is a text set, Z is a topic set, and topic z is represented by a set of feature words wεW (W is a set of the entire vocabulary). For topic extraction, the topic extraction unit 20 uses a morphological analyzer to divide the text into word sets such as nouns, verbs, and adjectives. For the topic extraction, a well-known topic model such as Late Dichlet Allocation (LDA) (Non-Patent Document 1), which is a typical topic model, can be used. FIG. 4 shows an example of the extracted topic. Each topic has a topic ID and a trend score. A method for calculating the trend score will be described later.

＜ステップＳ２３０＞トレンドスコア計算部３０が、抽出された各トピックｚ_iのトレンドスコアＴ_ziを計算する。トレンドスコアは、あるトピックに関係するテキストが多く発生しているほど、そのトピックはトレンド度合が高いとみなして算出する。例えば以下の式（１）のように算出する。 <Step S230> The trend score calculation unit 30 calculates the trend score T _zi of each extracted topic z _i . The trend score is calculated by regarding that the topic has a higher degree of trend as the more text related to the topic is generated. For example, it calculates like the following formula | equation (1).

ここで、ｉおよびｊはそれぞれトピックとテキストのＩＤであり、Ｐ（ｚ_i｜ｄ_j）はテキストｄ_jにおけるトピックｚ_iの生起確率である。図４に各トピックに関して計算されたトレンドスコアの例を示す。 Here, i and j are topic and text IDs, respectively, and P (z _i | d _j ) is the occurrence probability of topic z _i in text d _j . FIG. 4 shows an example of the trend score calculated for each topic.

＜ステップＳ２４０＞抽出・計算したトピックとトレンドスコアを、トピック・トレンドスコア記憶部４０に記憶する。記憶例は図４に示す通りである。 <Step S240> The extracted and calculated topic and trend score are stored in the topic / trend score storage unit 40. An example of storage is as shown in FIG.

次に、番組と関連するトレンドトピックに基づくアイテム推薦について説明する。図５は、本発明の一実施の形態における番組と関連するトレンドトピックに基づくアイテムを推薦する処理のフローチャートを示している。 Next, item recommendation based on trend topics related to programs will be described. FIG. 5 shows a flowchart of a process for recommending an item based on a trend topic associated with a program according to an embodiment of the present invention.

＜ステップＳ３１０＞Ｗｅｂ情報ＤＢ５０には、例えばインターネットをクロールして得られた、Wikipedia（登録商標）記事やTwitterの番組ハッシュタグ付きTweet（登録商標）等が格納されており、番組関連Ｗｅｂ情報抽出部６０からの要求に従って、番組ＥＰＧやＣＣと関連するＷｅｂ情報を送信する。番組関連Ｗｅｂ情報抽出部６０は、例えばＥＰＧやＣＣとＷｅｂ情報ＤＢ５０内のＷｅｂ情報のテキストを形態素解析器によって単語単位に分割した後、各テキストに関して出現する単語情報に基づき、テキストの特徴を表すベクトルを構築する。それら特徴ベクトルの関連度をコサイン類似度等によって計算し、番組と関連するＷｅｂ情報を抽出する。 <Step S310> The Web information DB 50 stores, for example, Wikipedia (registered trademark) articles obtained by crawling the Internet, Tweet (registered trademark) with Twitter program hashtags, and the like. Web information related to the programs EPG and CC is transmitted according to the request from the unit 60. The program-related Web information extraction unit 60 divides the text of the Web information in the EPG, CC, and Web information DB 50 into word units by a morphological analyzer, for example, and then represents the characteristics of the text based on the word information that appears for each text. Build a vector. The relevance of these feature vectors is calculated based on the cosine similarity and the like, and Web information related to the program is extracted.

＜ステップＳ３２０＞番組関連Ｗｅｂ情報抽出部６０の要求に応じてＷｅｂ情報ＤＢ５０から送信された情報を番組関連Ｗｅｂ情報記憶部７０に記憶する。視聴中の番組情報であるＥＰＧやＣＣが時間と共に変わる度に、番組関連Ｗｅｂ情報記憶部７０に記憶される番組関連Ｗｅｂ情報を更新する。 <Step S320> Information transmitted from the Web information DB 50 in response to a request from the program-related Web information extraction unit 60 is stored in the program-related Web information storage unit 70. Every time EPG or CC, which is program information being viewed, changes with time, the program-related Web information stored in the program-related Web information storage unit 70 is updated.

＜ステップＳ３３０＞番組・トピック関連度計算部８０が番組(ＥＰＧ・ＣＣテキスト)を示すｔｖ、番組関連Ｗｅｂ情報記憶部７０に記憶された番組関連Ｗｅｂ情報ａ∈Ａ（Ａは番組関連Ｗｅｂ情報の集合）、およびトピック・トレンドスコア記憶部４０に記憶されたトピックｚ∈Ｚとの関連度を算出する。番組ｔｖとトピックｚとを幅広く関連付けるために、番組関連Ｗｅｂ情報ａをそれらの情報を繋ぐ橋渡しとして用いる。具体的には、例えば番組ｔｖ、番組関連Ｗｅｂ情報ａ、トピックｚおよび特徴語ｗ∈Ｗとの関係を表すグラフを想定し、隣接行列Ｃを算出する。その隣接行列Ｃに基づいて、各々の特徴量を計算し、各特徴量を比較することで関連度を算出する。 <Step S330> tv indicating that the program / topic relevance calculation unit 80 indicates a program (EPG / CC text), program-related Web information aεA (A is the program-related Web information stored in the program-related Web information storage unit 70) Set), and the degree of association with the topic zεZ stored in the topic trend score storage unit 40 is calculated. In order to widely associate the program tv and the topic z, the program-related Web information a is used as a bridge connecting the information. Specifically, for example, an adjacency matrix C is calculated assuming a graph representing the relationship between the program tv, the program-related Web information a, the topic z, and the feature word wεW. Based on the adjacency matrix C, each feature amount is calculated, and the degree of association is calculated by comparing each feature amount.

隣接行列Ｃは番組情報やトピック、特徴語の関係を、図６に示すような一つの行列として表現したものである。隣接行列Ｃは次の手順で構築される。隣接行列として（１＋｜Ｗ｜＋｜Ｚ｜＋｜Ａ｜）×（１＋｜Ｗ｜＋｜Ｚ｜＋｜Ａ｜）の行列Ｃ＝（ｃ_j，_k）を用意する。ｊはｊ＝１のときに番組を表し、２≦ｊ≦｜Ｗ｜＋１のときに特徴語を表す。また、｜Ｗ｜＋２≦ｊ≦｜Ｗ｜＋｜Ｚ｜＋１のときにトピックを表し、｜Ｗ｜＋｜Ｚ｜＋２≦ｊ≦｜Ｗ｜＋｜Ｚ｜＋｜Ａ｜＋１のとき番組関連Ｗｅｂ情報を表す。これはｋも同様とする。例えば、ｃ_j,k＝１（ｊ＝１，２≦ｋ≦｜Ｗ｜＋１）は、行列中の番組と特徴語の関係を表す値が１であることを表す。 The adjacency matrix C expresses the relationship between program information, topics, and feature words as one matrix as shown in FIG. The adjacency matrix C is constructed by the following procedure. A matrix C = (c _j , _k ) of (1+ | W | + | Z | + | A |) × (1+ | W | + | Z | + | A |) is prepared as an adjacency matrix. j represents a program when j = 1, and represents a feature word when 2 ≦ j ≦ | W | +1. Also, a topic is represented when | W | + 2 ≦ j ≦ | W | + | Z | +1, and a program when | W | + | Z | + 2 ≦ j ≦ | W | + | Z | + | A | +1 Represents related Web information. The same applies to k. For example, c _{j, k} = 1 (j = 1, 2 ≦ k ≦ | W | +1) represents that the value representing the relationship between the program in the matrix and the feature word is 1.

行列の各要素の値は次のように計算する。トピックｚと特徴語ｗとの組み合わせを表す行列の値ｃ_j,kには、例えば前述の非特許文献１のＬＤＡによって算出した、単語−トピック確率Ｐ（ｗ_k｜ｚ_j）を代入する。番組ｔｖおよび番組関連Ｗｅｂ情報ａと特徴語ｗとの組み合わせを表す行列の値ｃ_j,kには、例えばｔｖおよびａのテキスト中に単語ｗが発生する単語頻度ｔｆに基づいて、ｔｆ／σを代入する。ここでσは定数である。最後に、まだ値の代入していない要素ｃ_j,kには０を代入する。 The value of each element of the matrix is calculated as follows: For example, the word-topic probability P (w _k | z _j ) calculated by the LDA of Non-Patent Document 1 described above is substituted into the matrix value c _{j, k} representing the combination of the topic z and the feature word w. The matrix value c _{j, k} representing the combination of the program tv and the program-related Web information a and the feature word w is, for example, tf / σ based on the word frequency tf in which the word w occurs in the text of tv and a. Is assigned. Here, σ is a constant. Finally, 0 is assigned to the element c _{j, k} that has not yet been assigned a value.

番組ｔｖとトピックｚとの関連度を計算するために、構築した隣接行列Ｃに基づき、各々の特徴量を計算する。特徴量は、例えば以下の式（２）のように、隣接行列からグラフラプラシアンを構築し、固有値分解によって計算する。 In order to calculate the degree of association between the program tv and the topic z, each feature amount is calculated based on the constructed adjacency matrix C. The feature amount is calculated by constructing a graph Laplacian from the adjacency matrix and performing eigenvalue decomposition as in the following equation (2), for example.

ここでＢは隣接行列Ｃの各行の和を対角成分の値に持つ対角行列、Ｐは固有ベクトルを列成分に保持した行列、∧は固有値を対角成分に保持した行列である。Ｐの行成分のベクトルをＰ_iとしたとき、例えば、隣接行列において第一成分に位置する番組ｔｖの特徴量はＰ₁となる。この特徴量は、隣接行列Ｃに基づくグラフの内在的な構造を表しており、関連度の強いノードほど似た特徴量になる。したがって、これらの特徴量を比較することで関連度を計算できる。また、固有ベクトルＰの算出は、非特許文献２に記載されたＰｏｗｅｒＩｔｅｒａｔｉｏｎ等の公知の技術を用いることができる。 Here, B is a diagonal matrix having the sum of each row of the adjacency matrix C as diagonal component values, P is a matrix holding eigenvectors as column components, and ∧ is a matrix holding eigenvalues as diagonal components. When the vector of the row component of P is P _i , for example, the feature quantity of the program tv located at the first component in the adjacency matrix is P ₁ . This feature amount represents the intrinsic structure of the graph based on the adjacency matrix C, and the more similar the node, the more similar the feature amount. Therefore, the degree of association can be calculated by comparing these feature amounts. The eigenvector P can be calculated using a known technique such as Power Iteration described in Non-Patent Document 2.

上記の通りに算出した特徴量をもとに、番組ｔｖとトピックｚとの関連度Ｒ_tv,ziを、例えばコサイン類似度等によって以下の式（３）のように計算する。 Based on the feature amount calculated as described above, the degree of association R _{tv, zi} between the program tv and the topic z is calculated by the following equation (3) based on the cosine similarity, for example.

ここで、ｉはトピックのＩＤを表す。 Here, i represents a topic ID.

＜ステップＳ３４０＞番組・アイテム関連度計算部１００が、アイテム情報ＤＢ９０に記憶されたアイテムｌ∈Ｌと番組ｔｖとの関連度を、トレンドトピックｚとの関連度も考慮して計算する。図７にアイテム情報ＤＢ９０の記憶例を示す。各アイテムに関してアイテムＩＤ、ジャンル、アイテム名、アイテムメタデータ等が記憶されている。 <Step S340> The program / item relevance calculation unit 100 calculates the relevance between the item lεL stored in the item information DB 90 and the program tv in consideration of the relevance with the trend topic z. FIG. 7 shows a storage example of the item information DB 90. For each item, item ID, genre, item name, item metadata, etc. are stored.

まずアイテムとトピックの関連度Ｒ_lk,ziを算出する。関連度Ｒ_lk,ziは、例えば前述したテキストストリームにトピックモデル（ＬＤＡ）を適用して抽出した結果（トピックｚ_i）を用いて、以下の式（４）のように計算できる。 First, the degree of association R _{lk, zi} between items and topics is calculated. The relevance R _{lk, zi} can be calculated as shown in the following equation (4) using, for example, a result (topic z _i ) extracted by applying the topic model (LDA) to the above-described text stream.

ここで、ｋはアイテムのＩＤを示す。またｎ_lk（ｗ）はアイテムメタデータｌ_kにおいて単語ｗが発生する回数を表す。 Here, k indicates an item ID. N _lk (w) represents the number of occurrences of the word w in the item metadata l _k .

以上の通り計算した、トピックのトレンドスコアＴ_zi、番組とトピックの関連度Ｒ_tv,zi、アイテムとトピックの関連度Ｒ_lk,zi、を用いて、最終的に番組ｔｖとアイテムｌ_kの関連度Ｒ_tv,lkを例えば次の式（５）ように算出する。 Using the topic trend score T _zi , the program-topic relevance level R _{tv, zi} , and the item-topic relevance level R _{lk, zi} , the relationship between the program tv and the item l _k is finally _obtained . The degree R _{tv, lk} is calculated _, for example, by the following equation (5).

ここでλはトレンドスコアや関連度の比重を決定する定数である。 Here, λ is a constant that determines the specific gravity of the trend score and relevance.

＜ステップＳ３５０＞出力部１１０は、番組ｔｖとアイテムｌ_kの関連度Ｒ_tv,lkに基づき、関連度の高い順にアイテムを出力する。ここで、出力とは、ディスプレイへの表示、プリンタへの印字、音出力、外部装置への送信等を含む概念である。出力部１１０は出力デバイスのドライバソフトまたは、出力デバイスのドライバソフトと出力デバイス等で実現され得る。出力の具体例を図８に示す。図８はマラソン番組を視聴中のユーザに話題のシューズを推薦する例である。ＳＮＳ（ＳｏｃｉａｌＮｅｔｗｏｒｋｉｎｇＳｅｒｖｉｃｅ）においてトレンドトピックを特定することで、例えば話題の商品を検出し、番組関連Ｗｅｂ情報によってマラソンとシューズを連想させることで、関連性のあるトレンドアイテムの推薦を可能にしている。 <Step S350> The output unit 110 includes a program tv and item l _k of relevance R _tv, based on _lk, and output items in the descending order of the degree of relation. Here, output is a concept including display on a display, printing on a printer, sound output, transmission to an external device, and the like. The output unit 110 may be realized by output device driver software, or output device driver software and an output device. A specific example of output is shown in FIG. FIG. 8 shows an example of recommending a topical shoe to a user who is viewing a marathon program. By identifying trend topics in SNS (Social Networking Service), it is possible to recommend trend items that are relevant by detecting, for example, topical products and associating marathons and shoes with program-related Web information .

また、本実施形態のアイテム推薦装置における各手段の一部もしくは全部の機能をコンピュータのプログラムで構成し、そのプログラムをコンピュータを用いて実行して本発明を実現することができること、本実施形態のアイテム推薦方法における手順をコンピュータのプログラムで構成し、そのプログラムをコンピュータに実行させることができることは言うまでもなく、コンピュータでその機能を実現するためのプログラムを、そのコンピュータが読み取り可能な記録媒体、例えばＦＤ（Ｆｌｏｐｐｙ（登録商標）Ｄｉｓｋ）や、ＭＯ（Ｍａｇｎｅｔｏ−Ｏｐｔｉｃａｌｄｉｓｋ）、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、メモリカード、ＣＤ（ＣｏｍｐａｃｔＤｉｓｋ）−ＲＯＭ、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋ）−ＲＯＭ、ＣＤ−Ｒ、ＣＤ−ＲＷ、ＨＤＤ、リムーバブルディスクなどに記録して、保存したり、配布したりすることが可能である。また、上記のプログラムをインターネットや電子メールなど、ネットワークを通して提供することも可能である。 In addition, a part or all of the functions of each unit in the item recommendation device of the present embodiment can be configured by a computer program, and the program can be executed using the computer to realize the present invention. It goes without saying that the procedure in the item recommendation method can be constituted by a computer program, and the program can be executed by the computer, and the computer-readable recording medium, for example, FD, can be realized by the computer. (Floppy (registered trademark) Disk), MO (Magneto-Optical disk), ROM (Read Only Memory), memory card, CD (Compact Disk) -ROM, DVD (Digital Versati) e Disk) -ROM, CD-R, CD-RW, HDD, and recorded in a removable disk, or stored, it is possible or distribute. It is also possible to provide the above program through a network such as the Internet or electronic mail.

本発明は、上記の実施の形態に限定されることなく、特許請求の範囲内において種々変更・応用が可能である。 The present invention is not limited to the above-described embodiments, and various modifications and applications can be made within the scope of the claims.

１０…テキストストリーム記憶部
２０…トピック抽出部
３０…トレンドスコア計算部
４０…トピック・トレンドスコア記憶部
５０…Ｗｅｂ情報ＤＢ
６０…番組関連Ｗｅｂ情報抽出部
７０…番組関連Ｗｅｂ情報記憶部
８０…番組・トピック関連度計算部
９０…アイテム情報ＤＢ
１００…番組・アイテム関連度計算部
１１０…出力部 DESCRIPTION OF SYMBOLS 10 ... Text stream memory | storage part 20 ... Topic extraction part 30 ... Trend score calculation part 40 ... Topic / trend score memory | storage part 50 ... Web information DB
60 ... Program-related Web information extraction unit 70 ... Program-related Web information storage unit 80 ... Program / topic relevance calculation unit 90 ... Item information DB
100: Program / item relevance calculation unit 110 ... Output unit

Claims

番組と関連するアイテムを推薦するアイテム推薦装置であって、
テキストの集合が格納されたテキストストリーム記憶部から特徴語を含むトピックを抽出するトピック抽出手段と、
前記トピック抽出手段により抽出されたトピックのトレンド度合いを表すトレンドスコアを計算し、前記トピックとともに記憶するトレンドスコア計算手段と、
Ｗｅｂ情報が格納されたＷｅｂ情報データベースから番組情報と関連するＷｅｂ情報を抽出し、該抽出した番組関連Ｗｅｂ情報を記憶する番組関連Ｗｅｂ情報抽出手段と、
番組情報、前記番組関連Ｗｅｂ情報抽出手段により抽出された番組関連Ｗｅｂ情報および前記トレンドスコア計算手段により記憶されたトピックに基づいて、番組とトピックの関連度を計算する番組・トピック関連度計算手段と、
アイテムとそれに関する情報が格納されたアイテム情報データベースと、
前記アイテム情報データベース内のアイテム、前記番組・トピック関連度計算手段により計算された番組とトピックの関連度および前記トレンドスコア計算手段により計算し記憶されたトピックのトレンドスコアに基づいて、番組とアイテムの関連度を計算する番組・アイテム関連度計算手段と、
前記番組・アイテム関連度計算手段によって計算された番組とアイテムの関連度に基づいて、当該番組において推薦するアイテムを出力する出力手段と、
を備えたことを特徴とするアイテム推薦装置。 An item recommendation device for recommending items related to a program,
Topic extraction means for extracting a topic including a feature word from a text stream storage unit in which a set of texts is stored;
A trend score calculating means for calculating a trend score representing a trend degree of a topic extracted by the topic extracting means, and storing the trend score together with the topic;
Program-related Web information extracting means for extracting Web information related to program information from a Web information database in which Web information is stored, and storing the extracted program-related Web information;
Program / topic relevance calculating means for calculating relevance between a program and a topic based on program information, program related Web information extracted by the program related Web information extracting means, and a topic stored by the trend score calculating means; ,
An item information database storing items and information about them,
Based on the item in the item information database, the relevance between the program and the topic calculated by the program / topic relevance calculating means, and the trend score of the topic calculated and stored by the trend score calculating means, Program / item relevance calculation means for calculating relevance;
Output means for outputting an item recommended in the program based on the degree of association between the program and the item calculated by the program / item relevance calculating means;
An item recommendation device comprising:

前記トレンドスコア計算手段は、前記テキストストリーム記憶部に格納されたテキスト集合Ｄの各テキストをｄ_j、前記抽出された各トピックをｚ_iとし、各トピックｚ_iのトレンドスコアＴｚ_iを、

なる計算式によって算出し、
前記番組・トピック関連度計算手段は、前記番組情報をｔｖ、番組関連Ｗｅｂ情報をａ∈Ａ（Ａは番組関連Ｗｅｂ情報の集合）、前記トピックをｚ∈Ｚ（Ｚはトピック集合）、前記特徴語をｗ∈Ｗ（Ｗは語彙全体の集合）とし、これらとの関係を表すグラフを想定し、（１＋｜Ｗ｜＋｜Ｚ｜＋｜Ａ｜）×（１＋｜Ｗ｜＋｜Ｚ｜＋｜Ａ｜）の隣接行列Ｃ＝（ｃ_j，_k）を構築し、該隣接行列Ｃに基づいて番組ｔｖおよびトピックｚの各特徴量を求め、それらの特徴量を比較することによって番組とトピックの関連度Ｒ_tv,ziを算出し、
前記番組・アイテム関連度計算手段は、トピック−アイテム確率をＰ（ｚ_i｜ｌ_k）、トピック−単語確率をＰ（ｚ_i｜ｗ）、単語−アイテム確率をＰ（ｗ｜ｌ_k）、テキストｄ_jにおけるトピックｚ_iの生起確率をＰ（ｚ_i｜ｄ_j）、アイテムメタデータｌ_kにおいて単語ｗが発生する回数をｎ_lk（ｗ）とし、アイテムとトピックの関連度Ｒ_lk,ziを、

なる計算式によって算出し、
さらに、トレンドスコアおよび関連度の比重を決定する定数λ、前記トレンドスコア計算手段により計算されたトピックのトレンドスコアＴ_zi、前記番組・トピック関連度計算手段により計算された番組とトピックの関連度Ｒ_tv,ziおよび前記算出されたアイテムとトピックの関連度Ｒ_lk,ziとによって、番組とアイテムの関連度Ｒ_tv,lkを、

なる計算式によって算出する
ことを特徴とする請求項１に記載のアイテム推薦装置。 The trend score calculation means sets each text of the text set D stored in the text stream storage unit as d _j , each extracted topic as z _i, and the trend score Tz _i of each topic z _i ,

Calculated by the following formula:
The program / topic relevance calculation means includes tv as the program information, aεA as program-related Web information (A is a set of program-related Web information), zεZ (Z is a topic set), and the features. A word is assumed as w∈W (W is a set of whole vocabulary), and a graph representing the relationship with these is assumed, and (1+ | W | + | Z | + | A |) × (1+ | W | + | Z | + | A |) is constructed, an adjacency matrix C = (c _j , _k ) is constructed, feature quantities of the program tv and topic z are obtained based on the adjacency matrix C, and the feature quantities are compared with each other. Calculate topic relevance R _{tv, zi} ,
The program / item relevance calculation means has a topic-item probability P (z _i | l _k ), a topic-word probability P (z _i | w), a word-item probability P (w | l _k ), The occurrence probability of the topic z _i in the text d _j is P (z _i | d _j ), the number of occurrences of the word w in the item metadata l _k is n _lk (w), and the item-topic relevance R _{lk, zi} The

Calculated by the following formula:
Furthermore, a constant λ that determines the specific gravity of the trend score and the relevance degree, a trend score T _{zi of the} topic calculated by the trend score calculation means, and a relevance degree R of the program and topic calculated by the program / topic relevance calculation means _{Based on tv, zi} and the calculated item-topic relevance R _{lk, zi} , the program-item relevance R _{tv, lk} is

The item recommendation device according to claim 1, wherein the item recommendation device is calculated by the following formula.

番組と関連するアイテムを推薦するアイテム推薦方法であって、
トピック抽出手段が、テキストの集合が格納されたテキストストリーム記憶部から特徴語を含むトピックを抽出するトピック抽出ステップと、
トレンドスコア計算手段が、前記トピック抽出手段により抽出されたトピックのトレンド度合いを表すトレンドスコアを計算し、前記トピックとともに記憶するトレンドスコア計算ステップと、
番組関連Ｗｅｂ情報抽出手段が、Ｗｅｂ情報が格納されたＷｅｂ情報データベースから番組情報と関連するＷｅｂ情報を抽出し、該抽出した番組関連Ｗｅｂ情報を記憶する番組関連Ｗｅｂ情報抽出ステップと、
番組・トピック関連度計算手段が、番組情報、前記番組関連Ｗｅｂ情報抽出手段により抽出された番組関連Ｗｅｂ情報および前記トレンドスコア計算手段により記憶されたトピックに基づいて、番組とトピックの関連度を計算する番組・トピック関連度計算ステップと、
番組・アイテム関連度計算手段が、アイテムとそれに関する情報が格納されたアイテム情報データベース内のアイテム、前記番組・トピック関連度計算手段により計算された番組とトピックの関連度および前記トレンドスコア計算手段により計算し記憶されたトピックのトレンドスコアに基づいて、番組とアイテムの関連度を計算する番組・アイテム関連度計算ステップと、
出力手段が、前記番組・アイテム関連度計算手段によって計算された番組とアイテムの関連度に基づいて、当該番組において推薦するアイテムを出力する出力ステップと、
を備えたことを特徴とするアイテム推薦方法。 An item recommendation method for recommending items related to a program,
A topic extraction means for extracting a topic including a feature word from a text stream storage unit in which a set of texts is stored;
A trend score calculating means for calculating a trend score representing a trend degree of the topic extracted by the topic extracting means, and storing the trend score together with the topic; and
A program-related Web information extracting unit, wherein a program-related Web information extracting unit extracts Web information related to program information from a Web information database in which Web information is stored, and stores the extracted program-related Web information;
The program / topic relevance calculating means calculates the relevance between the program and the topic based on the program information, the program related Web information extracted by the program related Web information extracting means, and the topic stored by the trend score calculating means. A program / topic relevance calculation step,
The program / item relevance calculating means includes an item in an item information database storing items and information related thereto, the relevance between the program and the topic calculated by the program / topic relevance calculating means, and the trend score calculating means. A program / item relevance calculating step for calculating a relevance between a program and an item based on a trend score of a topic calculated and stored;
An output step of outputting an item recommended in the program based on the degree of association between the program and the item calculated by the program / item relevance calculating unit;
Item recommendation method characterized by comprising.

前記トレンドスコア計算ステップは、前記テキストストリーム記憶部に格納されたテキスト集合Ｄの各テキストをｄ_j、前記抽出された各トピックをｚ_iとし、各トピックｚ_iのトレンドスコアＴｚ_iを、

なる計算式によって算出し、
前記番組・トピック関連度計算ステップは、前記番組情報をｔｖ、番組関連Ｗｅｂ情報をａ∈Ａ（Ａは番組関連Ｗｅｂ情報の集合）、前記トピックをｚ∈Ｚ（Ｚはトピック集合）、前記特徴語をｗ∈Ｗ（Ｗは語彙全体の集合）とし、これらとの関係を表すグラフを想定し、（１＋｜Ｗ｜＋｜Ｚ｜＋｜Ａ｜）×（１＋｜Ｗ｜＋｜Ｚ｜＋｜Ａ｜）の隣接行列Ｃ＝（ｃ_j，_k）を構築し、該隣接行列Ｃに基づいて番組ｔｖおよびトピックｚの各特徴量を求め、それらの特徴量を比較することによって番組とトピックの関連度Ｒ_tv,ziを算出し、
前記番組・アイテム関連度計算ステップは、トピック−アイテム確率をＰ（ｚ_i｜ｌ_k）、トピック−単語確率をＰ（ｚ_i｜ｗ）、単語−アイテム確率をＰ（ｗ｜ｌ_k）、テキストｄ_jにおけるトピックｚ_iの生起確率をＰ（ｚ_i｜ｄ_j）、アイテムメタデータｌ_kにおいて単語ｗが発生する回数をｎ_lk（ｗ）とし、アイテムとトピックの関連度Ｒ_lk,ziを、

なる計算式によって算出する
ことを特徴とする請求項３に記載のアイテム推薦方法。 In the trend score calculation step, each text of the text set D stored in the text stream storage unit is d _j , each extracted topic is z _i, and the trend score Tz _i of each topic z _i is

Calculated by the following formula:
In the program / topic relevance calculation step, the program information is tv, the program-related Web information is aεA (A is a set of program-related Web information), the topic is zεZ (Z is a topic set), and the features A word is assumed as w∈W (W is a set of whole vocabulary), and a graph representing the relationship with these is assumed, and (1+ | W | + | Z | + | A |) × (1+ | W | + | Z | + | A |) is constructed, an adjacency matrix C = (c _j , _k ) is constructed, feature quantities of the program tv and topic z are obtained based on the adjacency matrix C, and the feature quantities are compared with each other. Calculate topic relevance R _{tv, zi} ,
In the program / item relevance calculation step, the topic-item probability is P (z _i | l _k ), the topic-word probability is P (z _i | w), the word-item probability is P (w | l _k ), The occurrence probability of the topic z _i in the text d _j is P (z _i | d _j ), the number of occurrences of the word w in the item metadata l _k is n _lk (w), and the item-topic relevance R _{lk, zi} The

The item recommendation method according to claim 3, wherein the item recommendation method is calculated by the following formula.

コンピュータを請求項１又は２に記載の各手段として機能させるアイテム推薦プログラム。 An item recommendation program for causing a computer to function as each means according to claim 1.