JP7240240B2

JP7240240B2 - Information processing device, information processing method, and program

Info

Publication number: JP7240240B2
Application number: JP2019083031A
Authority: JP
Inventors: 聡基川端
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2019-04-24
Filing date: 2019-04-24
Publication date: 2023-03-15
Anticipated expiration: 2039-04-24
Also published as: JP2020181316A

Description

本発明は、情報処理装置、情報処理方法、およびプログラムに関する。 The present invention relates to an information processing device, an information processing method, and a program.

従来、ウェブページを閲覧した閲覧結果に基づいて、ユーザの行動を推定する技術が知られている（例えば、特許文献１参照）。 Conventionally, there is known a technique for estimating a user's behavior based on browsing results of browsing a web page (see Patent Document 1, for example).

特開２０１２－１６４３２３号公報JP 2012-164323 A

しかしながら、従来の技術では、ユーザの状況に応じた検索結果を出力することができても、その後、ユーザが、商品、またはサービスの購入、サービスへの入会、或いは、不正決済等の所定の行動を行うことを推定することができず、その結果、適切なアクションを行うことができなかった。 However, in the conventional technology, even if the search results according to the user's situation can be output, after that, the user does not perform a predetermined action such as purchasing a product or service, joining a service, or fraudulent payment. and as a result could not take appropriate action.

本発明は、上記の課題に鑑みてなされたものであり、ユーザの行動をより正確に推定し、適切なアクションを行うことができる情報処理装置、情報処理方法、およびプログラムを提供することを目的としている。 SUMMARY OF THE INVENTION It is an object of the present invention to provide an information processing apparatus, an information processing method, and a program capable of more accurately estimating a user's behavior and taking an appropriate action. and

本発明の一態様は、ネットワークを介したウェブページへのアクセスログを取得する取得部と、前記アクセスログにおける所定の行動の有無を教師データとして、入力されたアクセスログにおいて前記所定の行動が行われるかを出力するように学習された分類器の出力結果に基づいて、前記取得部により取得された前記アクセスログに示されるウェブページを閲覧した閲覧者に提供すべき情報を決定する決定部とを備える情報処理装置である。 According to one aspect of the present invention, an acquisition unit acquires an access log to a web page via a network; a determination unit that determines information to be provided to a viewer who viewed the web page indicated in the access log acquired by the acquisition unit, based on the output result of a classifier that has learned to output whether the It is an information processing device comprising

また、本発明の一態様は、ネットワークを介したウェブページへのアクセスログを取得する取得部と、前記アクセスログにおける所定の行動の有無を学習データとして、入力されたアクセスログにおいて前記所定の行動が行われたかを出力するように学習された分類器の出力結果に基づいて、前記取得部によって取得された前記アクセスログに示されるウェブページを閲覧した閲覧者によって前記所定の行動が行われたか否かを推定し、前記所定の行動が行われたと推定した場合、購入に係る処理を実行する外部装置に対して、処理の中止を指示することを決定する決定部とを備える情報処理装置である。 In one aspect of the present invention, an acquisition unit acquires an access log to a web page via a network; Based on the output result of a classifier trained to output whether the predetermined action was performed by the viewer who viewed the web page indicated in the access log acquired by the acquisition unit an information processing apparatus comprising: a determination unit that determines whether or not the purchase has occurred, and if it is estimated that the predetermined action has been performed, instructs an external device that executes processing related to purchase to stop the processing be.

本発明の一態様によれば、ユーザの行動をより正確に推定し、適切なアクションを行うことができる。 ADVANTAGE OF THE INVENTION According to one aspect of the present invention, a user's behavior can be estimated more accurately and an appropriate action can be taken.

情報処理装置１００を含む情報処理システム１の一例を示す図である。1 is a diagram showing an example of an information processing system 1 including an information processing device 100; FIG. アクセスログの一例を示す図である。It is a figure which shows an example of an access log. 情報処理装置１００の構成の一例を示す図である。1 is a diagram illustrating an example of a configuration of an information processing device 100; FIG. 第１実施形態における情報処理装置１００による運用時の一連の処理の流れを示すフローチャートである。4 is a flow chart showing a series of processes during operation by the information processing apparatus 100 according to the first embodiment; 分類器２００の一例を示す図である。FIG. 2 is a diagram showing an example of a classifier 200; FIG. 時間経過に応じたＲＮＮ２４０の処理内容を並列に展開した図である。It is the figure which expand|deployed the processing content of RNN240 according to time passage in parallel. 第１実施形態における情報処理装置１００による学習時の一連の処理の流れを示すフローチャートである。4 is a flow chart showing a series of processing during learning by the information processing apparatus 100 according to the first embodiment. 実施形態の情報処理装置１００のハードウェア構成の一例を示す図である。It is a figure which shows an example of the hardware constitutions of the information processing apparatus 100 of embodiment.

以下、本発明を適用した情報処理装置、情報処理方法、およびプログラムを、図面を参照して説明する。 Hereinafter, an information processing apparatus, an information processing method, and a program to which the present invention is applied will be described with reference to the drawings.

［概要］
情報処理装置は、一以上のプロセッサにより実現される。情報処理装置は、ウェブページへのアクセスログを取得する。情報処理装置は、予め学習された分類器に基づいて、取得したアクセスログに示されるウェブページを閲覧する者（以下、閲覧者）によって所定の行動が行われるかを出力する。分類器は、ネットワークを介して所定の行動が行われたアクセスログを学習データ、及び教師データとして、入力されたアクセスログの閲覧者によって所定の行動が行われるか否かを示す分類結果を出力するように学習される。このような分類器を利用することで、情報処理装置は、閲覧者が所定の行動を行うと推定される場合に、適切なアクションを、閲覧者、又は所定の装置に対して行うことができる。以下、商品またはサービスをまとめて単に「商品」と称して説明する。 [overview]
An information processing device is realized by one or more processors. An information processing device acquires an access log to a web page. The information processing device outputs whether a person viewing the web page indicated in the acquired access log (hereinafter referred to as a viewer) performs a predetermined action based on a pre-learned classifier. The classifier outputs a classification result indicating whether or not a given action is taken by a viewer of the input access log, using access logs in which a given action is performed via the network as learning data and teacher data. learned to do. By using such a classifier, the information processing device can perform an appropriate action for the viewer or the predetermined device when it is estimated that the viewer will perform a predetermined action. . Hereinafter, products or services will be collectively referred to simply as "products" for explanation.

＜第１実施形態＞
［全体構成］
図１は、情報処理装置１００を含む情報処理システム１の一例を示す図である。第１実施形態における情報処理システム１は、例えば、一以上の端末装置１０と、サービス提供装置２０と、決済処理装置３０と、情報処理装置１００とを備える。これらの装置は、ネットワークＮＷを介して接続される。また、これらの装置のうち一部は、他の装置に仮想的な装置として包含されてもよく、例えば、サービス提供装置２０の機能の一部または全部、決済処理装置３０の機能の一部または全部等が、情報処理装置１００の機能によって実現される仮想マシンであってもよい。決済処理装置３０は、「外部装置」の一例である。 <First embodiment>
[overall structure]
FIG. 1 is a diagram showing an example of an information processing system 1 including an information processing device 100. As shown in FIG. The information processing system 1 according to the first embodiment includes, for example, one or more terminal devices 10 , a service providing device 20 , a payment processing device 30 and an information processing device 100 . These devices are connected via a network NW. Also, some of these devices may be included in other devices as virtual devices. For example, some or all of the functions of the service providing device 20, or All of them may be virtual machines implemented by the functions of the information processing apparatus 100 . The payment processing device 30 is an example of an “external device”.

図１に示す各装置は、ネットワークＮＷを介して種々の情報を送受信する。ネットワークＮＷは、例えば、インターネット、ＷＡＮ（Wide Area Network）、ＬＡＮ（Local Area Network）、プロバイダ端末、無線通信網、無線基地局、専用回線等を含む。なお、図１に示す各装置の全ての組み合わせが相互に通信可能である必要はなく、ネットワークＮＷは、一部にローカルなネットワークを含んでもよい。 Each device shown in FIG. 1 transmits and receives various information via the network NW. The network NW includes, for example, the Internet, WAN (Wide Area Network), LAN (Local Area Network), provider terminal, wireless communication network, wireless base station, leased line, and the like. Note that not all combinations of the devices shown in FIG. 1 need to be able to communicate with each other, and the network NW may partially include a local network.

端末装置１０は、例えば、スマートフォン等の携帯電話、タブレット端末、各種パーソナルコンピュータ等の、入力装置、表示装置、通信装置、記憶装置、および演算装置を備える端末装置である。通信装置は、ＮＩＣ（Network Interface Card）等のネットワークカード、無線通信モジュール等を含む。端末装置１０では、ウェブブラウザやアプリケーションプログラム等のＵＡ（User Agent）が起動し、ユーザの入力する内容に応じたリクエストをサービス提供装置２０に送信する。また、ＵＡが起動された端末装置１０は、サービス提供装置２０から取得した情報に基づいて、表示装置に各種画像を表示させる。端末装置１０のユーザは、「ウェブページを閲覧した閲覧者」の一例である。 The terminal device 10 is a terminal device including an input device, a display device, a communication device, a storage device, and an arithmetic device, such as a mobile phone such as a smart phone, a tablet terminal, and various personal computers. The communication device includes a network card such as a NIC (Network Interface Card), a wireless communication module, and the like. In the terminal device 10 , a UA (User Agent) such as a web browser or an application program is activated, and a request according to the contents input by the user is sent to the service providing device 20 . Also, the terminal device 10 on which the UA is activated causes the display device to display various images based on the information acquired from the service providing device 20 . The user of the terminal device 10 is an example of the "viewer who viewed the web page".

サービス提供装置２０は、例えば、ＵＡとして起動されたウェブブラウザからのリクエストに応じてウェブページを端末装置１０に提供するウェブサーバである。ウェブページは、例えば、インターネット上において商品を販売するショッピングサイトやオークションサイト、フリーマーケットサイト等のウェブページを構成するウェブページであってよい。また、サービス提供装置２０は、検索サイトやＳＮＳ（Social Networking Service）、メールサービス等の各種サービスを提供するウェブページを端末装置１０に提供してもよい。また、サービス提供装置２０は、ＵＡとして起動されたアプリケーションからのリクエストに応じてコンテンツを端末装置１０に提供することで、販売サイト等の各種ウェブページと同様のサービスを提供するアプリケーションサーバであってもよい。 The service providing device 20 is, for example, a web server that provides a web page to the terminal device 10 in response to a request from a web browser activated as a UA. The web page may be, for example, a web page that constitutes web pages of shopping sites, auction sites, flea market sites, etc. that sell products on the Internet. The service providing apparatus 20 may also provide the terminal apparatus 10 with web pages that provide various services such as search sites, SNS (Social Networking Service), and mail services. Further, the service providing device 20 is an application server that provides the same services as various web pages such as sales sites by providing content to the terminal device 10 in response to requests from applications started as UA. good too.

決済処理装置３０は、例えば、サービス提供装置２０において販売された商品の決済処理を行う。決済処理装置３０は、端末装置１０を利用するユーザにより指定された決済手段に基づく決済処理を行う。決済処理は、クレジットカードのシステムやインターネットバンキングを利用して行われてもよいし、端末装置１０が携帯電話である場合には、端末装置１０の利用料に加算することで行われてもよい。また、決済処理は、コンビニエンスストア等の店舗における支払いにより行われてもよい。 The payment processing device 30 performs payment processing for products sold by the service providing device 20, for example. The payment processing device 30 performs payment processing based on the payment method specified by the user using the terminal device 10 . The payment processing may be performed using a credit card system or Internet banking, or may be performed by adding the fee to the usage fee of the terminal device 10 when the terminal device 10 is a mobile phone. . Also, the payment processing may be performed by paying at a store such as a convenience store.

ユーザが端末装置１０を用いてインターネット上において商品を購入する場合、ユーザは、例えば、サービス提供装置２０が提供する検索サイトにアクセスする。ユーザは、商品を販売するショッピングサイトの名称や、購入したい商品の名称等をクエリとして検索し、検索結果として表示されたショッピングサイトの一覧のうち、いずれかのショッピングサイトにアクセス（閲覧）する。ユーザは、ショッピングサイトにおいて購入したい商品を販売している販売サイトにアクセスする。そして、ユーザは、販売サイトのうち、販売される商品（以下、商品ページ）にアクセスし、購入したい商品の商品ページから商品を購入する処理を行う。 When a user purchases a product on the Internet using the terminal device 10, the user accesses a search site provided by the service providing device 20, for example. The user searches for the name of the shopping site that sells the product, the name of the product that the user wants to purchase, etc. as a query, and accesses (views) one of the shopping sites displayed as the search result. A user accesses a sales site that sells products that the user wants to purchase on the shopping site. Then, the user accesses the product for sale (hereinafter referred to as product page) on the sales site and performs processing of purchasing the product from the product page of the product that the user wants to purchase.

サービス提供装置２０は、このようなアクセスの履歴（以下、アクセスログ）をセッション毎に取得する。セッションは、例えば、あるユーザが、サービス提供装置２０が提供するいずれかのウェブページにアクセスを開始してから、サービス提供装置２０が提供するいずれかのウェブページから離れる、ウェブブラウザを閉じる、又はウェブページへのアクセスを開始してから所定の時間（例えば、３０分以上）が経過するまでの期間である。したがって、アクセスログには、１セッションの間にユーザがウェブページ上で行った行動に応じて、ユーザが閲覧したウェブページに係る情報が、１以上含まれる。 The service providing apparatus 20 acquires such access history (hereinafter referred to as an access log) for each session. A session is, for example, when a user starts accessing any web page provided by the service providing apparatus 20 and then leaves any web page provided by the service providing apparatus 20, closes the web browser, or It is a period from when access to the web page is started until a predetermined time (for example, 30 minutes or longer) elapses. Therefore, the access log contains one or more pieces of information relating to the web pages viewed by the user according to the user's actions on the web page during one session.

図２は、アクセスログの一例を示す図である。アクセスログには、例えば、１セッションの間に、ユーザがアクセスしたウェブページを識別可能な識別情報（以下、ウェブＩＤ）と、ユーザが当該ウェブページを閲覧した時間（以下、閲覧時間）とが互いに対応付けられたレコードが、時系列順に含まれる情報である。ウェブＩＤは、例えば、ＵＲＬ（Uniform Resource Locator）であってもよく、予めウェブページに付されるＩＤであってもよく、サブドメインであってもよい。 FIG. 2 is a diagram showing an example of an access log. For example, the access log contains identification information (hereinafter referred to as web ID) that can identify the web page accessed by the user and the time the user viewed the web page (hereinafter referred to as viewing time) during one session. Records associated with each other are information included in chronological order. The web ID may be, for example, a URL (Uniform Resource Locator), an ID given to a web page in advance, or a subdomain.

図３のアクセスログには、まず、検索サイトにアクセスし、次にショッピングサイトにアクセスし、次に販売サイトにアクセスし、次に商品ページにアクセスしたことが示される。また、アクセスログには、検索サイトのウェブＩＤと閲覧時間のレコードと、ショッピングサイトのウェブＩＤと閲覧時間のレコードと、販売サイトのウェブＩＤと閲覧時間のレコードと、商品ページのウェブＩＤと閲覧時間のレコードとが含まれる。なお、アクセスログの内容は、一例であってこれに限られない。 The access log in FIG. 3 indicates that the user first accessed the search site, then accessed the shopping site, then accessed the sales site, and then accessed the product page. The access log contains a record of the web ID and browsing time of the search site, a record of the web ID and browsing time of the shopping site, a record of the web ID and browsing time of the sales site, and a record of the web ID and browsing time of the product page. Includes time records. Note that the content of the access log is an example and is not limited to this.

ここで、アクセスログには、ユーザが所定の行動に至るまでの、ユーザの振る舞いの特徴が表れている場合がある。例えば、類似する複数の商品の商品ページを、それぞれ長い時間閲覧していることがアクセスログに示されている場合、ユーザは、商品を購入する可能性が高いことが推定される。また、類似する複数のサービスのウェブページを、それぞれ長い時間閲覧していることがアクセスログに示されている場合、ユーザは、サービスに入会する可能性が高いことが推定される。また、各ウェブページの閲覧時間が極端に短く、商品ページまで最少の数のウェブページを介して到達し、商品を購入したことがアクセスログに示される場合、ユーザは、不正な手段による決済（以下、不正な決済）によって商品を購入した可能性が高いことが推定される。 Here, the access log may show characteristics of the user's behavior until the user reaches a predetermined action. For example, if the access log indicates that product pages of multiple similar products have been viewed for a long time, it is estimated that the user is likely to purchase the product. Also, if the access log indicates that the user browses web pages of a plurality of similar services for a long period of time, it is estimated that the user is highly likely to become a member of the service. In addition, if the access log shows that the browsing time of each web page is extremely short, the user reached the product page through the minimum number of web pages, and purchased the product, the user may make a payment by fraudulent means ( It is presumed that there is a high possibility that the product was purchased by fraudulent payment.

情報処理装置１００は、アクセスログを取得し、取得したアクセスログを機械学習することで、ユーザが所定の行動（例えば、商品の購入、サービスの入会、或いは不正決済）を行うか否かを推定する。 The information processing apparatus 100 obtains an access log, and performs machine learning on the obtained access log to estimate whether or not the user performs a predetermined action (for example, purchasing a product, signing up for a service, or fraudulent payment). do.

［情報処理装置の構成］
図３は、情報処理装置１００の構成の一例を示す図である。図示のように、情報処理装置１００は、例えば、通信部１０２と、制御部１１０と、記憶部１３０とを備える。 [Configuration of information processing device]
FIG. 3 is a diagram showing an example of the configuration of the information processing apparatus 100. As shown in FIG. As illustrated, the information processing apparatus 100 includes, for example, a communication unit 102, a control unit 110, and a storage unit .

通信部１０２は、例えば、ＮＩＣ等の通信インターフェースを含む。通信部１０２は、ネットワークＮＷを介して、端末装置１０、サービス提供装置２０、或いは決済処理装置３０等と通信する。 The communication unit 102 includes, for example, a communication interface such as NIC. The communication unit 102 communicates with the terminal device 10, the service providing device 20, the payment processing device 30, or the like via the network NW.

制御部１１０は、例えば、取得部１１２と、形態素解析部１１４と、行動推定部１１６と、決定部１１８と、学習処理部１２０とを備える。 The control unit 110 includes, for example, an acquisition unit 112, a morphological analysis unit 114, an action estimation unit 116, a determination unit 118, and a learning processing unit 120.

制御部１１０の構成要素は、例えば、ＣＰＵ（Central Processing Unit）などのプロセッサが記憶部１３０に格納されたプログラムを実行することにより実現される。また、制御部１１０の構成要素の一部または全部は、ＬＳＩ（Large Scale Integration）、ＡＳＩＣ（Application Specific Integrated Circuit）、ＦＰＧＡ（Field-Programmable Gate Array）、またはＧＰＵ（Graphics Processing Unit）などのハードウェア（回路部；circuitry）により実現されてもよいし、ソフトウェアとハードウェアの協働によって実現されてもよい。 The constituent elements of the control unit 110 are realized by executing a program stored in the storage unit 130 by a processor such as a CPU (Central Processing Unit), for example. Some or all of the components of the control unit 110 are hardware such as LSI (Large Scale Integration), ASIC (Application Specific Integrated Circuit), FPGA (Field-Programmable Gate Array), or GPU (Graphics Processing Unit). (circuitry), or by cooperation of software and hardware.

記憶部１３０は、例えば、ＨＤＤ（Hard Disc Drive）、フラッシュメモリ、ＥＥＰＲＯＭ（Electrically Erasable Programmable Read Only Memory）、ＲＯＭ（Read Only Memory）、ＲＡＭ（Random Access Memory）などの記憶装置により実現される。記憶部１３０には、ファームウェアやアプリケーションプログラムなどの各種プログラムの他に、分類器情報１３２、及び教師データ１３４が格納される。これらの詳細については後述する。 The storage unit 130 is implemented by a storage device such as a HDD (Hard Disc Drive), a flash memory, an EEPROM (Electrically Erasable Programmable Read Only Memory), a ROM (Read Only Memory), and a RAM (Random Access Memory). The storage unit 130 stores classifier information 132 and teacher data 134 in addition to various programs such as firmware and application programs. Details of these will be described later.

［運用時の処理フロー］
以下、第１実施形態における情報処理装置１００による運用時の一連の処理の流れをフローチャートに即して説明する。運用時とは、既に学習された分類器２００を利用する状態である。図４は、第１実施形態における情報処理装置１００による運用時の一連の処理の流れを示すフローチャートである。本フローチャートの処理は、例えば、所定の周期で繰り返し行われてもよい。 [Processing flow during operation]
The flow of a series of processes during operation by the information processing apparatus 100 according to the first embodiment will be described below with reference to flowcharts. During operation is a state in which the already learned classifier 200 is used. FIG. 4 is a flow chart showing a series of processes during operation by the information processing apparatus 100 according to the first embodiment. The processing of this flowchart may be repeated, for example, at a predetermined cycle.

まず、取得部１１２は、通信部１０２にサービス提供装置２０と通信させ、サービス提供装置２０からユーザのアクセスログを取得する（Ｓ１００）。 First, the acquiring unit 112 causes the communication unit 102 to communicate with the service providing apparatus 20 and acquires the user's access log from the service providing apparatus 20 (S100).

次に、形態素解析部１１４は、取得部１１２によって取得されたアクセスログに含まれる文書（ウェブＩＤや、閲覧時間）に対して形態素解析を行う（Ｓ１０２）。例えば、形態素解析部１１４は、ワンホット表現と呼ばれる手法を用いて、単語や語句をベクトル化してもよいし、ｗｏｒｄ２ｖｅｃやＧｌｏＶｅのような分散表現または単語埋め込み表現（Word Embedding(s)）と呼ばれる手法を用いて、単語や語句をベクトル化する。 Next, the morphological analysis unit 114 performs morphological analysis on the document (web ID and browsing time) included in the access log acquired by the acquisition unit 112 (S102). For example, the morphological analysis unit 114 may vectorize words and phrases using a technique called one-hot representation, distributed representation such as word2vec or GloVe, or word embedding representation (Word Embedding(s)). method to vectorize words and phrases.

ワンホット表現とは、予め用意された単語または語句のリスト（辞書）と、アクセスログに含まれる単語または語句とを比較して、単語または語句のうち、リストに出現した単語または語句の要素値が１、他の単語または語句の要素値が０となるような多次元のベクトルを生成する手法である。形態素解析部１１４は、例えば、ワンホット表現によって閲覧時間を示す１１次元のベクトルを生成する。 A one-hot expression is a list of words or phrases prepared in advance (dictionary) that is compared with the words or phrases included in the access log, and among the words or phrases that appear in the list, the element values of the words or phrases are found. is 1, and other words or phrases have element values of 0. The morphological analysis unit 114 generates, for example, an 11-dimensional vector indicating browsing time by one-hot expression.

また、分散表現は、形態素解析で得られた単語または語句と、その単語または語句の前後で出現する単語または語句との共起性に基づいて、単語や語句をベクトル化する手法であり、例えば、予め用意された複数の単語や語句を含むコーパスに基づいて、形態素解析で得られた単語または語句の前後に出現する単語または語句の出現確率を求め、その出現確率を要素値とする多次元のベクトルを生成する手法である。 Distributed representation is a method of vectorizing words or phrases based on the co-occurrence of words or phrases obtained by morphological analysis and words or phrases that appear before and after the words or phrases. , Based on a corpus containing multiple words and phrases prepared in advance, the occurrence probability of words or phrases that appear before and after the word or phrase obtained by morphological analysis is obtained, and the occurrence probability is used as an element value. is a method to generate a vector of

例えば、形態素解析部１１４は、Countinuous Bag-of-WordsおよびSkip-gramの双方の学習モデルを適用したニューラルネットワークに形態素解析で得られた単語または語句を入力することで、その単語や語句が分散表現されたベクトルを生成してよい。 For example, the morphological analysis unit 114 inputs words or phrases obtained by morphological analysis to a neural network to which both Countinuous Bag-of-Words and Skip-gram learning models are applied, so that the words and phrases are dispersed. A vector representation may be generated.

形態素解析部１１４は、ベクトル化した単語または語句（以下、単語ベクトルと称する）を行ベクトルとした２階テンソルの行列（以下、単語行列と称する）を生成する。例えば、形態素解析部１１４は、形態素解析によってタイトルを１０個の単語に分割した場合、１０個の単語ベクトルのそれぞれを行ベクトルとした単語行列（１０行×ｎ列の行列：ｎは単語ベクトルの次元数）を生成する。形態素解析部１１４は、例えば、分散表現によってウェブＩＤを示す１００次元のベクトルを生成する。 The morphological analysis unit 114 generates a second-order tensor matrix (hereinafter referred to as a word matrix) in which vectorized words or phrases (hereinafter referred to as word vectors) are used as row vectors. For example, when the title is divided into 10 words by morphological analysis, the morphological analysis unit 114 creates a word matrix (a matrix of 10 rows×n columns, where n is a word vector number of dimensions). The morphological analysis unit 114 generates, for example, a 100-dimensional vector indicating the web ID by distributed representation.

次に、行動推定部１１６は、分類器情報１３２を参照して、分類器２００を構築（生成）し、形態素解析部１１４によって生成された単語行列を、構築した分類器２００に入力する（Ｓ１０４）。以下、分類器２００が、ユーザが行う所定の行動のうち、「商品を購入するか」について出力するものである場合について説明する。 Next, the action estimation unit 116 constructs (generates) the classifier 200 by referring to the classifier information 132, and inputs the word matrix generated by the morphological analysis unit 114 to the constructed classifier 200 (S104). ). A case will be described below in which the classifier 200 outputs "Whether to purchase a product" among the predetermined actions performed by the user.

なお、分類器２００が、ユーザが行う所定の行動のうち、「サービスに入会するか」について出力するものである場合、以下の説明のうち、「商品を購入（する／しない）」を「サービスに入会（する／しない）」に読み替えればよく、分類器２００が、ユーザが行う所定の行動のうち、「不正な決済が行われたか」について出力するものである場合、以下の説明のうち、「商品を購入（する／しない）」を「不正な決済が（行われた／行われていない）」に読み替えればよい。 It should be noted that if the classifier 200 outputs "whether or not to join a service" among the predetermined actions performed by the user, in the following description, "purchase (purchase/do not) products" is replaced by "service If the classifier 200 outputs "whether fraudulent payment has been made" among the predetermined actions performed by the user, then , "Purchase (purchase/do not) goods" should be read as "Unauthorized payment (made/not made)".

分類器情報１３２は、分類器を定義した情報（プログラムまたはデータ構造）である。分類器２００は、例えば、コンボリューショナルニューラルネットワーク（Convolutional Neural Network：ＣＮＮ）や、リカレントネットワーク（Reccurent Neural Network：ＲＮＮ）等の種々のニューラルネットワークによって実現される。 Classifier information 132 is information (program or data structure) that defines a classifier. The classifier 200 is realized by various neural networks such as a convolutional neural network (CNN) and a recurrent neural network (RNN).

分類器情報１３２には、例えば、各ニューラルネットワークを構成する入力層、一以上の隠れ層（中間層）、出力層のそれぞれに含まれるニューロン（ユニット）が互いにどのように結合されるのかという結合情報や、結合されたニューロン間で入出力されるデータに付与される結合係数等の各種情報が含まれる。結合情報とは、例えば、各層に含まれるニューロン数や、各ニューロンの結合先のニューロンの種類を指定する情報、各ニューロンを実現する活性化関数、隠れ層のニューロン間に設けられたゲート等の情報を含む。ニューロンを実現する活性化関数は、例えば、正規化線形関数（ＲｅＬＵ関数）であってもよいし、シグモイド関数や、ステップ関数、その他の関数等であってもよい。ゲートは、例えば、活性化関数によって返される値（例えば１または０）に応じて、ニューロン間で伝達されるデータを選択的に通過させたり、重み付けたりする。結合係数は、活性化関数のパラメータであり、例えば、ニューラルネットワークの隠れ層において、ある層のニューロンから、より深い層のニューロンにデータが出力される際に、出力データに対して付与される重みを含む。また、結合係数は、各層の固有のバイアス成分等を含んでもよい。 The classifier information 132 includes, for example, how the neurons (units) included in each of the input layer, one or more hidden layers (intermediate layers), and the output layer that constitute each neural network are connected to each other. Information and various information such as coupling coefficients assigned to data input/output between connected neurons are included. The connection information includes, for example, the number of neurons included in each layer, information specifying the type of neuron to which each neuron is connected, an activation function that realizes each neuron, a gate provided between neurons in the hidden layer, etc. Contains information. An activation function that implements a neuron may be, for example, a normalized linear function (ReLU function), a sigmoid function, a step function, or other functions. A gate selectively passes or weights data transferred between neurons, for example, depending on the value (eg, 1 or 0) returned by the activation function. A coupling coefficient is a parameter of an activation function. For example, in a hidden layer of a neural network, a weight given to output data when data is output from a neuron in a certain layer to a neuron in a deeper layer. including. The coupling coefficient may also include bias components specific to each layer, and the like.

図５は、分類器２００の一例を示す図である。第１実施形態における分類器２００は、例えば、第１ＣＮＮ２１０と、第２ＣＮＮ２２０と、ＦＣＮＮ（Fully-Connected Neural Network）２３０と、ＲＮＮ２４０と、ＬＵ２５０によって実現される。 FIG. 5 is a diagram showing an example of the classifier 200. As shown in FIG. The classifier 200 in 1st Embodiment is implement|achieved by 1st CNN210, 2nd CNN220, FCNN(Fully-Connected Neural Network)230, RNN240, and LU250, for example.

第１ＣＮＮ２１０および第２ＣＮＮ２２０は、単語行列が入力されると、次元数が所定数のベクトルを出力するように学習されたコンボリューショナルニューラルネットワークである。 The first CNN 210 and the second CNN 220 are convolutional neural networks trained to output vectors having a predetermined number of dimensions when word matrices are input.

例えば、単語行列を、各要素値が画素値に置き換わった画像とした場合、第１ＣＮＮ２１０および第２ＣＮＮ２２０は、入力された単語行列に相当する画像と所定のフィルタとを畳み込み（内積を求め）、所定のフィルタと重なる複数の画素を含む画像領域を、一つの単位領域に圧縮する。この単位領域には、畳み込みにより得られた単変量値が特徴量として対応付けられている。第１ＣＮＮ２１０および第２ＣＮＮ２２０は、画像上で所定のフィルタをシフト移動させながら畳み込み処理を繰り返し、複数の単位領域により形成されたコンボリューショナルレイヤと呼ばれる畳み込みの特徴量を画素値とした特徴画像を生成する。第１ＣＮＮ２１０および第２ＣＮＮ２２０は、各特徴画像のサイズ（次元数）をプーリング（例えばGlobal Average Pooling）等によって圧縮し、この結果を出力する。圧縮した特徴画像のそれぞれには、単変量の特徴量が対応付けられているため、畳み込み処理の回数分だけ、畳み込みの特徴量を要素としてもつ多次元ベクトル（１階テンソル）が出力される。 For example, if the word matrix is an image in which each element value is replaced with a pixel value, the first CNN 210 and the second CNN 220 convolve an image corresponding to the input word matrix with a predetermined filter (obtain the inner product), An image region containing a plurality of pixels that overlap with the filters of is compressed into one unit region. A univariate value obtained by convolution is associated with this unit region as a feature amount. The first CNN 210 and the second CNN 220 repeat convolution processing while shifting a predetermined filter on the image, and generate a feature image in which the feature amount of the convolution called a convolutional layer formed by a plurality of unit regions is used as the pixel value. do. The first CNN 210 and the second CNN 220 compress the size (number of dimensions) of each feature image by pooling (for example, Global Average Pooling) or the like, and output this result. Since each of the compressed feature images is associated with a univariate feature amount, a multidimensional vector (first-order tensor) having convolution feature amounts as elements is output as many times as the number of convolution processes.

例えば、行動推定部１１６は、閲覧時間から得られた単語行列を第１ＣＮＮ２１０に入力し、ウェブＩＤから得られた単語行列を第２ＣＮＮ２２０に入力する。これによって、各ＣＮＮは、入力されたデータを所定の次元数のベクトルに変換してＦＣＮＮ２３０に出力する。 For example, the behavior estimator 116 inputs a word matrix obtained from browsing times to the first CNN 210 and inputs a word matrix obtained from web IDs to the second CNN 220 . Thereby, each CNN converts the input data into a vector with a predetermined number of dimensions and outputs the vector to the FCNN 230 .

ＦＣＮＮ２３０は、各層のユニットのそれぞれが、より階層の深い次の層のユニットのそれぞれと接続されたニューラルネットワークである。なお、ＦＣＮＮ２３０は、中間層がない入力層のユニットのそれぞれと出力層のユニットのそれぞれとが互いに接続された２つの層であってもよい。例えば、ＦＣＮＮ２３０は、第１ＣＮＮ２１０から出力されたベクトルの次元数が１１、第２ＣＮＮ２２０から出力されたベクトルの次元数が１００である場合、それらの全てのベクトルの次元数の総和である１１１次元の一つのベクトルを出力する。 The FCNN 230 is a neural network in which each unit in each layer is connected to each unit in the next deeper layer. Note that the FCNN 230 may be two layers in which each unit in the input layer and each unit in the output layer without an intermediate layer are connected to each other. For example, if the number of dimensions of the vectors output from the first CNN 210 is 11 and the number of dimensions of the vectors output from the second CNN 220 is 100, then the FCNN 230 may be one of 111 dimensions, which is the total number of dimensions of all the vectors. outputs two vectors.

ＲＮＮ２４０は、例えば、ネットワークの中間層がＬＳＴＭ（Long short-term memory）であるＲＮＮであり、１階テンソルであるベクトルが入力されると、そのベクトルの元になったウェブＩＤ、及び閲覧時間によって示されるユーザが商品を購入することの確からしさ（尤もらしさ）を表すスコアをレコード毎に出力するように学習されたニューラルネットワークである。例えば、ＬＳＴＭは、重みと状態変数との内積を計算する。数式（１）は、ＬＳＴＭのゲートおよびニューロンの計算式を表している。状態変数とは、各ゲートによって求められるベクトルである。 RNN 240 is, for example, an RNN whose middle layer of the network is LSTM (Long short-term memory). It is a neural network trained to output a score representing the probability (likelihood) that the displayed user will purchase the product for each record. For example, the LSTM computes the inner product of weights and state variables. Equation (1) expresses the computational equations for LSTM gates and neurons. A state variable is a vector determined by each gate.

ｔは、ＲＮＮ２４０によって繰り返し行われる再帰処理の処理周期（処理時刻）を表しており、ｘ_ｔは、処理周期ｔにおいてＦＣＮＮ２３０から入力されるベクトル（以下、入力ベクトル）を表し、ｈ_ｔは、処理周期ｔにおいてＲＮＮ２４０により出力されるベクトル（以下、出力ベクトル）を表している。再帰処理とは、過去の処理周期で求めた出力ベクトルを、今回の処理周期での出力ベクトルの導出に利用することである。 t represents the processing cycle (processing time) of the recursive processing repeatedly performed by the RNN 240, x _t represents the vector input from the FCNN 230 in the processing cycle t (hereinafter referred to as input vector), and h _t represents the processing time. The vector output by the RNN 240 at period t (hereinafter output vector) is represented. Recursive processing is to use the output vector obtained in the past processing cycle to derive the output vector in the current processing cycle.

ｉ_ｔは、ＬＳＴＭに含まれるインプットゲートによって出力されるベクトル（以下、インプットゲートベクトル）を表しており、σは、ゲートの活性化関数がシグモイド関数であることを表しており、Ｗ_ｉは、入力ベクトルｘ_ｔを線形変換するための重みを表しており、Ｒ_ｉは、前回の処理周期ｔ－１の出力ベクトルｈ_ｔ－１を線形変換するための重みを表しており、Ｑ_ｉは、メモリーセルに記憶された前回の処理周期ｔ－１の内部演算ベクトルｃ_ｔ－１を線形変換するための重みを表しており、ｂ_ｉは、インプットゲートのバイアス成分を表している。メモリーセルとは、一時的に内部演算ベクトルｃを記憶させておくための記憶領域（ＬＳＴＭブロック）である。 i _t represents the vector output by the input gate included in the LSTM (hereinafter, input gate vector), σ represents that the activation function of the gate is a sigmoid function, and _Wi is represents the weight for linearly transforming the input vector x _t , R _i represents the weight for linearly transforming the output vector h _t−1 in the previous processing cycle t−1, and Q _i is It represents the weight for linearly transforming the internal operation vector ct-1 of the previous processing cycle t- ₁ stored in the memory cell, and _bi represents the bias component of the input gate. A memory cell is a storage area (LSTM block) for temporarily storing an internal operation vector c.

例えば、ＲＮＮ２４０は、重みＷ_ｉと入力ベクトルｘ_ｔとの内積値と、重みＲ_ｉと前回の出力ベクトルｈ_ｔ－１との内積値と、重みＱ_ｉと前回の内部演算ベクトルｃ_ｔ－１とのアダマール積（要素毎の積）と、バイアス成分ｂ_ｉとの和を変数としたシグモイド関数σを解くことで、処理周期ｔのインプットゲートベクトルｉ_ｔを導出する。 For example, the RNN 240 stores the inner product value of the weight W _i and the input vector x _t , the inner product value of the weight R _i and the previous output vector h _t−1 , the weight Q _i and the previous internal operation vector c _t−1 and the sum of the Hadamard product (product for each element) of and the bias component b _i as a variable, the input gate vector i _t of the processing cycle t is derived.

ｆ_ｔは、ＬＳＴＭに含まれるフォーゲットゲートによって出力されるベクトル（以下、フォーゲットゲートベクトル）を表しており、Ｗ_ｆは、重みＷ_ｉと同様に、入力ベクトルｘ_ｔを線形変換するための重みを表しており、Ｒ_ｆは、重みＲ_ｉと同様に、前回の処理周期ｔ－１の出力ベクトルｈ_ｔ－１を線形変換するための重みを表しており、Ｑ_ｆは、重みＱ_ｉと同様に、メモリーセルに記憶された前回の処理周期ｔ－１の内部演算ベクトルｃ_ｔ－１を線形変換するための重みを表しており、ｂ_ｆは、フォーゲットゲートのバイアス成分を表している。 f _t represents the vector output by the forget gate included in the LSTM (hereinafter referred to as the forget gate vector), and W _f is the weight W _i for linearly transforming the input vector x _t . R _f represents a weight for linearly transforming the output vector h _t−1 of the previous processing cycle t−1, similar to the weight R _i , and Q _f represents the weight Q _i , represents the weight for linearly transforming the internal operation vector c t-1 of the previous processing cycle t- ₁ stored in the memory cell, and b _f represents the bias component of the forget gate. there is

例えば、ＲＮＮ２４０は、重みＷ_ｆと入力ベクトルｘ_ｔとの内積値と、重みＲ_ｆと前回の出力ベクトルｈ_ｔ－１との内積値と、重みＱ_ｆと前回の内部演算ベクトルｃ_ｔ－１とのアダマール積と、バイアス成分ｂ_ｆとの和を変数としたシグモイド関数σを解くことで、処理周期ｔのフォーゲットゲートベクトルｆ_ｔを導出する。 For example, the RNN 240 stores the inner product value of the weight W _f and the input vector x _t , the inner product value of the weight R _f and the previous output vector h _t−1 , the weight Q _f and the previous internal operation vector c _t−1 By solving the sigmoid function σ whose variable is the sum of the Hadamard product of and the bias component _bf , the forget gate vector _ft of the processing period t is derived.

ｃ_ｔは、ＬＳＴＭに含まれるメモリーセルに記憶させる内部演算ベクトルを表している。 _ct represents an internal operation vector stored in a memory cell included in the LSTM.

例えば、ＲＮＮ２４０は、処理周期ｔのフォーゲットゲートベクトルｆ_ｔおよび後述するベクトルｚ_ｔのアダマール積と、処理周期ｔ－１の内部演算ベクトルｃ_ｔ‐１および処理周期ｔのフォーゲットゲートベクトルｆ_ｔのアダマール積との和を、処理周期ｔの内部演算ベクトルｃ_ｔとして導出する。 For example, the RNN 240 generates the Hadamard product of the forget gate vector f _t in the processing period t and the vector z _t described later, the internal operation vector c _t-1 in the processing period t−1, and the forget gate vector f t in the processing period _t and the Hadamard product of is derived as an internal operation vector _ct of the processing cycle t.

ｚ_ｔは、重みＷ_ｚと入力ベクトルｘ_ｔとの内積値と、重みＲ_ｚと前回の出力ベクトルｈ_ｔ－１との内積値と、バイアス成分ｂ_ｚとの和を変数としたハイパポリックタンジェント関数ｔａｎｈを解くことで導出される。Ｗ_ｚは、重みＷ_ｉやＷ_ｆと同様に、入力ベクトルｘ_ｔを線形変換するための重みを表しており、Ｒ_ｆは、重みＲ_ｉやＲ_ｆと同様に、前回の処理周期ｔ－１の出力ベクトルｈ_ｔ－１を線形変換するための重みを表しており、ｂ_ｚは、メモリーセルのバイアス成分を表している。 z _t is a hyperpolic whose variable is the sum of the inner product value of the weight W _z and the input vector x _t , the inner product value of the weight R _z and the previous output vector h _t−1 , and the bias component b _z It is derived by solving the tangent function tanh. W _z represents _a weight for linearly transforming the input vector x _t like the weights W _i and _W _f , and R _f represents the previous processing cycle t− The weights for linearly transforming the output vector h _t−1 of 1 are represented, and b _z represents the bias component of the memory cell.

ｏ_ｔは、ＬＳＴＭに含まれるアウトプットゲートによって出力されるベクトル（以下、アウトプットゲートベクトル）を表しており、Ｗ_ｏは、重みＷ_ｉ等と同様に、入力ベクトルｘ_ｔを線形変換するための重みを表しており、Ｒ_ｏは、重みＲ_ｉ等と同様に、前回の処理周期ｔ－１の出力ベクトルｈ_ｔ－１を線形変換するための重みを表しており、Ｑ_ｏは、メモリーセルに記憶された今回の処理周期ｔの内部演算ベクトルｃ_ｔを線形変換するための重みを表しており、ｂ_ｏは、アウトプットゲートのバイアス成分を表している。 o _t represents the vector output by the output gate included in the LSTM (hereinafter referred to as the output gate vector), and _Wo is for linearly transforming the input vector x _t in the same way as the weight W _i etc. R _o represents the weight for linearly transforming the output vector h _t−1 of the previous processing cycle t−1, similar to the weight R _i etc., and Q _o represents the memory It represents the weight for linearly transforming the internal operation vector _ct of the current processing cycle t stored in the cell, and _bo represents the bias component of the output gate.

例えば、ＲＮＮ２４０は、重みＷ_ｏと入力ベクトルｘ_ｔとの内積値と、重みＲ_ｏと前回の出力ベクトルｈ_ｔ－１との内積値と、重みＱ_ｏと今回の内部演算ベクトルｃ_ｔとのアダマール積と、バイアス成分ｂ_ｏとの和を変数としたシグモイド関数σを解くことで、処理周期ｔのアウトプットゲートベクトルｏ_ｔを導出する。 For example, the RNN 240 stores the inner product value of the weight W _o and the input vector x _t , the inner product value of the weight R _o and the previous output vector h _t−1 , and the weight Q _o and the current internal operation vector c _t . By solving the sigmoid function σ with the sum of the Hadamard product and the bias component _bo as a variable, the output gate vector o _t of the processing period t is derived.

ＲＮＮ２４０は、アウトプットゲートベクトルｏ_ｔを導出すると、アウトプットゲートベクトルｏ_ｔと、内部演算ベクトルｃ_ｔを変数としたハイパポリックタンジェント関数ｔａｎｈの解とのアダマール積を、処理周期ｔの出力ベクトルｈ_ｔとして導出する。 After deriving the output gate vector _ot , the RNN 240 converts the Hadamard product of the output gate vector _ot and the solution of the hyperpolic tangent function tanh with the internal operation vector _ct as a variable into the output vector of the processing period t. h _t .

出力ベクトルｈ_ｔの各要素値は、レコード毎の行動（この一例では、ユーザが商品を購入すること）の確からしさのスコアを表している。例えば、ｈ_ｔ＝｛ｅ１，ｅ２｝であった場合、要素ｅ１は、ユーザが商品を購入することの確からしさを示すスコア（確率）を表し、要素ｅ２は、ユーザが商品を購入しないことの確からしさを示すスコアを表している。ＲＮＮ２４０は、上述した数式に基づく計算を繰り返すことで、出力ベクトルを繰り返し導出する。 Each element value of the output vector _ht represents the likelihood score of the action (in this example, the user purchasing a product) for each record. For example, when h _t ={e1, e2}, the element e1 represents a score (probability) indicating the probability that the user will purchase the product, and the element e2 represents the probability that the user will not purchase the product. It represents a score that indicates certainty. The RNN 240 iteratively derives the output vector by repeating the calculations based on the formulas described above.

図６は、時間経過に応じたＲＮＮ２４０の処理内容を並列に展開した図である。例えば、ＦＣＮＮ２３０から入力ベクトルｘ_ｔ１が入力された初回の処理周期ｔ１では、ＲＮＮ２４０は、出力ベクトルｈ_ｔ１を出力する。初回処理時ｔ１の出力ベクトルｈ_ｔ１は、アクセスログのうち最初のレコードに係る、ユーザが商品を購入することの確からしさを示すスコアを要素として含んでおり、各要素は、例えば、各要素におけるユーザが商品を購入することの確からしさを示すスコアである。 FIG. 6 is a parallel development diagram of the processing contents of the RNN 240 according to the passage of time. For example, in the first processing cycle t1 when the input vector _xt1 is input from the FCNN 230, the RNN 240 outputs the output vector _ht1 . The output vector h _t1 at the time of initial processing t1 includes, as elements, a score indicating the probability that the user will purchase the product for the first record in the access log, and each element is, for example, It is a score that indicates the likelihood that the user will purchase the product.

次の処理周期ｔ２では、ＲＮＮ２４０は、初回の処理周期ｔ１に入力された入力ベクトルｘ_ｔ１と、前回の処理周期ｔ１の出力ベクトルｈ_ｔ１とに基づいて、出力ベクトルｈ_ｔ２を出力する。処理周期ｔ２の出力ベクトルｈ_ｔ２は、２番目のレコードに係るユーザが商品を購入することの確からしさを示すスコアを要素として含んでいる。 In the next processing cycle t2, the RNN 240 outputs an output vector _ht2 based on the input vector xt1 input in the first processing cycle t1 and the output vector _ht1 in the previous processing cycle _t1 . The output vector _ht2 of the processing cycle t2 contains as elements the score indicating the probability that the user associated with the second record will purchase the product.

次の処理周期ｔ３では、ＲＮＮ２４０は、初回の処理周期ｔ１に入力された入力ベクトルｘ_ｔ１と、前回の処理周期ｔ２の出力ベクトルｈ_ｔ２とに基づいて、出力ベクトルｈ_ｔ３を出力する。処理周期ｔ３の出力ベクトルｈ_ｔ３は、３番目のレコードに係るユーザが商品を購入することの確からしさを示すスコアを要素として含んでいる。 In the next processing cycle t3, the RNN 240 outputs an output vector _ht3 based on the input vector xt1 input in the first processing cycle t1 and the output vector _ht2 in the previous processing cycle _t2 . The output vector _ht3 of the processing cycle t3 includes as elements the score indicating the probability that the user associated with the third record will purchase the product.

このように、ＲＮＮ２４０は、再帰処理を繰り返すことで、より多くのレコードのスコアを要素とする出力ベクトルを導出する。なお、ＲＮＮ２４０は、セッションの最後のレコードのスコアを要素とする出力ベクトルを導出するまで、再帰的に出力ベクトルを導出することを繰り返すものとする。例えば、ＲＮＮ２４０の学習時に、セッションの最後のレコードに対して所定の付加的情報（以下、エンドラベルと称する）が付与された商品データが利用された場合、ＲＮＮ２４０は、エンドラベルが付与されたレコードのスコアを要素とする出力ベクトルを導出するまで再帰処理を繰り返す。これによって、例えば、ある２つのアクセスログが、いずれもユーザが商品を購入するアクセスログであって、それらのアクセスログには、互いに数が異なるレコードが含まれる場合であっても、最後のレコードまで、アクセスログに含まれるレコードの数と同じ回数だけ再帰処理を繰り返すため、ＲＮＮ２４０の再帰処理の回数（図６中のｋ）というハイパーパラメータを調整せずとも、各レコードの種類に応じて再帰処理の回数を動的に変更しながらスコアを求めることができる。 Thus, the RNN 240 repeats the recursive process to derive an output vector whose elements are the scores of more records. Note that the RNN 240 repeats recursively deriving the output vector until it derives the output vector whose elements are the scores of the last record of the session. For example, when the RNN 240 learns and uses product data to which predetermined additional information (hereinafter referred to as an end label) is assigned to the last record of the session, the RNN 240 uses the record to which the end label is assigned. The recursive process is repeated until an output vector whose elements are the scores of . As a result, for example, even if two access logs are access logs in which a user purchases a product and the access logs contain different numbers of records, the last record Since the recursive process is repeated the same number of times as the number of records included in the access log, the recursive process can be performed according to the type of each record without adjusting the hyperparameter of the number of recursive processes of the RNN 240 (k in FIG. 6). A score can be obtained while dynamically changing the number of processing times.

ＬＵ２５０は、ＲＮＮ２４０から出力されたベクトルに対して、線形変換行列を乗算することで、２次元のベクトルに変換（次元を圧縮）し、それを出力層に出力する。 The LU 250 multiplies the vector output from the RNN 240 by a linear transformation matrix to transform it into a two-dimensional vector (compresses the dimensions) and outputs it to the output layer.

図５に戻り、次に、行動推定部１１６は、分類器２００から分類結果を取得し、その分類結果に基づいてユーザが所定の行動を行うか否かを推定する（Ｓ１０６）。カテゴリの分類結果には、ＲＮＮ２４０が再帰処理毎に出力した出力ベクトルが含まれており、その出力ベクトルには、行動の確からしさのスコアの結果が要素として含まれている。行動推定部１１６は、例えば、ユーザが商品を購入することの確からしさを示すスコアと、ユーザが商品を購入しないことの確からしさを示すスコアとのうち、スコアが大きいほうの行動をユーザが行う、或いは行わないと推定する。 Returning to FIG. 5, next, the action estimation unit 116 acquires the classification result from the classifier 200, and estimates whether or not the user will perform a predetermined action based on the classification result (S106). The category classification results include the output vector output by the RNN 240 for each recursive process, and the output vector includes the action probability score result as an element. The behavior estimation unit 116, for example, selects a score indicating the probability that the user will purchase the product and a score indicating the probability that the user will not purchase the product, and the user performs the action with the higher score. , or presumed not to do.

決定部１１８は、行動推定部１１６の推定結果に基づいて、ユーザに提供する情報を提供すると決定する（Ｓ１０８）。決定部１１８は、例えば、行動推定部１１６の推定結果が、ユーザが商品を購入しないと推定した場合、ユーザの商品の購入を促すように、商品の割引や値引き等の商品の購入に係るクーポン（特典）情報をユーザに提供すると決定する。この場合、決定部１１８は、推定の元になったアクセスログに係るユーザ（の端末装置１０）にクーポン情報を送信するようにサービス提供装置２０に指示する。これにより、情報処理装置１００は、ユーザが商品を購入しないことが推定される場合であっても、ユーザに商品の購入を促すことができる。 The determination unit 118 determines to provide the information to be provided to the user based on the estimation result of the behavior estimation unit 116 (S108). For example, when the estimation result of the behavior estimating unit 116 indicates that the user does not purchase the product, the determining unit 118 provides a coupon related to the purchase of the product, such as a discount or discount, so as to encourage the user to purchase the product. It decides to provide (privilege) information to the user. In this case, the determining unit 118 instructs the service providing apparatus 20 to transmit the coupon information to the user (the terminal apparatus 10 of the user) related to the access log that is the basis of the estimation. Accordingly, the information processing apparatus 100 can prompt the user to purchase the product even when it is estimated that the user will not purchase the product.

なお、決定部１１８は、例えば、行動推定部１１６の推定結果に基づいて、ユーザが商品を購入すると推定した場合、ユーザが購入する商品の割引や値引き等に係るクーポンを示す情報をユーザに出力するものであってもよい。これにより、情報処理装置１００は、商品を購入してくれるユーザに対して特典を付与することにより、今後もサービス提供装置２０が提供するサービスを介して商品を購入するように促すことができる。 Note that, for example, when determining that the user will purchase a product based on the estimation result of the behavior estimation unit 116, the determination unit 118 outputs to the user information indicating a discount for the product purchased by the user or a coupon related to the discount. It may be something to do. Accordingly, the information processing apparatus 100 can encourage the user who purchases the product to continue to purchase the product through the service provided by the service providing apparatus 20 by giving the privilege.

［不正な決済が行われた場合の決定部１１８の処理］
また、分類器２００が、ユーザが行う所定の行動のうち、「サービスに入会するか」について出力するものであり、行動推定部１１６が、分類器２００から分類結果を取得し、その分類結果に基づいてユーザがサービスに入会すると推定する場合、決定部１１８は、サービスへの入会を促す案内情報をユーザに対して出力する。具体的には、決定部１１８は、推定の元になったアクセスログに係るユーザ（の端末装置１０）に案内情報を提供すると決定し、案内情報を送信するようにサービス提供装置２０に指示する。これにより、情報処理装置１００は、ユーザがサービスに入会することが推定される場合に、ユーザに適切にサービスの案内をすることができる。 [Processing of decision unit 118 when fraudulent payment is made]
In addition, the classifier 200 outputs "Do you want to join the service?" among the predetermined actions performed by the user. If it is estimated that the user will join the service based on the information, the determination unit 118 outputs guidance information to the user to prompt the user to join the service. Specifically, the determining unit 118 determines to provide guidance information to the user (the terminal device 10 of the user) associated with the access log that is the basis of the estimation, and instructs the service providing device 20 to transmit the guidance information. . Accordingly, the information processing apparatus 100 can appropriately guide the user about the service when it is estimated that the user will join the service.

［サービスに入会する場合の決定部１１８の処理］
また、分類器２００が、ユーザが行う所定の行動のうち、「不正な決済が行われたか」について出力するものであり、行動推定部１１６が、分類器２００から分類結果を取得し、その分類結果に基づいてユーザによって不正な決済が行われたと推定する場合、決定部１１８は、不正な決済によってユーザに購入された商品に係る決済処理を中止すると決定する。この場合、決定部１１８は、決済処理装置３０に対して当該決済処理の中止を指示する情報を送信する。決済処理装置３０は、決定部１１８から受信した情報に基づいて、不正な決済によってユーザに購入された商品に係る決済処理を中止する。これにより、情報処理装置１００は、不正な決済によってユーザが商品を購入したことが推定される場合に、当該購入に係る決済処理を中止することができる。 [Processing of decision unit 118 when joining a service]
In addition, the classifier 200 outputs “whether fraudulent payment was made” among the predetermined actions performed by the user. If it is estimated that the user made an unauthorized payment based on the result, the determination unit 118 decides to stop the payment processing for the product purchased by the user through the unauthorized payment. In this case, the determination unit 118 transmits information instructing the payment processing device 30 to stop the payment processing. Based on the information received from the determining unit 118, the payment processing device 30 stops the payment processing for the product purchased by the user through fraudulent payment. Accordingly, when it is presumed that the user has purchased a product through fraudulent payment, the information processing apparatus 100 can cancel the payment processing related to the purchase.

［アクセスログの取得タイミング］
また、上述では、取得部１１２が、セッション毎のアクセスログを取得する場合について説明したが、これに限られない。取得部１１２は、例えば、現在、サービス提供装置２０が提供するウェブページを閲覧しているアクセスログを、セッションが完結するよりも前に、リアルタイムに取得するものであってもよい。これにより、情報処理装置１００は、商品の購入を検討しながらウェブページを閲覧するユーザに対して、即座にクーポンを提供することができ、サービスへの入会を検討しながらウェブページを閲覧するユーザに対して、即座に案内情報を提供することができ、不正な決済で商品を購入しようとしているユーザの決済を抑制したりすることができる。 [Access log acquisition timing]
Also, in the above description, a case where the acquisition unit 112 acquires an access log for each session has been described, but the present invention is not limited to this. For example, the acquisition unit 112 may acquire an access log of current browsing of web pages provided by the service providing apparatus 20 in real time before the session is completed. As a result, the information processing apparatus 100 can immediately provide a coupon to a user who browses the web page while considering purchasing a product. , guidance information can be immediately provided, and payment by a user who intends to purchase a product by fraudulent payment can be suppressed.

［行動推定部１１６の処理の他の例］
また、上述した説明では、行動推定部１１６が、分類器２００の分類結果を参照し、ユーザが商品を購入することの確からしさを示すスコアと、ユーザが商品を購入しないことの確からしさを示すスコアとのうち、スコアが大きいほうの行動をユーザが行う、或いは行わないと推定したがこれに限られない。例えば、行動推定部１１６は、分類器２００の分類結果として示されるスコアの全部が、或いは所定数以上のスコアが閾値未満である場合、ＲＮＮ２４０に、一つ前の再帰処理を再び行わせ、レコードのスコアが閾値未満のレコードの出力ベクトルをあらためて導出させてよい。例えば、ＲＮＮ２４０が、処理周期ｔ１において導出した出力ベクトルｈ_ｔ１の複数の要素ｅのうち、最も値が大きい要素だけを残した出力ベクトルｈ_ｔ１を基に処理周期ｔ２において出力ベクトルｈ_ｔ２を導出したとする。このような場合に、処理周期ｔ２における出力ベクトルｈ_ｔ２の要素ｅがいずれも閾値未満であった場合、すなわち、２番目のレコードのスコアがいずれも閾値未満であった場合、行動推定部１１６は、ＲＮＮ２４０に、処理周期ｔ１の再帰処理を再び行わせる。この際、行動推定部１１６は、処理周期ｔ１において導出した出力ベクトルｈ_ｔ１の複数の要素ｅのうち、最も値が大きい要素を残すのではなく、例えば、２番目に値が大きい要素、或いは３番目や４番目に値が大きい要素を残した出力ベクトルｈ_ｔ１を基にして、ＲＮＮ２４０に出力ベクトルｈ_ｔ２を導出させる。 [Another example of processing of the action estimation unit 116]
Further, in the above description, the behavior estimation unit 116 refers to the classification result of the classifier 200, and indicates the score indicating the likelihood that the user will purchase the product and the likelihood that the user will not purchase the product. Although it is presumed that the user performs or does not perform the action with the higher score, the present invention is not limited to this. For example, when all of the scores indicated as the classification results of the classifier 200 or a predetermined number or more of scores are below the threshold, the behavior estimation unit 116 causes the RNN 240 to perform the previous recursive process again, and records The output vectors for records whose score is less than the threshold may be re-derived. For example, the RNN 240 derives the output vector ht2 in the processing cycle t2 based on the output vector _ht1 _in which only the element with the largest value is left among the plurality of elements e of the output vector _ht1 derived in the processing cycle t1. and In such a case, if the elements e of the output vector _ht2 in the processing period t2 are all less than the threshold, that is, if the score of the second record is all less than the threshold, the action estimation unit 116 , causes the RNN 240 to perform the recursive processing again at the processing cycle t1. At this time, the action estimation unit 116 does not leave the element with the largest value among the plurality of elements e of the output vector h _t1 derived in the processing period t1, but rather, for example, the element with the second largest value, or the element with the third largest value. The RNN 240 derives the output vector _ht2 based on the output vector _ht1 in which the elements with the largest values are left.

このように、最初のレコードよりも後に閲覧されたウェブページに係るレコードの分類結果が曖昧である場合、最初のレコードの分類が間違っている蓋然性が高いと判断して、最初のレコードのスコアのうち、二番目以降に確からしいレコードのスコアを基に、以降のレコードのスコアを再計算させる。これによって、最初のレコードの分類を間違ったことにより、それ以降の全てのレコードの分類を間違ってしまうという再帰処理特有の事象が発生するのを抑制することができる。 In this way, when the classification results of records related to web pages viewed after the first record are ambiguous, it is determined that there is a high probability that the classification of the first record is wrong, and the score of the first record is reduced. Based on the scores of the second and subsequent most likely records, the scores of subsequent records are recalculated. As a result, it is possible to suppress the occurrence of an event peculiar to recursive processing, in which all subsequent records are misclassified due to the misclassification of the first record.

［学習時の処理フロー］
以下、第１実施形態における情報処理装置１００の学習時の一連の流れをフローチャートに即して説明する。図７は、第１実施形態における情報処理装置１００による学習時の一連の処理の流れを示すフローチャートである。 [Processing flow during learning]
A series of flow during learning of the information processing apparatus 100 according to the first embodiment will be described below with reference to a flowchart. FIG. 7 is a flow chart showing a series of processing during learning by the information processing apparatus 100 according to the first embodiment.

まず、学習処理部１２０は、記憶部１３０に格納された教師データ１３４から、あるアクセスログを選択する（Ｓ２００）。教師データ１３４は、予め、「ユーザが商品を購入したこと」、「ユーザが商品を購入しなかったこと」、「ユーザがサービスに入会したこと」、「ユーザがサービスに入会しなかったこと」、「ユーザによって不正決済が行われたこと」、「ユーザによって不正決済が行われなかったこと」等の各種タグが教師ラベルとして対応付けられたアクセスログである。 First, the learning processing unit 120 selects an access log from the teacher data 134 stored in the storage unit 130 (S200). The teacher data 134 is prepared in advance with information such as "the user purchased the product", "the user did not purchase the product", "the user joined the service", and "the user did not join the service". , ``unauthorized payment was made by the user'', and ``no unauthorized payment was made by the user''.

次に、学習処理部１２０は、教師データ１３４からアクセスログを選択すると、そのアクセスログを分類器２００に入力する（Ｓ２０２）。 Next, when the learning processing unit 120 selects an access log from the teacher data 134, the learning processing unit 120 inputs the access log to the classifier 200 (S202).

次に、学習処理部１２０は、アクセスログを入力した分類器２００から出力結果、すなわち最尤タグベクトルを取得する（Ｓ２０４）。 Next, the learning processing unit 120 acquires the output result, that is, the maximum likelihood tag vector, from the classifier 200 to which the access log is input (S204).

次に、学習処理部１２０は、最尤タグベクトルによって示されるタグと、分類器２００に入力したアクセスログに教師ラベルとした対応付けられたタグとを比較し、双方のタグの組み合わせが一致するか否かを判定する（Ｓ２０６）。 Next, the learning processing unit 120 compares the tag indicated by the maximum likelihood tag vector with the tag associated with the access log input to the classifier 200 as the teacher label, and the combination of both tags matches. (S206).

学習処理部１２０は、タグの組み合わせが一致しないと判定した場合、誤差逆伝番等の勾配法に基づいて、分類器２００のパラメータを学習する（Ｓ２０８）。例えば、学習処理部１２０は、教師ラベルとした対応付けられたタグの組み合わせの尤度を最大値とし、その最大値から、最尤タグベクトルによって示されるタグの組み合わせの尤度を減算した差分が小さくなるように、分類器２００のパラメータを学習する。これによって、本フローチャートの処理が終了する。 When the learning processing unit 120 determines that the combination of tags does not match, it learns the parameters of the classifier 200 based on a gradient method such as error reverse propagation (S208). For example, the learning processing unit 120 sets the likelihood of the tag combination associated with the teacher label to the maximum value, and the difference obtained by subtracting the likelihood of the tag combination indicated by the maximum likelihood tag vector from the maximum value is Learn the parameters of the classifier 200 to be small. This completes the processing of this flowchart.

以上説明した第１実施形態の情報処理装置１００によれば、アクセスログからユーザの所定の行動を推定し、推定した行動に応じた情報を出力することにより、ユーザに期待する行動（商品の購入やサービスの入会）を促したり、不正な決済による商品の購入を抑制したりすることができる。 According to the information processing apparatus 100 of the first embodiment described above, by estimating a predetermined behavior of the user from the access log and outputting information according to the estimated behavior, the behavior expected of the user (purchase of a product) can be obtained. and membership of services), and suppress the purchase of products by fraudulent payment.

また、上述した第１実施形態によれば、教師ラベルとして正解となるカテゴリと共に、含まれるレコードの数が異なるアクセスログを学習データとして分類器２００を学習させるため、全てのレコードについて再帰処理を繰り返すため、ＲＮＮ２４０の再帰処理の回数というハイパーパラメータを調整せずとも、各アクセスログのレコードの数に応じて再帰処理の回数を動的に変更しながらスコアを求めることができ、更に、事前にアクセスログのレコードの数がわからなくとも、各レコードを分類することができる。 Further, according to the above-described first embodiment, the classifier 200 learns the classifier 200 using access logs containing different numbers of records as learning data together with categories that are correct as teacher labels, so recursive processing is repeated for all records. Therefore, the score can be obtained while dynamically changing the number of recursive processes according to the number of records in each access log without adjusting the hyperparameter of the number of recursive processes of the RNN 240. Even without knowing the number of records in the log, each record can be classified.

＜第２実施形態＞
以下、第２実施形態について説明する。上述した第１実施形態では、ＲＮＮ２４０が、ネットワークの中間層がＬＳＴＭであるものについて説明した。これに対して、第２実施形態では、ＲＮＮ２４０が、コンボリューショナルニューラルネットワークと、ネットワークの中間層がＬＳＴＭのＲＮＮとを組み合わせたニューラルネットワーク（以下、ＣＯＮＶＬＳＴＭと称する）である点で上述した第１から第３実施形態と相違する。以下、第１から第３実施形態との相違点を中心に説明し、第１から第３実施形態と共通する点については説明を省略する。なお、第２実施形態の説明において、第１から第３実施形態と同じ部分については同一符号を付して説明する。 <Second embodiment>
A second embodiment will be described below. In the above-described first embodiment, the RNN 240 has explained that the middle layer of the network is LSTM. On the other hand, in the second embodiment, the RNN 240 is a neural network (hereinafter referred to as CONVLSTM) that combines a convolutional neural network and an RNN whose middle layer of the network is an LSTM. are different from the third embodiment. In the following, differences from the first to third embodiments will be mainly described, and descriptions of points common to the first to third embodiments will be omitted. In addition, in the description of the second embodiment, the same parts as those of the first to third embodiments are denoted by the same reference numerals.

第２実施形態では、例えば、第１ＣＮＮ２１０、及び第２ＣＮＮ２２０のそれぞれについてプーリング処理が省略されたり、ＦＣＮＮ２３０が省略されたりすることで、分類器２００の上段に設けられた各種ニューラルネットワーク等から、ＲＮＮ２４０に３階テンソルが入力される。 In the second embodiment, for example, by omitting the pooling process for each of the first CNN 210 and the second CNN 220, or by omitting the FCNN 230, from various neural networks provided in the upper stage of the classifier 200, the RNN 240 A third order tensor is input.

第２実施形態におけるＲＮＮ２４０は、例えば、コンボリューショナルニューラルネットワークと、ネットワークの中間層がＬＳＴＭであるＲＮＮとの組み合わせたニューラルネットワーク（以下、ＣＯＮＶＬＳＴＭと称する）であり、３階テンソルが入力されると、そのテンソルの元になったタイトルや概要、画像に対応する商品の分類先とするカテゴリの確からしさ（尤もらしさ）を表すスコアをレコード毎に出力するように学習されたニューラルネットワークである。ＣＯＮＶＬＳＴＭは、上述したＬＳＴＭの各ゲートの計算式において、重みと状態変数との内積計算が畳み込み計算に変更されたものである。数式（２）は、ＣＯＮＶＬＳＴＭのゲートおよびニューロンの計算式を表している。 The RNN 240 in the second embodiment is, for example, a neural network (hereinafter referred to as CONVLSTM) that combines a convolutional neural network and an RNN whose middle layer of the network is an LSTM. , is a neural network trained to output a score representing the probability (likelihood) of the category into which the product corresponding to the image is classified, for each record. CONVLSTM is obtained by changing the calculation of the inner product of the weight and the state variable to the convolution calculation in the calculation formula for each gate of the LSTM described above. Equation (2) expresses the calculation of the CONVLSTM gates and neurons.

第２実施形態におけるｘ_ｔは、処理周期ｔにおいて入力される３階テンソル（以下、入力テンソル）を表し、ｈ_ｔは、処理周期ｔにおいてＲＮＮ２４０により出力される３階テンソル（以下、出力テンソル）を表している。 x _t in the second embodiment represents a third-order tensor (hereinafter, input tensor) input in processing cycle t, and h _t represents a third-order tensor (hereinafter, output tensor) output by RNN 240 in processing cycle t. represents.

例えば、第２実施形態におけるＲＮＮ２４０は、重みＷ_ｉと入力テンソルｘ_ｔとの畳み込み値と、重みＲ_ｉと前回の出力テンソルｈ_ｔ－１との畳み込み値と、重みＱ_ｉと前回の内部演算テンソル（３階テンソル）ｃ_ｔ－１とのアダマール積（要素毎の積）と、バイアス成分ｂ_ｉとの和を変数としたシグモイド関数σを解くことで、処理周期ｔのインプットゲートテンソル（３階テンソル）ｉ_ｔを導出する。 For example, the RNN 240 in the second embodiment includes the convolution value of the weight W _i and the input tensor x _t , the convolution value of the weight R _i and the previous output tensor h _t−1 , the weight Q _i and the previous internal operation The _input gate tensor ₍ 3 Derive the rank tensor) i _t .

また、ＲＮＮ２４０は、重みＷ_ｆと入力テンソルｘ_ｔとの畳み込み値と、重みＲ_ｆと前回の出力ベクトルｈ_ｔ－１との畳み込み値と、重みＱ_ｆと前回の内部演算テンソルｃ_ｔ－１とのアダマール積と、バイアス成分ｂ_ｆとの和を変数としたシグモイド関数σを解くことで、処理周期ｔのフォーゲットゲートテンソルｆ_ｔを導出する。 In addition, the RNN 240 stores the convolution value of the weight W _f and the input tensor x _t , the convolution value of the weight R _f and the previous output vector h _t-1 , the weight Q _f and the previous internal operation tensor c _t-1 By solving the sigmoid function σ whose variable is the sum of the Hadamard product of and the bias component _bf , the forget gate tensor f _t of the processing period t is derived.

また、ＲＮＮ２４０は、処理周期ｔのフォーゲットゲートテンソルｆ_ｔおよびテンソルｚ_ｔのアダマール積と、処理周期ｔ－１の内部演算テンソルｃ_ｔ‐１および処理周期ｔのフォーゲットゲートテンソルｆ_ｔのアダマール積との和を、処理周期ｔの内部演算テンソルｃ_ｔとして導出する。 In addition, the RNN 240 generates the Hadamard product of the forget gate tensor f _t and the tensor _z _t in the processing period t, and the _Hadamard The sum with the product is derived as an internal computation tensor c _t with processing period t.

ｚ_ｔは、重みＷ_ｚと入力テンソルｘ_ｔとの畳み込み値と、重みＲ_ｚと前回の出力テンソルｈ_ｔ－１との畳み込み値と、バイアス成分ｂ_ｚとの和を変数としたハイパポリックタンジェント関数ｔａｎｈを解くことで導出される。 z _t is a hyperpolic whose variable is the sum of the convolution value of the weight W _z and the input tensor x _t , the convolution value of the weight R _z and the previous output tensor h _t−1 , and the bias component b _z It is derived by solving the tangent function tanh.

また、ＲＮＮ２４０は、重みＷ_ｏと入力テンソルｘ_ｔとの畳み込み値と、重みＲ_ｏと前回の出力テンソルｈ_ｔ－１との畳み込み値と、重みＱ_ｏと今回の内部演算テンソルｃ_ｔとのアダマール積と、バイアス成分ｂ_ｏとの和を変数としたシグモイド関数σを解くことで、処理周期ｔのアウトプットゲートテンソル（３階テンソル）ｏ_ｔを導出する。 In addition, the RNN 240 generates the convolution value of the weight W _o and the input tensor x _t , the convolution value of the weight R _o and the previous output tensor h _t−1 , and the weight Q _o and the current internal calculation tensor c _t . By solving the sigmoid function σ with the sum of the Hadamard product and the bias component _bo as variables, the output gate tensor (third-order tensor) o _t of the processing cycle t is derived.

ＲＮＮ２４０は、アウトプットゲートテンソルｏ_ｔを導出すると、アウトプットゲートテンソルｏ_ｔと、内部演算テンソルｃ_ｔを変数としたハイパポリックタンジェント関数ｔａｎｈの解とのアダマール積を、処理周期ｔの出力テンソルｈ_ｔとして導出する。 After deriving the output gate tensor o _t , the RNN 240 obtains the Hadamard product of the output gate tensor o _t and the solution of the hyperpolic tangent function tanh with the internal computation tensor c _t as a variable, to obtain the output tensor of the processing period t h _t .

出力テンソルｈ_ｔの各要素値は、上述した実施形態の出力ベクトルと同様に、各レコードのスコアを表している。ＲＮＮ２４０は、上述した数式に基づく計算を繰り返すことで、出力テンソルを繰り返し導出する。 Each element value of the output tensor _ht represents the score of each record, similar to the output vector of the embodiment described above. The RNN 240 iteratively derives the output tensor by repeating the calculations based on the formulas described above.

以上説明した第２実施形態によれば、分類器２００のＲＮＮ２４０を、中間層がＣＯＮＶＬＳＴＭであるＲＮＮとするため、上述した第１実施形態と比べて、更に適切なユーザの所定の行動に分類することができ、その結果、決定部１１８は、より適切な情報を提供すると決定することができる。 According to the second embodiment described above, since the RNN 240 of the classifier 200 is an RNN whose intermediate layer is CONVLSTM, it is possible to classify the user's predetermined behavior more appropriately than in the first embodiment described above. and, as a result, the decision unit 118 may decide to provide more relevant information.

＜ハードウェア構成＞
上述した実施形態の情報処理装置１００は、例えば、図８に示すようなハードウェア構成により実現される。図８は、実施形態の情報処理装置１００のハードウェア構成の一例を示す図である。 <Hardware configuration>
The information processing apparatus 100 of the embodiment described above is implemented by, for example, a hardware configuration as shown in FIG. FIG. 8 is a diagram illustrating an example of the hardware configuration of the information processing apparatus 100 according to the embodiment.

情報処理装置１００は、ＮＩＣ１００－１、ＣＰＵ１００－２、ＲＡＭ１００－３、ＲＯＭ１００－４、フラッシュメモリやＨＤＤ等の二次記憶装置１００－５、およびドライブ装置１００－６が、内部バスあるいは専用通信線によって相互に接続された構成となっている。ドライブ装置１００－６には、光ディスク等の可搬型記憶媒体が装着される。二次記憶装置１００－５、またはドライブ装置１００－６に装着された可搬型記憶媒体に格納されたプログラムがＤＭＡコントローラ（不図示）等によってＲＡＭ１００－３に展開され、ＣＰＵ１００－２によって実行されることで、制御部１１０が実現される。制御部１１０が参照するプログラムは、ネットワークＮＷを介して他の装置からダウンロードされてもよい。 The information processing apparatus 100 includes a NIC 100-1, a CPU 100-2, a RAM 100-3, a ROM 100-4, a secondary storage device 100-5 such as a flash memory or an HDD, and a drive device 100-6 connected via an internal bus or a dedicated communication line. are interconnected by A portable storage medium such as an optical disc is loaded in the drive device 100-6. A program stored in the secondary storage device 100-5 or a portable storage medium attached to the drive device 100-6 is expanded in the RAM 100-3 by a DMA controller (not shown) or the like, and executed by the CPU 100-2. Thus, the control unit 110 is realized. The program referred to by control unit 110 may be downloaded from another device via network NW.

以上、本発明を実施するための形態について実施形態を用いて説明したが、本発明はこうした実施形態に何ら限定されるものではなく、本発明の要旨を逸脱しない範囲内において種々の変形及び置換を加えることができる。 As described above, the mode for carrying out the present invention has been described using the embodiments, but the present invention is not limited to such embodiments at all, and various modifications and replacements can be made without departing from the scope of the present invention. can be added.

１…情報処理システム、１０…端末装置、２０…サービス提供装置、３０…決済処理装置、１００…情報処理装置、１００－５…二次記憶装置、１００－６…ドライブ装置、１０２…通信部、１１０…制御部、１１２…取得部、１１４…形態素解析部、１１６…行動推定部、１１８…決定部、１２０…学習処理部、１３０…記憶部、１３２…分類器情報、１３４…教師データ、２００…分類器 Reference Signs List 1 information processing system 10 terminal device 20 service providing device 30 payment processing device 100 information processing device 100-5 secondary storage device 100-6 drive device 102 communication unit DESCRIPTION OF SYMBOLS 110... Control part, 112... Acquisition part, 114... Morphological analysis part, 116... Action estimation part, 118... Decision part, 120... Learning process part, 130... Storage part, 132... Classifier information, 134... Teacher data, 200 …classifier

Claims

ネットワークを介したウェブページへのアクセスログを取得する取得部と、
アクセスログにおける所定の行動の有無を教師データとして、入力されたアクセスログにおいて前記所定の行動が行われるかを出力するように学習された分類器に、前記取得されたアクセスログを入力することで得られる出力結果に基づいて、前記取得部により取得された前記アクセスログに示されるウェブページを閲覧した閲覧者に提供すべき情報を決定する決定部と、
を備え、
前記アクセスログには、前記閲覧者が閲覧したウェブページの識別情報と、前記ウェブページをユーザが閲覧した時間の長さである閲覧時間とが、互いに対応付けられたレコードが少なくとも含まれるものであり、
前記分類器は、ＲＮＮ（Recurrent Neural Network：回帰型ニューラルネットワーク）により、前記ウェブページにアクセスするセッションの最後のレコードに対して所定のエンドラベルが付与されたデータを用いて学習されたものであり、１以上のアクセスログの入力に対してそれぞれ前記エンドラベルが付与されたレコードのスコアを出力するまで再帰処理を繰り返す、
情報処理装置。 an acquisition unit that acquires an access log to a web page via a network;
By inputting the obtained access log to a classifier trained to output whether or not the predetermined action is performed in the input access log, using the presence or absence of the predetermined action in the access log as teacher data. a determination unit that determines information to be provided to a viewer who viewed the web page indicated in the access log acquired by the acquisition unit, based on the obtained output result;
with
The access log includes at least a record in which identification information of the web page viewed by the viewer and viewing time, which is the length of time the user viewed the web page, are associated with each other. can be,
The classifier is trained by an RNN (Recurrent Neural Network) using data in which a predetermined end label is assigned to the last record of a session of accessing the web page. , repeating recursive processing until the score of the record to which the end label is assigned is output for each of the inputs of one or more access logs;
Information processing equipment.

前記所定の行動には、商品またはサービスを購入することが含まれ、
前記決定部は、前記分類器の出力結果に基づいて、前記閲覧者に対して商品またはサービスの購入に係る特典を前記提供するべき情報として決定する、
請求項１に記載の情報処理装置。 the predetermined action includes purchasing a product or service;
The decision unit decides, based on the output result of the classifier, a benefit related to the purchase of a product or service to the viewer as the information to be provided;
The information processing device according to claim 1 .

前記所定の行動には、サービスに入会することが含まれ、
前記決定部は、前記分類器の出力結果に基づいて、前記閲覧者に対してサービスへの入会を促す案内情報を前記提供すべき情報として決定する、
請求項１又は請求項２に記載の情報処理装置。 the predetermined action includes joining a service;
The decision unit decides, as the information to be provided, guide information that prompts the viewer to join the service, based on the output result of the classifier.
The information processing apparatus according to claim 1 or 2.

ネットワークを介したウェブページへのアクセスログを取得する取得部と、
アクセスログにおける商取引での不正決済の有無を教師データとして、入力されたアクセスログにおいて前記不正決済が行われたかを出力するように学習された分類器に、前記取得されたアクセスログを入力することで得られる出力結果に基づいて、前記取得部によって取得された前記アクセスログに示されるウェブページを閲覧した閲覧者によって前記不正決済が行われたか否かを推定し、前記不正決済が行われたと推定した場合、前記閲覧者による購入に係る処理を実行する外部装置に対して、処理の中止を指示することを決定する決定部と、
を備え、
前記アクセスログには、前記閲覧者が閲覧したウェブページの識別情報と、前記ウェブページをユーザが閲覧した時間の長さである閲覧時間とが、互いに対応付けられたレコードが少なくとも含まれるものであり、
前記分類器は、ＲＮＮ（Recurrent Neural Network：回帰型ニューラルネットワーク）により、前記ウェブページにアクセスするセッションの最後のレコードに対して所定のエンドラベルが付与されたデータを用いて学習されたものであり、１以上のアクセスログの入力に対してそれぞれ前記エンドラベルが付与されたレコードのスコアを出力するまで再帰処理を繰り返す、
情報処理装置。 an acquisition unit that acquires an access log to a web page via a network;
Inputting the acquired access log to a classifier trained to output whether or not the fraudulent payment has been made in the input access log, using the presence or absence of fraudulent payment in the commercial transaction in the access log as training data. Based on the output result obtained in , it is estimated whether or not the fraudulent payment has been made by the viewer who viewed the web page indicated in the access log acquired by the acquisition unit, and it is determined that the fraudulent payment has been made. a determination unit that determines, when estimated, to instruct an external device that executes a process related to purchase by the viewer to stop the process;
with
The access log includes at least a record in which identification information of the web page viewed by the viewer and viewing time, which is the length of time the user viewed the web page, are associated with each other. can be,
The classifier is trained by an RNN (Recurrent Neural Network) using data in which a predetermined end label is assigned to the last record of a session of accessing the web page. , repeating recursive processing until the score of the record to which the end label is assigned is output for each of the inputs of one or more access logs;
Information processing equipment.

コンピュータが、
ネットワークを介したウェブページへのアクセスログを取得し、
アクセスログにおける所定の行動の有無を教師データとして、入力されたアクセスログにおいて前記所定の行動が行われるかを出力するように学習された分類器に、前記取得されたアクセスログを入力することで得られる出力結果に基づいて、取得された前記アクセスログに示されるウェブページを閲覧した閲覧者に提供すべき情報を決定し、
前記アクセスログには、前記閲覧者が閲覧したウェブページの識別情報と、前記ウェブページをユーザが閲覧した時間の長さである閲覧時間とが、互いに対応付けられたレコードが少なくとも含まれるものであり、
前記分類器は、ＲＮＮ（Recurrent Neural Network：回帰型ニューラルネットワーク）により、前記ウェブページにアクセスするセッションの最後のレコードに対して所定のエンドラベルが付与されたデータを用いて学習されたものであり、１以上のアクセスログの入力に対してそれぞれ前記エンドラベルが付与されたレコードのスコアを出力するまで再帰処理を繰り返す、
情報処理方法。 the computer
Acquire access logs to web pages through the network,
By inputting the obtained access log to a classifier trained to output whether or not the predetermined action is performed in the input access log, using the presence or absence of the predetermined action in the access log as teacher data. Based on the obtained output result, determine information to be provided to the viewer who viewed the web page indicated in the acquired access log;
The access log includes at least a record in which identification information of the web page viewed by the viewer and viewing time, which is the length of time the user viewed the web page, are associated with each other. can be,
The classifier is trained by an RNN (Recurrent Neural Network) using data in which a predetermined end label is assigned to the last record of a session of accessing the web page. , repeating recursive processing until the score of the record to which the end label is assigned is output for each of the inputs of one or more access logs;
Information processing methods.

コンピュータが、
ネットワークを介したウェブページへのアクセスログを取得し、
アクセスログにおける商取引での不正決済の有無を教師データとして、入力されたアクセスログにおいて前記不正決済が行われたかを出力するように学習された分類器に、前記取得されたアクセスログを入力することで得られる出力結果に基づいて、取得された前記アクセスログに示されるウェブページを閲覧した閲覧者によって前記不正決済が行われたか否かを推定し、
前記不正決済が行われたと推定した場合、前記閲覧者による購入に係る処理を実行する外部装置に対して、処理の中止を指示することを決定し、
前記アクセスログには、前記閲覧者が閲覧したウェブページの識別情報と、前記ウェブページをユーザが閲覧した時間の長さである閲覧時間とが、互いに対応付けられたレコードが少なくとも含まれるものであり、
前記分類器は、ＲＮＮ（Recurrent Neural Network：回帰型ニューラルネットワーク）により、前記ウェブページにアクセスするセッションの最後のレコードに対して所定のエンドラベルが付与されたデータを用いて学習されたものであり、１以上のアクセスログの入力に対してそれぞれ前記エンドラベルが付与されたレコードのスコアを出力するまで再帰処理を繰り返す、
情報処理方法。 the computer
Acquire access logs to web pages through the network,
Inputting the acquired access log to a classifier trained to output whether or not the fraudulent payment has been made in the input access log, using the presence or absence of fraudulent payment in the commercial transaction in the access log as training data. based on the output result obtained in , estimating whether the unauthorized payment was made by the viewer who viewed the web page indicated in the acquired access log;
deciding to instruct the external device that executes the processing related to the purchase by the viewer to stop the processing when it is estimated that the fraudulent payment has been made;
The access log includes at least a record in which identification information of the web page viewed by the viewer and viewing time, which is the length of time the user viewed the web page, are associated with each other. can be,
The classifier is trained by an RNN (Recurrent Neural Network) using data in which a predetermined end label is assigned to the last record of a session of accessing the web page. , repeating recursive processing until the score of the record to which the end label is assigned is output for each of the inputs of one or more access logs;
Information processing methods.

コンピュータに、
ネットワークを介したウェブページへのアクセスログを取得させ、
アクセスログにおける所定の行動の有無を教師データとして、入力されたアクセスログにおいて前記所定の行動が行われるかを出力するように学習された分類器に、前記取得されたアクセスログを入力することで得られる出力結果に基づいて、取得された前記アクセスログに示されるウェブページを閲覧した閲覧者に提供すべき情報を決定させ、
前記アクセスログには、前記閲覧者が閲覧したウェブページの識別情報と、前記ウェブページをユーザが閲覧した時間の長さである閲覧時間とが、互いに対応付けられたレコードが少なくとも含まれるものであり、
前記分類器は、ＲＮＮ（Recurrent Neural Network：回帰型ニューラルネットワーク）により、前記ウェブページにアクセスするセッションの最後のレコードに対して所定のエンドラベルが付与されたデータを用いて学習されたものであり、１以上のアクセスログの入力に対してそれぞれ前記エンドラベルが付与されたレコードのスコアを出力するまで再帰処理を繰り返す、
プログラム。 to the computer,
Acquire the access log to the web page through the network,
By inputting the obtained access log to a classifier trained to output whether or not the predetermined action is performed in the input access log, using the presence or absence of the predetermined action in the access log as teacher data. determining information to be provided to a viewer who viewed the web page indicated in the obtained access log, based on the obtained output result;
The access log includes at least a record in which identification information of the web page viewed by the viewer and viewing time, which is the length of time the user viewed the web page, are associated with each other. can be,
The classifier is trained by an RNN (Recurrent Neural Network) using data in which a predetermined end label is assigned to the last record of a session of accessing the web page. , repeating recursive processing until the score of the record to which the end label is assigned is output for each of the inputs of one or more access logs;
program.

コンピュータに、
ネットワークを介したウェブページへのアクセスログを取得させ、
アクセスログにおける商取引での不正決済の有無を教師データとして、入力されたアクセスログにおいて前記不正決済が行われたかを出力するように学習された分類器に、前記取得されたアクセスログを入力することで得られる出力結果に基づいて、取得された前記アクセスログに示されるウェブページを閲覧した閲覧者によって前記不正決済が行われたか否かを推定させ、
前記不正決済が行われたと推定した場合、前記閲覧者による購入に係る処理を実行する外部装置に対して、処理の中止を指示することを決定させ、
前記アクセスログには、前記閲覧者が閲覧したウェブページの識別情報と、前記ウェブページをユーザが閲覧した時間の長さである閲覧時間とが、互いに対応付けられたレコードが少なくとも含まれるものであり、
前記分類器は、ＲＮＮ（Recurrent Neural Network：回帰型ニューラルネットワーク）により、前記ウェブページにアクセスするセッションの最後のレコードに対して所定のエンドラベルが付与されたデータを用いて学習されたものであり、１以上のアクセスログの入力に対してそれぞれ前記エンドラベルが付与されたレコードのスコアを出力するまで再帰処理を繰り返す、
プログラム。 to the computer,
Acquire the access log to the web page through the network,
Inputting the acquired access log to a classifier trained to output whether or not the fraudulent payment has been made in the input access log, using the presence or absence of fraudulent payment in the commercial transaction in the access log as training data. Based on the output result obtained in , to estimate whether the unauthorized payment was made by the viewer who browsed the web page indicated in the acquired access log,
If it is estimated that the fraudulent payment has been made, determine to instruct the external device that executes the process related to the purchase by the viewer to stop the process,
The access log includes at least a record in which identification information of the web page viewed by the viewer and viewing time, which is the length of time the user viewed the web page, are associated with each other. can be,
The classifier is trained by an RNN (Recurrent Neural Network) using data in which a predetermined end label is assigned to the last record of a session of accessing the web page. , repeating recursive processing until the score of the record to which the end label is assigned is output for each of the inputs of one or more access logs;
program.