CN114528490A

CN114528490A - Self-supervision sequence recommendation method based on long-term and short-term interests of user

Info

Publication number: CN114528490A
Application number: CN202210151706.2A
Authority: CN
Inventors: 王也; 阎震; 韩启龙; 宋洪涛; 李丽洁; 王宇华; 马志强
Original assignee: Harbin Engineering University
Current assignee: Harbin Engineering University
Priority date: 2022-02-18
Filing date: 2022-02-18
Publication date: 2022-05-24

Abstract

The invention discloses a self-supervision sequence recommendation method based on long-term and short-term interests of a user. Step 1: acquiring a sequence data set of user information, project information and user behaviors, preprocessing the data set, and dividing the data set into a training set and a test set; and 2, step: constructing a self-supervision sequence recommendation model based on long-term and short-term interests of a user; and step 3: training the self-supervision sequence recommendation model based on the long-term and short-term interests of the user in the step 2 by using a training set; and 4, step 4: inputting the personal information and the interaction sequence of the user to be recommended into the self-supervision sequence recommendation model based on the long-term and short-term interests of the user trained in the step 3, calculating the recommendation score of the item to be recommended relative to the user, and recommending the item to the user according to the recommendation score. The invention is used for solving the problem of mutual constraint relation between long-term interest and short-term interest of the user in the prior art, and realizing more accurate recommendation of the long-term interest and the short-term interest of the user.

Description

Self-supervision sequence recommendation method based on long-term and short-term interests of user

Technical Field

The invention belongs to the field of sequence recommendation, and particularly relates to a self-supervision sequence recommendation method based on long-term and short-term interests of a user.

Background

The behavior data of people in the internet can indicate important information, such as user preferences and behavior patterns, and a recommendation system can use the information to provide personalized services for users and improve the experience of the users.

The modeling of user interaction with an item by conventional sequence recommendation systems can be generalized into two main approaches. The first approach is Collaborative Filtering (CF) based on matrix factorization to obtain user preferences, focusing on mining the static associations of users from their interactions with items, which are represented by a traditional collaborative filtering model. However, these works only take into account the specific user-item relationships from the static view, neglecting the evolution of user preferences implicit in the serialized interactions, and do not take into account the impact of the evolution of user preferences on future purchases. The second method is to mine the relationship between the user and the item based on the sequence pattern to make personalized recommendation. Among them, the user's steady long-term interest is a preference caused by personal habits for a long time; short-term interest is a preference determined by items recently purchased by the user. This type of work includes: and modeling an interaction sequence of the user and the commodity according to the Markov chain model. The long-term interest and the short-term interest of the user play an important role in commodity selection of the user, so that the accuracy can be effectively improved by combining the long-term interest and the short-term interest into recommendation of the user.

Deep neural networks are increasingly used by many to construct sequence recommendation systems by taking advantage of the natural nature of the integrated relationships between different entities (e.g., users, items, interactions) in building and capturing sequences.

For sequence recommendation systems, recurrent neural networks were originally proposed and valued by researchers because of their structural advantages in modeling sequence data, but they also suffer from the drawback that higher order complex relationships cannot be modeled. Then, the convolutional neural network and the graph neural network are also applied to a sequence recommendation system for modeling complex interaction relation, so that the defects in the cyclic neural network are overcome.

In recent years, people pay more and more attention to improving the sequence recommendation performance by capturing the long-term and short-term interests of users, but the prior art only considers the complementary relation of the long-term and short-term interests of the users and does not consider the mutual constraint relation between the long-term and short-term interests of the users. In addition, the existing recommendation technology using the hypergraph can better learn high-order context information so as to better model the short-term interest of the user, but does not fully consider the sequence information of user interaction items, and particularly does not consider the utilization of the sequence information when constructing the hypergraph.

In summary, the current research work mainly provides a more accurate modeling method for the long-term interest and the short-term interest of the user, and can fully utilize the complementary and constrained relationships between the long-term interest and the short-term interest of the user to achieve a better recommendation effect.

Disclosure of Invention

The invention provides a self-supervision sequence recommendation method based on long-term and short-term interests of a user, which is used for solving the problem of mutual constraint relation between the long-term and short-term interests of the user in the prior art and realizing more accurate recommendation of the long-term and short-term interests of the user.

The invention is realized by the following technical scheme:

a self-supervision sequence recommendation method based on long-term and short-term interests of a user comprises the following steps:

step 1: acquiring a sequence data set of user information, project information and user behaviors, preprocessing the data set, and dividing the data set into a training set and a test set;

step 2: constructing a self-supervision sequence recommendation model based on long-term and short-term interests of a user;

and step 3: training the self-supervision sequence recommendation model based on the long-term and short-term interests of the user in the step 2 by using a training set;

and 4, step 4: inputting the personal information and the interaction sequence of the user to be recommended into the self-supervision sequence recommendation model based on the long-term and short-term interests of the user trained in the step 3, calculating the recommendation score of the item to be recommended relative to the user, and recommending the item to the user according to the recommendation score.

Further, the step 1 specifically includes the following steps:

step 1.1: dividing a user interaction sequence into a long-term sequence L and a short-term sequence S;

step 1.2: constructing a hypergraph G according to the short-term sequence S divided in the step 1.1;

let G ═ V, E denote a hypergraph, where the set of V contains N vertices, each vertex representing an item, E contains M hyperedges, each hyperedge epsilon E represents a session and contains five vertices; each super-edge is assigned a forward weight W_εεRepresenting the weight occupied by each item in the super edge, and forming all weights into a diagonal matrix W epsilon R^M×M(ii) a The hypergraph is represented by a correlation matrix H ∈ R^N×MIf the hyper-edge ε E contains a vertex v_iE is V then H _iε1, otherwise 0; for each vertex and hyper-edge, their degree is defined as

Definition D_hAnd B are both diagonal matrices; to utilize the sequence relation between the item interactions in the conversation, a degree matrix D in the sequence sense is defined_p∈R^N×NRegarding the former as influencing the latter through the context of the interactive items in each session;

standardizing for each item:

obtaining a final vertex degree matrix:

D＝2×(D_h+D_p)

further, the step 2 of constructing an automatic supervision sequence recommendation model based on long-term and short-term interests of the user specifically comprises the following steps:

step 2.1: obtaining an embedded representation of an item;

step 2.2: inputting the user long-term sequence L marked out in the step 1.1 and the project embedding expression into a GRU layer, and capturing the long-term interest theta of the user through a feedforward neural network layer_l；

Step 2.3: convolving the hypergraph G constructed in the step 1.2, and obtaining a short-term interest expression theta of the user by the convolution result and the short-term interaction sequence S of the user divided in the step 1.1 through a soft attention layer_s；

Step 2.4: the long-term interest theta of the user obtained in the step 2.2_lAnd short term interest θ from step 2.3_sObtaining an auto-supervised loss L by an auto-supervised learning layer_S；

Step 2.5: the long-term interest theta of the user obtained in the step 2.2_lAnd short term interest θ from step 2.3_sObtaining a final user representation theta through the fusion layer;

step 2.6: and calculating the scores of the candidate items according to the final representation of the user.

Further, the step 2.2 is specifically to model the long-term preference of the user according to the long-term sequence partitioned in the step 1.1, capture the evolution in the long-term sequence by using a GRU network, and represent θ by taking the last hidden unit state as the long-term preference of the user_l。

Further, the step 2.3 performs hypergraph convolution, and the hypergraph convolution update item represents:

the hypergraph convolution takes a formula from right to left as a convolution process from an item to a hyperedge to an item;

representing slave verticesInformation aggregation to the super edge, and then left multiplication H is carried out to aggregate the information from the super edge to the top point; after L-layer convolution, the average of all layer results is taken as the representation of the final item

For one session s ═ i_s,1,i_s,2,...,i_s,m]Aggregating the items in the conversation to obtain a representation θ of the end user's short-term interest_s：

α_t＝f^Tσ(W₁x_s+W₂x_t+b

Wherein x_sIndicating that the current session is averaged from items within the session, x_tRepresenting the tth interactive item in the conversation, wherein the short-term interest representation is obtained by the items in the conversation through a soft attention mechanism; f is an element of R^d，W₁∈R^d×dAnd W₂∈R^d×dAre the attention parameters used to learn the weights.

Further, the step 2.4 is specifically,

the method of comparative learning is used for fully playing the constraint relationship between the positive sample and the negative sample and calculating the loss by adopting a standard binary system cross entropy loss function between the positive sample and the negative sample

Wherein

Is to

Is rearranged in rows and columns to obtain negative samples, f_D(·):R^d×d→ R is a discriminator function that takes two vectors as input and scores the agreement of the two.

Further, said step 2.5 is specific in that, since step 2.4 effectively facilitates obtaining each other's information more accurately, the resulting representation of the user is obtained in an additive manner:

further, the step 2.6 obtains the scores of the items by multiplying the final user representation obtained in the step 2.5 by the item set points

Calculating the probability of each item appearing through softmax

Further, the step 3 specifically includes the following steps:

step 3.1: inputting data in the training set into the self-supervision sequence recommendation model based on the long-term and short-term interests of the user in the step 2 to obtain a final expression vector of the user;

step 3.2: inputting the final expression vector of the user into a prediction module to obtain a recommendation score of the user relative to the item;

step 3.3: updating the parameters of the model by calculating the error between the predicted score value and the true score value to optimize the recommendation loss function, combining the self-supervision loss in step 2.4 to obtain the final loss

Wherein β is a weight set to 0.01; and repeatedly training to obtain an optimal self-supervision sequence recommendation model based on long-term and short-term interests of the user.

Further, the step 4: inputting the personal information and the interaction sequence of the user to be recommended into the self-supervision sequence recommendation model based on the long-term and short-term interests of the user trained in the step 3, calculating the recommendation score of the item to be recommended relative to the user, and recommending the item to the user according to the recommendation score.

The invention has the beneficial effects that:

the invention fully utilizes the constraint relation between the long-term interest and the short-term interest of the user and considers the sequence relation in the short-term conversation during the construction of the hypergraph so as to achieve better recommendation effect.

According to the method and the device, the constraint relation between the long-term interest and the short-term interest of the user is fully exerted through self-supervision learning, and the recommendation accuracy rate is further improved.

Drawings

FIG. 1 is a schematic flow diagram of the process of the present invention.

FIG. 2 is a diagram of a user long-short term interest self-supervised recommendation model of the present invention.

Fig. 3 is a short-term session hypergraph diagram of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be described clearly and completely with reference to the accompanying drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

A sequence recommendation system based on three deep neural networks.

1) A recurrent neural network-based sequence recommendation system. Given a historical sequence of user-commodity interactions, a recurrent neural network-based sequence recommendation system attempts to build a sequence dependency through a given interaction, thereby predicting the next possible interaction. The GRU4Rec model introduces a GRU network into sequence recommendation, models a user interaction sequence through the GRU network, learns the evolution of user interest, recommends the user according to the evolution of the user interest, and obtains a very good recommendation effect by utilizing the excellent performance of the recurrent neural network on the modeling of a sequence relation, but does not recommend the user interest by considering the long-term interest and the short-term interest together.

2) A convolutional neural network based sequence recommendation system. Convolutional Neural Networks (CNNs) can be used to extract features from text, audio, and pictures. Unlike the recurrent neural network, given the order of a user-commodity interaction, the convolutional neural network first embeds all the interactions into a matrix, and then treats this matrix as a picture in time and in potential space. Finally, the convolutional neural network learns this sequence pattern as a local feature of this picture, using convolutional filtering for subsequent recommendations. The Caser model utilizes a convolutional neural network to model the short-term interest of the user, takes the latest L items of the user interaction as the short-term sequences of the user interaction, respectively carries out horizontal convolution and vertical convolution on the embedding of the short-term sequences of the user, carries out splicing operation on the convolution results, takes the spliced embedding as the short-term interest of the user, also adopts a splicing mode when the long-term interest and the short-term interest of the user are fused, and takes the spliced embedding as the final interest representation of the user so as to recommend the user the articles meeting the interest of the user. Compared with a cyclic neural network, the convolutional neural network has the advantage that higher-dimensionality information can be captured by the convolutional neural network, and compared with a sequence recommendation model which is recommended by the cyclic neural network, the sequence recommendation model is relatively good at the moment, the experimental effect on a plurality of data sets is not weaker than that of other sequence recommendation models, but only the complementary relation among long-term and short-term interests of a user is considered, the constraint relation is not considered, and meanwhile, the short-term interest modeling of the user is too simple.

3) A sequence recommendation system based on a graph neural network. With the rapid development of graph neural networks, sequence recommendation systems based on graph neural networks have been designed that utilize graph neural networks to model and capture the transitions of more complex user-commodity interactions in sequences. When each sequence is mapped onto a path, a directed graph is first built over the sequence data, and each interaction is treated as a node in the graph. The embedding of the user or goods is then learned on the graph in order to embed more complex relationships throughout the graph. The SRGNN model constructs an interaction sequence of a user into a directed graph, and takes the sequence relation in the interaction sequence of the user into consideration as the directional relation in the directed graph during composition, the constructed directed graph can be used for clearly knowing which other items can influence the item representation in the graph, so that the item representation in the graph is updated through a graph neural network, the updated item representation is made to obtain an interest representation of the user through an attention mechanism, and recommendation is made according to the interest representation. The graph neural network is applied to the session recommendation to model the user interests, and compared with the sequence recommendation model which is represented in good terms at that time, the graph neural network has a better recommendation effect compared with other models, so that the graph neural network has strong capability of capturing the complex relationships among items in the user interaction sequence and has a large development space. As researchers have explored the use of graph neural networks in sequence recommendations, in recent years, hypergraph neural networks have been applied to sequence recommendations. The hypergraph neural network belongs to a graph neural network, but is different from a standard graph neural network in that the hypergraph neural network does not construct the relationship between the interaction sequences of the users into a standard graph structure any more, but into a hypergraph structure. The hypergraph differs from the normal graph in that there are two vertices on one edge of the normal graph, and there may be more than two vertices on one edge of the hypergraph, such an edge being called a hyperedge. By using the hypergraph, it is no longer possible to restrict to the paired relationship of two points on one side, and more complex relationships can be captured. The HyperRec model divides a complete interaction sequence of a user into a plurality of short-term sequences, constructs each short-term sequence into a hypergraph, utilizes hypergraph convolution to capture the correlation of articles in the short-term sequences, models the user interest of each short-term sequence by obtaining the dynamic representation of the articles, finally obtains the dynamic representation of the user interest of all the short-term sequences at the current moment by a self-attention mechanism, and performs next recommendation for the user by using the dynamic representation to obtain good recommendation effect, but sequence information in the user interaction sequence is not considered when constructing the hypergraph.

step 1: acquiring a sequence data set of user information, project information and user behaviors, preprocessing the data set, and dividing the data set into a training set and a test set; dividing the data set into training set and testing set according to 8:2 ratio

Step 1.1, the complete interactive sequence of the user is divided into a sequence capable of representing long-term preference and a session capable of representing current short-term interest, and because the long-term preference of the user is modeled, the complete interactive sequence can more accurately represent the long-term preference of the user, all the interactive sequences are used as sequences for capturing the long-term preference of the user, and aiming at the short-term interest of the modeled user, nearly five item interactions are used as short-term session sequences for constructing the hypergraph in order to prevent the influence effect of the data sparsity problem of the hypergraph session.

Further, the step 1 specifically includes the following steps:

constructing the hypergraph in the manner shown in fig. 3, and constructing the hypergraph by using the short-term session part of the whole sequence, wherein the step considers the mutual influence of similar sessions on the items in the respective sessions and also considers the sequence relation among the interactive items in the same session, namely the last interactive item has influence on the current interactive item.

Let G ═ V, E denote a hypergraph, where the set of V contains N vertices, each vertex representing an item, E contains M hyperedges, each hyperedge epsilon E represents a session and contains five vertices; each super-edge is assigned a forward weightW_εεRepresenting the weight occupied by each item in the super edge, and forming all weights into a diagonal matrix W epsilon R^M×M(ii) a The hypergraph is represented by a correlation matrix H ∈ R^N×MIf the hyper-edge ε E contains a vertex v_iE is then H _iε1, otherwise 0; for each vertex and hyper-edge, their degree is defined as

Definition D_hAnd B are both diagonal matrices; to utilize the sequence relation between the item interactions in the conversation, a degree matrix D in the sequence sense is defined_p∈R^N×NRegarding the former as influencing the latter through the context of the interactive items in each session; such as i₁、i₂Of (2), consider i₁Will influence i₂Then will be

Setting as 1;

standardizing for each item:

obtaining a final vertex degree matrix:

D＝2×(D_h+D_p)

step 2.1: obtaining an embedded representation of an item; generating an embedded representation of all items through an embedding matrix;

step 2.2: inputting the user long-term sequence L marked out in the step 1.1 and the project embedded representation into a GRU layer, and capturing the long-term interest theta of the user through a feedforward neural network layer_l；

Step 2.3: convolving the hypergraph G constructed in the step 1.2, and obtaining the short-term interest expression of the user by the convolution result and the short-term interaction sequence S of the user divided in the step 1.1 through a soft attention layerθ_s；

Further, step 2.3 performs hypergraph convolution, and the hypergraph convolution update item represents:

representing information aggregation from the vertex to the super edge, and then left multiplying H to aggregate information from the super edge to the vertex; after L-layer convolution, the average of all layer results is taken as the representation of the final item

α_t＝f^Tσ(W₁x_s+W₂x_t+b)

Further, the step 2.4 is specifically,

Wherein

Is to

Is rearranged in rows and columns to obtain negative samples, f_D(·):R^d×d→ R is a discriminator function that takes two vectors as input and scores the agreement of the two. The consistency is scored in a point-and-multiply mode; the learning goal of this part may be to maximize the mutual information between the user's long-term preferences modeled with different angles and the embedding of short-term interests.

Further, the step 2.5 is specific, since the step 2.4 effectively facilitates obtaining each other's information more accurately, but the complementary relationship between the two still needs to be fully developed; the way of obtaining the addition yields the final representation of the user:

Calculating the probability of each item appearing through softmax

Further, the step 3 specifically includes the following steps:

Preferably, the recommended loss function described in step 3.3 is specifically calculated as follows:

further, step 4 inputs the personal information and the interaction sequence of the user to be recommended into the self-supervision sequence recommendation model based on the long-term and short-term interests of the user trained in step 3, calculates the recommendation score of the item to be recommended relative to the user, and recommends the item to the user according to the recommendation score.

And 4, sequencing the item recommendation scores, and recommending the top k items with the highest scores to the user.

Claims

1. A self-supervision sequence recommendation method based on long-term and short-term interests of a user is characterized by comprising the following steps:

and 2, step: constructing a self-supervision sequence recommendation model based on long-term and short-term interests of a user;

and 3, step 3: training the self-supervision sequence recommendation model based on the long-term and short-term interests of the user in the step 2 by using a training set;

2. The self-supervision sequence recommendation method based on user long-term and short-term interests according to claim 1, wherein the step 1 specifically comprises the following steps:

let G ═ V, E denote a hypergraph, where the set of V contains N vertices, each vertex representing an item, E contains M hyperedges, each hyperedge epsilon E represents a session and contains five vertices; each super-edge is assigned a forward weight W_εεRepresenting the weight occupied by each item in the super edge, and forming all weights into a diagonal matrix W epsilon R^M×M(ii) a The hypergraph is represented by a correlation matrix H ∈ R^N×MIf the hyper-edge ε E contains a vertex v_iE is then H_iε1, otherwise 0;for each vertex and hyper-edge, their degree is defined as

standardizing for each item:

obtaining a final vertex degree matrix:

D＝2×(D_h+D_p)。

3. the method for recommending an unsupervised sequence based on long-term and short-term interests of a user according to claim 2, wherein the step 2 of constructing an unsupervised sequence recommendation model based on long-term and short-term interests of a user specifically comprises the following steps:

step 2.1: obtaining an embedded representation of an item;

Step 2.5: the long-term interest theta of the user obtained in the step 2.2_lAnd short term interest θ from step 2.3_sThrough the fusion layerTo the final user representation θ;

4. The method according to claim 1, wherein the step 2.2 is specifically to model the long-term preference of the user according to the long-term sequence divided in the step 1.1, capture the evolution in the long-term sequence by using a GRU network, and represent the last hidden unit state as the long-term preference of the user by θ_l。

5. The method for recommending an unsupervised sequence based on long-term and short-term interests of a user according to claim 2, wherein said step 2.3 is a hypergraph convolution, which updates the item representation:

representing the information aggregation from the vertex to the super edge, and then left multiplying H to aggregate the information from the super edge to the vertex; after L-layer convolution, the average of all layer results is taken as the representation of the final item

α_t＝f^Tσ(W₁x_s+W₂x_t+b)

6. The method for recommending an unsupervised sequence based on long-term and short-term interests of a user according to claim 2, wherein said step 2.4 is specifically,

Wherein

Is to

7. The method as claimed in claim 2, wherein the step 2.5 is to obtain the final list of the user in an additive manner, since the step 2.4 effectively facilitates obtaining each other's information more accuratelyThe following steps:

8. the method as claimed in claim 2, wherein the step 2.6 is to determine the score of each item by multiplying the final user representation obtained in the step 2.5 by the item set point

Calculating the probability of each item appearing through softmax

9. The method as claimed in claim 2, wherein the step 3 specifically comprises the following steps:

10. The method as claimed in claim 2, wherein the step 4 inputs the personal information and the interaction sequence of the user to be recommended into the self-supervised sequence recommendation model based on the long-term and short-term interests of the user trained in the step 3, calculates the recommendation score of the item to be recommended relative to the user, and recommends the item to the user according to the recommendation score.