CN112733027B

CN112733027B - Hybrid recommendation method based on local and global representation model joint learning

Info

Publication number: CN112733027B
Application number: CN202110026008.5A
Authority: CN
Inventors: 刘晓明; 吴少聪; 张占伟; 张兆晗; 沈超
Original assignee: Xian Jiaotong University
Current assignee: Xian Jiaotong University
Priority date: 2021-01-08
Filing date: 2021-01-08
Publication date: 2022-10-25
Anticipated expiration: 2041-01-08
Also published as: CN112733027A

Abstract

The invention discloses a hybrid recommendation algorithm based on local and global representation model joint learning, which comprises the following steps: adopting entity link to link the project with an entity in an external knowledge graph and obtain text description content and heterogeneous relation in the project as local information and global information respectively; constructing a knowledge complement embedded network as a local model; constructing a knowledge perception graph convolutional neural network as a global model; and integrating the representation information of the user and the project in the local embedding space and the global embedding space respectively, inputting the integrated result as the characteristic into the multilayer perceptron for prediction, outputting the required result, and finishing the joint learning process. The algorithm can capture the preference information of the user to the project at different angles, further effectively predict the click probability of the user, has high accuracy of the prediction result and small error, can be applied to various recommendation scenes, and has high practical value.

Description

Hybrid recommendation method based on local and global representation model joint learning

Technical Field

The invention belongs to the field of recommendation systems, and particularly relates to a hybrid recommendation method based on local and global representation model joint learning.

Background

With the development of internet technology, the total amount of data of online information rises sharply, and excessive information makes users unable to obtain the information really useful for themselves when facing a large amount of information, i.e. information overload.

Traditional collaborative filtering methods typically employ user-project-based historical interaction data to solve the problem, but these methods typically face data sparsity and cold start issues. In recent years, many methods solve the problems by combining social network information of users and attribute information of items as auxiliary information, but most of the methods face the problems of low data quality, poor modeling method effect, failure to effectively mine effective information in the social network information and the item information, and the like.

With the rapid development of semantic networks, knowledge graph data containing rich heterogeneous relationships provides an opportunity to know the relevant relations among users, projects and user-projects, and rules and mutual influences among nodes in a graph can be revealed. How to effectively model the preference information of the user through comprehensive research on utilizing rich heterogeneous data in the knowledge graph is a research focus for realizing personalized recommendation.

Disclosure of Invention

The invention aims to model user preference information in a recommendation environment and further predict the problem of the click probability of a user on an item, and provides a mixed recommendation method based on local and global representation model joint learning, which can comprehensively consider local information and global information related to the item and aggregate historical interaction information of the user to model the preference information of the user.

In order to achieve the purpose, the invention is realized by the following technical scheme:

a hybrid recommendation method based on joint learning of local and global representation models comprises the following steps:

step 1: by adopting entity link, the project is linked with an entity in an external knowledge graph, and text description content and heterogeneous relation in the project are acquired and respectively used as local information and global information;

and 2, step: constructing a knowledge complement embedded network as a local model according to the local information;

and step 3: constructing a knowledge perception graph convolutional neural network as a global model according to global information;

and 4, step 4: and (3) considering the historical interactive behavior of the user, simulating the local preference and the global preference of the user to the item under different models by combining an attention mechanism, integrating the representation information of the user and the item in local and global embedding spaces respectively, inputting the integrated result as a feature into a multi-layer perception machine for prediction, outputting a required result, and finishing the joint learning process.

Preferably, in step 1, the names of the items are used as queries and the type information of the items is used as constraint conditions, the items are linked with the entities in the external knowledge graph, and a heterogeneous relationship information graph between the text description information related to the items and the entities is obtained.

Preferably, obtaining a heterogeneous relationship information graph between the text description information related to the project and the entity includes:

step 1.1: the method comprises the steps that interactive information between a user and a project and basic information of the project are obtained in a centralized mode in project recommendation data;

step 1.2: respectively representing users in the data set, items in the data set, the number of the items and type information of the items by using a list;

step 1.3: taking the title of the project as query content, searching API by using an offline knowledge graph, and retrieving KB entity from the knowledge graph Freebase;

step 1.4: based on the linked entity, text description information is extracted from the KB, description information corresponding to the item is represented by a list, structure information is extracted, and representation is represented by a heterogeneous graph.

Preferably, the data comprises historical interaction data between the user and the item; the project information comprises a title and a category; the local information and the global information respectively comprise entity description information and heterogeneous relation information in the knowledge graph.

Preferably, in the step 2, a knowledge enhanced embedding algorithm is used for the text description content of the item to obtain a local embedded representation of the item; and calculating the weight value to represent the correlation degree between the two projects, constructing a historical interactive binary graph structure according to the interactive behavior information between the user and the projects, and calculating by combining the two to obtain the local embedded representation of the user.

Preferably, the obtaining of the locally embedded representation of the item using a knowledge enhanced embedding algorithm on the textual description of the item comprises:

step 2.1: aligning the title of the item with the text description information obtained in the step 1.4; obtaining each word and the initial embedded representation thereof;

step 2.2: calculating a title-embedded representation of the obtained title and text description using a knowledge enhanced embedding algorithm

And textual description sentence embedding representation

Finally, the embedded representations of the items are merged to obtain the local embedded representation of the items.

Preferably, a history interaction binary graph structure is constructed, and the user local embedded representation is obtained by combining the history interaction binary graph structure and the user local embedded representation through calculation, wherein the method comprises the following steps:

step 2.3: representing user u using lists based on locally embedded representations of items _i Setting a specific weight calculation mode, and respectively calculating weights between the historical interactive items and the candidate items;

step 2.4: and corresponding the weight to the historical interactive item to represent the contribution degree to modeling user preference information, and combining the weight with the local embedded representation to obtain a new embedded representation which indicates the local embedded representation of the user.

Preferably, the step 3, obtaining a global embedded representation of the item and the user by using a knowledge-graph convolutional neural network for the structural information in the knowledge-graph, includes:

step 3.1: combining the user, the project and the interaction information Y between the user and the project to add the user, the project and the interaction information Y into the abnormal graph H obtained in the step 1.4 to obtain a unified relational graph G;

step 3.2: using an embedded learning algorithm based on transfer to the relationship graph G constructed in the step 3.1 to obtain embedded representation of nodes and relationships in the relationship graph G;

step 3.3: taking a user node in the relational graph G as a central node, and taking a directly connected node as a neighbor node; aggregating the neighbor node characteristics in the relational graph G by adopting a knowledge-aware graph convolution neural network to obtain neighbor embedding representation and obtain new embedding by combining with the embedding information of the user as the user u _i Is embedded in the information.

Preferably, step 4 specifically comprises the following steps:

step 4.1: user u _i And candidate item v _i The local embedding expresses that the cosine similarity is calculated to obtain the local click probability p ^l ；

Step 4.2, user u _i And candidate item v _i The local embedding expresses that the cosine similarity is calculated to obtain the local click probability p ^g ；

Step 4.3: the local click probability p calculated in the steps 4.1 and 4.2 ^l And global click probability p ^g And aggregating through the weight calculated by the door mechanism to obtain the user u _i For candidate item v _i Establishing a corresponding target function, and training by using an optimizer to minimize a loss function;

step 4.4: and after one training is finished, updating the behavior and site node potential characteristics, and circulating the calculation processes in the steps 3, 4 and 5 to carry out the next training and result output.

The method solves the problem of cold start of the project in the recommendation environment by introducing the related auxiliary information of the project; according to the degree of closeness of contact with the project, the information related to the project is innovatively divided into local information and global information; in the aspect of information acquisition, the project is linked with the entity in the external knowledge graph by adopting an entity linking technology, and the text description content and the heterogeneous relation in the project are acquired as local information and global information respectively. According to the characteristics of different types of information, a knowledge complement embedded network is respectively designed as a local model and a knowledge perception graph convolutional neural network is respectively designed as a global model for mining different characteristics in the local model and the global model. In the joint learning process, the historical interaction behaviors of the user are considered firstly, and the local preference and the global preference of the user on the items are simulated under different models by combining an attention mechanism. And finally, aggregating different preference information as characteristics to be input into the multilayer perceptron for prediction, aggregating different information through a door mechanism, and calculating to obtain the click probability of the user to the item. And effective modeling of the actual preference information of the user is completed, and the personalized recommendation process of the user is realized.

Due to the adoption of the technical scheme, the invention has the following beneficial effects:

(1) According to the method, the information is classified, the project related information is divided into local information and global information based on the interactive object, and the preference information of the user can be mined from different angles by modeling different information;

(2) According to the method, projects are linked with entities in an external knowledge graph through an entity linking technology, then rich heterogeneous relationships among the entities are extracted, and further modeling is performed on preference information of a user from multiple angles by combining description information of the projects and historical interaction information between the user and the projects;

(3) According to the method, the local and global information of the user/project is mined by designing different deep learning architectures to obtain the local preference and the global preference of the user to the project, and the local preference and the global preference are aggregated to be input into the multi-layer perceptron as features to carry out co-learning and prediction of the user click probability, so that the prediction result has higher accuracy, smaller error and higher practical value.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the principles of the invention:

FIG. 1 is a flow diagram of a hybrid recommendation algorithm based on joint learning of local and global representation models;

FIG. 2 is a diagram illustrating the acquisition of local information and global information related to a project by an entity linking method;

FIG. 3 is a knowledge enhancement embedding method in conjunction with local information;

FIG. 4 is a method of aggregating historical interactive movie computations to obtain a user local embedded representation;

FIGS. 5 (a), 5 (b) show a unified relationship diagram and a knowledge-aware convolutional neural network, respectively;

fig. 6 (a) and 6 (b) are respectively effectiveness visualizations of the hybrid recommendation algorithm based on the local and global representation model joint learning under different parameters.

Detailed Description

The present invention will now be described in detail with reference to the drawings and specific embodiments, wherein the exemplary embodiments and descriptions of the present invention are provided to explain the present invention without limiting the invention thereto.

As shown in FIG. 1, the hybrid recommendation method based on the joint learning of the local and global representation models of the present invention includes the following steps:

step 1: by adopting entity link, the project is linked with the entity in the external knowledge map, and the text description content and the heterogeneous relation in the project are acquired as local information and global information respectively.

Through an entity linking process, the name of a project is used as query and information such as the type of the project is used as constraint conditions, the project is linked with entities in an external knowledge graph (KB), and text description information related to the project is obtained and used as local information and a heterogeneous relation information graph between the entities and used as global information.

The method specifically comprises the following steps:

step 1.1: and intensively acquiring interaction information between the user and the item and basic information of the item in the item recommendation data.

Step 1.2: using list U = { U = ₁ ,u ₂ ,…,u _M Denotes a user in the data set, where u _i E, representing the ith user in the list U by the U, wherein M is the number of the users; list V = { V = ₁ ,v ₂ ,…,v _N Denotes an item in the data set, where v _j The element V represents the jth commodity in the list V, and N is the quantity of the commodities; lists

A title representing item i, where n represents the title length; character string t _i Indicating the type information of the article i. Matrix Y ∈ R whose elements are 0,1 ^M×N Representing interaction data between a user and a good, where y _uv =1 indicates that there is an interaction between user u and item i, otherwise y _uv ＝0；

Step 1.3: title W of commodity i _i As query content, KB entities are retrieved from the knowledge-graph Freebase using the offline knowledge-graph search API.

For items that return more than one search entity as a result during the entity linking process, an item title W is further employed _i And type T _i Screening as constraint conditions to obtain accurate link entity e _i 。

Step 1.4: for linking entity e _i Extracting text description information of the knowledge graph as a project v _i Using lists of local information

Wherein m represents a description information length; extracting the heterogeneous relation information thereof as a project v _i Using an abnormal picture

Representation of a triplet (e) _h ,e _r ,e _t ) Indicating the presence of the de novo node e _h To tail node e _t Relation e of _r ξ and

representing a list of nodes and relationships, respectively.

The data in step 1.1 to step 1.4 comprises historical interaction data between the user and the project; the item information includes title, category, etc.; the local information and the global information respectively comprise entity description information, heterogeneous relation information in a knowledge graph and the like.

Step 2: and constructing a knowledge complementing embedded network as a local model according to the local information, and obtaining a local embedded representation of the item by using a knowledge enhancing embedding method for the text description content of the item. The method specifically comprises the following steps;

step 2.1: will item v _i Title W of _i And the text description information D obtained in step 1.4 _i Align, get each word therein using word2vec

And

initial embedded representation of

And

step 2.2: calculating a title-embedded representation of the obtained title and text description using a knowledge-enhanced embedding algorithm

And textual description sentence embedding representation

Finally, the embedded representations of the items are merged to obtain an item v _i Is locally embedded to represent

Wherein for candidate item v _j Its local embedding is expressed as

In step 2.2, the knowledge enhancement embedding algorithm is specifically realized as follows:

knowledge-enhanced embedding algorithm uses a convolutional neural network model to process a sentence s = { w = { a word sequence constitutes ₁ ,w ₂ …, using filters to construct the word embedded representation, then using max pooling to extract the maximum value in each dimension of the representation, resulting in new representation information as the embedded representation of the sentence

The process is defined as follows:

wherein W ₁ And h represent weight and size information of a filter in the convolutional neural network, respectively.

Title embedded representation

And textual description sentence embedding representation

The specific implementation of the post-merger computation local embedding representation of (1) is:

the merging process is represented by embedding the title into the representation

And textual description sentence embedding representation

Stitching as a deep neural network

Is calculated to obtain a knowledge-enhanced embedded representation

As item v _i Is locally embedded to represent

The process is defined as follows:

wherein

Indicating a splicing operation.

And calculating the weight value to represent the correlation degree between the two projects, constructing a historical interactive binary graph structure according to the interactive behavior information between the user and the projects, and calculating by combining the two to obtain the local embedded representation of the user.

Step 2.3: using the list R (u) based on the locally embedded representation of the item _i )＝{v ₁ ,v ₂ ,…,v _n Denotes user u _i And v _j Representing candidate items, wherein n is the number of historical interaction items. And setting a specific weight calculation mode, and calculating the weights between the historical interactive items and the candidate items respectively.

The weight calculation between the historical interactive item and the candidate item is specifically realized as follows:

the weight calculation method is that two layers of deep neural networks are used

The attention mechanism is formed by embedding historical interaction items into a representation

And candidate item embedding

As input, the weight values between them are calculated

Then, the weight value is normalized by using a softmax function to obtain

Where R (u) represents the historical set of interactive merchandise for user u, W ₁ And

b ₁ and b ₂ Respectively represent

Weight and bias in (1).

Step 2.4: corresponding the weight value to the historical interactive item to represent the contribution degree to the modeling user preference information, and combining the weight value with the local embedded representation to aggregate to obtain a new embedded representation

Indicating user u _i Is shown partially embedded.

The aggregation process of the historical items is specifically realized as follows:

the weight a obtained by combining the calculation of the step 3.1 in the polymerization process _kj And corresponding item-embedded representations

Computing a locally embedded representation of a user

Where n represents the historical interactive commodity quantity for user u.

And step 3: a knowledge graph convolutional neural network is used for obtaining global embedded representation of the items and the users for the structural information in the knowledge graph. The method specifically comprises the following steps;

step 3.1: and (4) combining the user, the item and the interaction information Y between the user and the item, and adding the user, the item and the interaction information Y into the abnormal graph H obtained in the step 1.4 to obtain a unified relation graph G.

The process of constructing the unified relationship graph G is specifically realized as follows:

the construction process adds the users, commodities and the mutual information among the users and the commodities into the heterogeneous knowledge graph

Obtaining a unified relationship diagram

Where ξ' = { ξ utou } indicates that the user is added to the graph as a new node,

showing the interaction between the user and the goods as a new relationship to the graph.

Step 3.2: using embedded learning algorithm based on translation to obtain embedded representation of node and relation in graph data G constructed in step 3.1

And

for item v _i By using

Representing its global embedded information. For user node u _i The panel is embeddedThe in representation is its initial embedded representation

The embedded learning algorithm based on the transfer is specifically realized as follows:

the translation-based embedded learning algorithm performs embedded representation learning by optimizing translation rules:

wherein

And

respectively representing the embedded representation of nodes h, t and the relation r in the graph, M _r A mapping matrix is represented that is,

to represent

A mapped representation in the space of the relation r. For a triplet (h, r, t), its confidence score is defined as:

g _r a lower score of (h, t) indicates that (h, r, t) is more likely to be a valid triplet. Then, the boundary-based ordering loss function is used for training, and the representation information of the nodes and the relations in the graph is obtained through learning:

wherein gamma represents the boundary of the image,

step 3.3: with user node u in graph G _i As a central node, taking a directly connected node as its neighbor node set N (u) _i ). Neighbor node characteristics in the graph are aggregated by adopting a knowledge-aware graph convolution neural network, and a user u is combined _i Initial embedded representation

Calculating to obtain new embedding as its global embedding information

And (4) showing.

The knowledge-aware graph convolution neural network and the process of acquiring the global embedded representation are specifically realized as follows:

knowledge-aware graph-convolutional neural networks are used to aggregate embedded representations in a non-linear fashion, using localized convolution operations to merge ego-the embedding of neighbor nodes in the network that are directly connected to the target node h, as defined below:

where σ denotes the nonlinear activation function, N (h) denotes a set π consisting of neighboring nodes directly connected to node h _ht The normalized aggregation coefficient between the node h and the node t is represented, different importance of different neighbor nodes to the target node is reflected, and the normalized aggregation coefficient is defined as follows:

simultaneous user node u _i By incorporating neighbor embedding

And the initial embedded representation of itself

Calculated, defined as follows:

wherein W ₆ And LeakyRelu represent weights and activation functions in the neural network, respectively.

And 4, step 4: in the joint learning process, the historical interactive behaviors of the user are considered, the local preference and the global preference of the user to the item are simulated under different models by combining an attention mechanism, the representation information of the user and the item in local and global embedding spaces is integrated respectively, the integrated result is used as a feature and input into a multi-layer perception machine for prediction, and a required result is output. The method specifically comprises the following steps:

step 4.1: user u _i And candidate item v _j The local embedding expresses that the cosine similarity is calculated to obtain the local click probability p ^l ；

Step 4.2: user u _i And candidate item v _j The global embedding expresses that the cosine similarity is calculated to obtain the local click probability p ^g ；

Step 4.3: for the local click probabilities p calculated in steps 4.1 and 4.2 ^l And global click probability p ^g Designing a door mechanism to calculate to obtain a weight coefficient theta between the door mechanism and the door mechanism, and finally performing polymerization calculation to obtain a user u _i For candidate item v _j The click probability Pr of. Establishing a corresponding objective function, and training by using a proper optimizer to minimize a loss function;

the design of the prediction process and the loss function of the click probability is specifically realized as follows:

local click probability in combination with user u _i And candidate item v _j Is calculated as follows:

global click probability in conjunction with user u _i And candidate item v _j Is calculated as follows:

the predicted click probability Pr is calculated through a door mechanism to obtain a weight coefficient lambda between the predicted click probability Pr and the predicted click probability is finally calculated in a polymerization mode, and the predicted click probability Pr is defined as follows:

Pr(u _i ,v _j )＝λp ^l (u _i ,v _j )+(1-λ)p ^g (u _i ,v _j )

where σ is sigmoid function, W _gated A weight matrix of a gate mechanism, representing a vector dot product.

The training process utilizes a cross entropy function as a loss function and adopts mini-batch Adam for optimization, and the definition is as follows:

wherein R is _ij For elements in the interaction matrix R, user u is represented _i And candidate goods v _j Whether there is an interaction between them.

The invention is further illustrated by the following specific examples.

In the embodiment, the recommendation system data set is MovieLens-20M, and a source of the movie evaluation related data sethttps://grouplens.org/datasets/movielensAfter data preprocessing, 9861984 pieces of evaluation data are obtained, which comprise 61859 different users and 17488 different movies, and 5-star evaluation is adopted. Setting the rating threshold to 3, and setting the rating to be greater than or equal to 3 is regarded as that the preference of the user for the movie is positive, and negative. The used knowledge graph data is the Freebase data of the latest published version, and the triples are used<The head, relative, tail forms store the relevant facts. Referring to fig. 2, a movie item "a movie" in a data set is linked with an entity in an external knowledge graph through an entity linking technology, so as to obtain a related text description and heterogeneous relationship graph information thereof, which are respectively used as local and global information thereof, and the process is as follows:

step 1.2: use list U = { U = { (U) } ₁ ,u ₂ ,…,u _M Denotes a user in the data set, where u _i E, representing the ith user in the list U by the U, wherein M is the number of the users; list V = { V = ₁ ,v ₂ ,…,v _N Denotes an item in the dataset, where v _i And e.g. V represents the ith item in the list V, and N is the number of items. Taking FIG. 2 as an example, item v _i For a movie ", the titles are mapped and presented as a list

Wherein n represents the title length with fixed size, fixed as 10 in this example, and filled with less than 0, wherein each element represents the index value corresponding to each word after the word segmentation of the movie title; character string t _i Representing a movie v _i The type information of (1) is love photos. Matrix Y ∈ R whose elements are 0,1 ^M×N Representing interaction data between a user and a good, where y _ij =1 denotes user u _i And commodity v _j There is an interactive action between them, otherwise y _ij ＝0；

Step 1.3: using the movie title as a query, and using the entity linking technology to link the movie v _i Linking with entities in the knowledge-graph to obtain an item-entity pair < a movie,/m/02 v7x2 >, wherein/m/02 v7x2 represents the id of the entity in the knowledge-graph. Then, the local text description information "a certain movie" is … in 2007 "as shown in fig. 2 is acquired, and the local information is mapped and represented as a list:

wherein m represents the description information length, fixed as 20, and filled with less than 0; and extracting the triple information of the entity [ < http:// rdf. Freebase. Com/ns/m.02v7x2 >, < http:// rdf. Freebase. Com/ns/film.film.cinematography >, < http:// rdf. Freebase. Com/ns/m.0jsqw2 > ]]Form a movie v _i And the heterogeneous relationship graph which is the central node is used as global information.

And 2, step: as shown in fig. 3, a knowledge-enhanced embedding algorithm is used on the movie text description content to obtain a locally embedded representation of the item.

Step 2.1: will movie v _i Title W of _i And the local information D obtained in step 1 _i Align, get each word w therein using word2vec _ij ∈W _i And d _ij ∈D _i Initial embedded representation of

And

a dimension size of 50;

And textual description sentence embedding representation

Finally, the embedded representations are combined to obtain the local embedded representation of the film

The dimension size is 50.

According to the graph shown in FIG. 4, a historical interactive binary graph structure is constructed according to the interactive behavior information between the user and the movie, an attention network is designed, weight information is calculated according to the correlation degree of the historical interactive movie and the candidate movie, and finally the historical interactive movie is subjected to aggregation calculation to obtain the local embedded representation of the user;

step 2.3: using the list R (u) based on the locally embedded representation of the movie _i )＝{v ₁ ,v ₂ ,…,v _n Denotes user u _i And v _j Representing candidate items, wherein n is the fixed number of history interactive items 10. Calculating the weight between the historical interactive item and the candidate item

The dimension size is 10.

Step 2.4: corresponding the weight value to the historical interactive item to represent the contribution degree to the modeling user preference information, and combining the weight value and the historical interactive item to represent and aggregate the contribution degree to the modeling user preference information to obtain a user u _i Is locally embedded to represent

The dimension size is 50.

And 3, step 3: a knowledge graph convolutional neural network is used for obtaining global embedded representation of the items and the users for the structural information in the knowledge graph. The method specifically comprises the following steps;

step 3.1: as shown in FIG. 5 (a), the user and the item, and the interaction information Y between them are combined and added to the abnormal image H obtained in step 1.4 to obtain the unified relationship diagram

Where ξ' = { ξ ≡ U } indicates that the user is added to the graph as a new node,

Step 3.2: using an embedded learning algorithm based on translation for the graph data G constructed in the step 4.1 to obtain embedded representation of nodes and relations in the graph data G

Where t ∈ ξ' and

for movie v _i ，

Indicating its global embedded information.

Step 3.3: as shown in FIG. 5 (b), with user node u in graph G _i And taking the directly connected node as a neighbor node of the node as a central node. Aggregating neighbor node characteristics in the graph G by adopting a knowledge-aware graph convolution neural network to obtain neighbor embedded representation

Embedding information in connection with a user

Get a new embedding as user u _i Global embedded information of

And 4, step 4: in the process of joint learning, the representation information of the user and the item in the local embedding space and the global embedding space is integrated respectively, the integrated result is used as the characteristic to be input into the multilayer perceptron to be predicted, and the required result is output. The method specifically comprises the following steps:

step 4.1: user u _i And candidate item v _i Is obtained by splicing

The dimension size is 100. Inputting the local click probability p as a characteristic into a multilayer perceptron ^l (u _i ,v _j )＝0.6157。

Step 4.2 user u _i And candidate item v _i Is used for splicing to obtain

The dimension size is 100. Inputting the global click probability p as a feature into a multi-layer perceptron ^g (u _i ,v _j )＝0.8345。

Step 4.3: the click probability p calculated in the steps 4.1 and 4.2 ^l (u _i ,v _j ) And p ^g (u _i ,v _j ) And the weight lambda is calculated through a door mechanism to carry out aggregation to obtain a user u _i For candidate item v _i Click probability Pr (u) _i ,v _j )＝λp ^l (u _i ,v _j )+(1-λ)p ^g (u _i ,v _j ) =0.8793. Establishing a corresponding target function, training by using an Adam optimizer to minimize a loss function, and setting the learning rate to be 0.001;

As shown in fig. 6 (a) and 6 (b), the experimental results of this example are as follows:

the Accuracy (AUC) of the test set stabilized at 0.9218. In addition, mean absolute error value (MAE) stabilized at 0.1965, root Mean Square Error (RMSE) stabilized at 0.3361, and F1-score stabilized at 0.8836.

The experimental result shows that the mixed recommendation algorithm based on the joint learning of the local and global representation models can effectively mine useful characteristics in local information and global information related to the project, further can accurately model preference information of the user, and has the advantages of high accuracy of the prediction result, small error and high practical value.

The present invention is not limited to the above-mentioned embodiments, and based on the technical solutions disclosed in the present invention, those skilled in the art can make some substitutions and modifications to some technical features without creative efforts according to the disclosed technical contents, and these substitutions and modifications are all within the protection scope of the present invention.

Claims

1. A hybrid recommendation method based on joint learning of local and global representation models is characterized by comprising the following steps:

step 1: by adopting entity link, the project is linked with the entity in the external knowledge map, and text description information related to the project is obtained and used as local information and an information map of heterogeneous relationship between the entities and used as global information;

obtaining a local embedded representation of the item by using a knowledge enhanced embedding algorithm on the textual description content of the item; calculating the weight value to represent the correlation degree between two projects, constructing a historical interactive binary graph structure according to the interactive behavior information between the user and the projects, and calculating by combining the two to obtain the local embedded representation of the user;

and 3, step 3: constructing a knowledge perception graph convolutional neural network as a global model according to global information;

and 4, step 4: considering the historical interactive behavior of the user, simulating the local preference and the global preference of the user to the item under different models by combining an attention mechanism, respectively integrating the representation information of the user and the item in local and global embedding spaces, inputting the integrated result as a characteristic into a multi-layer perceptron for prediction, outputting a required result, and finishing a joint learning process;

the step 4 specifically comprises the following steps:

Step 4.2, user u _i And candidate item v _i The global embedding expresses that the cosine similarity is calculated to obtain the global click probability p ^g ；

Step 4.3: the local click probability p calculated in the steps 4.1 and 4.2 ^l And global click probability p ^g And the weight is calculated through a door mechanism to carry out aggregation to obtain a user u _i For candidate item v _i Establishing a corresponding target function according to the click probability, and training by using an optimizer to minimize a loss function;

step 4.4: after one-time training is finished, the potential characteristics of the behavior and place nodes are updated;

and (5) circulating the calculation processes in the steps 4.3 and 4.4 to carry out next training and result output.

2. The local and global representation model joint learning-based hybrid recommendation method according to claim 1, wherein in the step 1, the names of the items are used as queries and the type information of the items is used as constraints to link the items with the entities in the external knowledge graph, so as to obtain a textual description information related to the items and an information graph of heterogeneous relationships between the entities.

3. The hybrid recommendation method based on the local and global representation model joint learning of claim 2, wherein obtaining the textual description information related to the item and the heterogeneous relationship information graph between the entities comprises:

4. The hybrid recommendation method based on joint learning of local and global representation models according to claim 3, characterized in that said data comprises historical interaction data between users and items; the project information comprises a title and a category; the local information and the global information respectively comprise entity description information and heterogeneous relation information in the knowledge graph.

5. The hybrid recommendation method based on joint learning of local and global representation models according to claim 1, wherein obtaining the locally embedded representation of the item using knowledge enhanced embedding algorithm for the item text description content comprises:

step 2.1: aligning the title of the item with the text description information obtained in step 1.4; obtaining each word and the initial embedded representation thereof;

And textual description sentence embedding representation

6. The hybrid recommendation method based on the local and global representation model joint learning of claim 1, wherein a historical interactive binary graph structure is constructed, and a user local embedded representation is obtained by combining the historical interactive binary graph structure and the global representation model joint learning, and the method comprises the following steps:

step 2.3: representing user u using lists based on locally embedded representations of items _i Setting a specific weight calculation mode for the historical interactive items and the representing candidate items, and respectively calculating the weights between the historical interactive items and the candidate items;

step 2.4: and corresponding the weight to the historical interactive item to represent the contribution degree to modeling user preference information, and combining the weight with the local embedded representation to aggregate to obtain a new embedded representation which indicates the local embedded representation of the user.

7. The hybrid recommendation method based on joint learning of local and global representation models according to claim 1, wherein the step 3, obtaining global embedded representations of items and users by using knowledge-graph convolutional neural network for structural information in knowledge-graph, comprises:

step 3.3: taking a user node in the relational graph G as a central node, and taking a directly connected node as a neighbor node; neighbor node characteristics in the relational graph G are aggregated by adopting a knowledge-aware graph convolution neural network to obtain neighbor embedding expression, and new embedding is obtained by combining with embedding information of a user and is used as the user u _i Is embedded in the information.