CN112069399B - Personalized search system based on interaction matching - Google Patents

Personalized search system based on interaction matching

Info

Publication number
CN112069399B
CN112069399B (application CN202010861245.9A)
Authority
CN
China
Prior art keywords
matching
vector
document
user
personalized
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010861245.9A
Other languages
Chinese (zh)
Other versions
CN112069399A (en)
Inventor
窦志成 (Dou Zhicheng)
邴庆禹 (Bing Qingyu)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Renmin University of China
Original Assignee
Renmin University of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Renmin University of China
Priority to CN202010861245.9A
Publication of CN112069399A
Application granted
Publication of CN112069399B


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/284 Lexical analysis, e.g. tokenisation or collocates
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention realizes a personalized search system based on interaction matching using methods from the field of artificial intelligence. The system comprises an input module, a personalized search module based on interaction matching, and an output module. The personalized search module operates in four steps: bottom-level matching modeling of the user's search history, attention weight calculation, generation of the user interest matching vector, and personalized re-ranking. The model interactively matches the user's historical queries with candidate documents at the word level, an attention mechanism reduces the influence of irrelevant information in the search history, and a convolutional neural network fuses the weighted matching signals to generate the final interest matching vector of each document, yielding a more accurate interest matching score. This addresses two problems of existing vector-representation methods: the quality of the ranking result depends heavily on the quality of the vector-construction model, and the vector-construction process may discard useful information.

Description

Personalized search system based on interaction matching
Technical Field
The invention relates to the field of artificial intelligence, in particular to a personalized search system based on interactive matching.
Background
Personalizing search with user history information has proven effective in improving the quality of search rankings. A personalized search algorithm first models the user's interests from the user's historical behavior and other information; when computing the matching score, it considers not only the relevance between the query and the document but also the degree to which the document matches the user's interests, so that a result list tailored to each user's needs can be produced. The user interest model can be built from various information sources, such as the user's location, retrieval patterns, browsing history, and search history; most current personalized search algorithms model interests from the user's historical browsing and search behavior. In recent years, researchers have introduced deep learning into personalized ranking models, strengthening the models' semantic understanding of text and achieving good results in the personalized re-ranking of search results. Ranking algorithms that use deep learning can be categorized into representation-based matching and interaction-based matching. In representation-based ranking algorithms, semantic vector representations of the query and the document are learned separately and then matched against each other. Interaction-based matching algorithms instead let the query and the document interact in advance at the finer-grained word level, capture more complete matching signals, and then combine those signals into a matching score. Almost all existing personalized search algorithms compute an interest representation vector for the user and then interact it with the representation vector of each candidate document to obtain a personalized matching score, i.e., they follow the representation-based matching idea.
Most existing personalized ranking algorithms directly compute a user interest representation vector in various ways from the user's historical behavior, and then interact it with the representation vectors of candidate documents to obtain personalized matching scores. This approach acquires the matching signal between the document and the user's interests at the granularity of the whole document: it converts the document to be matched and the user's interests into representation vectors and then matches the vectors, focusing on the construction of the representation layer. Under such vector-representation methods, the quality of the ranking result depends to a great extent on the quality of the vector-construction model, and the vector-construction process may ignore useful information, such as word-level textual and interaction information between the query and the document, which degrades the personalized ranking result.
Disclosure of Invention
Therefore, the invention provides a personalized search system based on interaction matching, which comprises an input module, a personalized search module based on interaction matching, and an output module;
the input module reads the user's query history and the candidate documents and feeds them, in a standardized format, into the personalized search module based on interaction matching;
the operation of the personalized search module based on interaction matching is divided into four steps:
step one: bottom-level matching modeling of the user search history, in which a bottom-level matching model is built from the user's historical search information and the user's historical queries interact with the candidate document word by word to obtain fine-grained bottom-level matching signals;
step two: attention weight calculation, in which an attention mechanism is introduced and the corresponding matching signals are weighted according to the contribution of different query records in the user's search history to the current query;
step three: user interest matching vector generation, in which a convolutional neural network extracts features from the weighted matching signals to generate the final matching vector between the document and the user's interests;
step four: personalized re-ranking, in which the personalized score of each candidate document is computed from the user interest matching vector obtained in step three, the relevance score of the candidate document is computed from click feature vectors, and the sum of the two scores serves as the final document matching score for personalized re-ranking;
the output module outputs the document matching scores and the personalized re-ranking result.
The bottom-level matching modeling step of the user search history is implemented as follows: define the user's historical query list as $\{q_1, q_2, q_3, \ldots, q_n\}$ (where $n \geq 3$ is an integer) and the current candidate document as $d$. For each historical query–candidate document pair $\langle q_i, d \rangle$, both texts are first mapped to word vectors represented by a word2vec model: $q_i$ is processed into a set of word vectors $\{qw_1, qw_2, qw_3, \ldots, qw_x\}$ and $d$ into $\{dw_1, dw_2, dw_3, \ldots, dw_y\}$. Every vector in one set interacts pairwise with every vector in the other to obtain the word matching matrix $T$ of $\langle q_i, d \rangle$, each element of which is:

$$T_{i,j} = \cos(qw_i, dw_j)$$

where $T_{i,j}$ is the element in row $i$, column $j$ of $T$, $qw_i$ is the word vector of the $i$-th word in the historical query, and $dw_j$ is the word vector of the $j$-th word in the candidate document (with $1 \leq i \leq x$, $1 \leq j \leq y$, and $i, j, x, y$ integers); the matching value is computed by the cosine function. In the K-NRM model, $K$ RBF kernels are applied to each row of the matching matrix to obtain a $K$-dimensional feature vector

$$\vec{K}(T_i) = \left[K_1(T_i), K_2(T_i), \ldots, K_K(T_i)\right]$$

The formula for each RBF kernel is:

$$K_k(T_i) = \sum_{j=1}^{y} \exp\left(-\frac{(T_{i,j} - \mu_k)^2}{2\sigma_k^2}\right)$$

where $K_k(T_i)$ is the value of the $k$-th RBF kernel applied to the $i$-th row of the matching matrix $T$, with values ranging from 0 to $y$; $\mu_k$ and $\sigma_k$ are hyperparameters, with $\mu$ taking uniformly spaced values from −1 to 1. The logarithms of the feature vectors of all rows of the matching matrix are then summed to give the final bottom-level matching result of historical query $q_i$ and the candidate document:

$$v_i = \sum_{r=1}^{x} \log \vec{K}(T_r)$$

The bottom-level matching vectors computed from the user's historical search information are denoted $\{v_1, v_2, v_3, \ldots, v_n\}$; the fine-grained matching vector $v$ of the current query and the candidate document is computed in the same way.
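To make this step concrete, here is a minimal numpy sketch of building the word matching matrix T from word2vec embeddings; the `w2v` lookup table, the 50-dimensional toy vocabulary, and the example words are illustrative assumptions, not details fixed by the patent.

```python
import numpy as np

def matching_matrix(query_words, doc_words, w2v):
    """Word matching matrix T for one <q_i, d> pair: T[i, j] = cos(qw_i, dw_j).

    w2v is assumed to map a token to its embedding (e.g. a dict or a gensim
    KeyedVectors object); any word2vec-style table works here.
    """
    Q = np.stack([w2v[w] for w in query_words])        # (x, dim) query word vectors
    D = np.stack([w2v[w] for w in doc_words])          # (y, dim) document word vectors
    Qn = Q / np.linalg.norm(Q, axis=1, keepdims=True)  # unit-normalise rows so the
    Dn = D / np.linalg.norm(D, axis=1, keepdims=True)  # dot product equals cosine
    return Qn @ Dn.T                                   # (x, y), entries in [-1, 1]

# Toy usage with a random embedding table standing in for a trained word2vec model
rng = np.random.default_rng(0)
vocab = {w: rng.normal(size=50) for w in ["deep", "learning", "search", "ranking"]}
T = matching_matrix(["deep", "learning"], ["search", "ranking", "deep"], vocab)
print(T.shape)  # (2, 3)
```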
The attention weight calculation step is implemented as follows: using the fine-grained matching vector $v$ of the current query $q$ and candidate document $d$, an attention weight is computed for the bottom-level matching vector of each historical query record:

$$e_i = g(v, v_i)$$

$$\alpha_i = \frac{\exp(e_i)}{\sum_{j=1}^{n} \exp(e_j)}$$

where $g$ is a multi-layer perceptron with tanh as its activation function, and $\alpha_i$ is the weight the attention layer assigns to the bottom-level matching vector $v_i$. The weighted bottom-level matching vector is:

$$V_i = \alpha_i v_i$$

The weighted fine-grained matching vectors corresponding to the user's historical queries are $\{V_1, V_2, V_3, \ldots, V_n\}$.
The user interest matching vector generation step is implemented as follows: the weighted fine-grained matching vectors $\{V_1, V_2, V_3, \ldots, V_n\}$ are concatenated column by column into a matching feature matrix $M = [V_1, V_2, V_3, \ldots, V_n] \in \mathbb{R}^{K \times n}$, and $M$ is convolved with 100 convolution kernels to obtain a three-dimensional tensor $A \in \mathbb{R}^{100 \times (K-2) \times (n-2)}$, each element of which is:

$$A_{t,i,j} = \mathrm{ReLU}\left(f_t \odot M_{i-1:i+1,\, j-1:j+1} + b_t\right)$$

where $t$ is an integer from 1 to 100, $b_t$ is the $t$-th element of the bias vector $b \in \mathbb{R}^{100}$, $f_t$ is the $t$-th $3 \times 3$ convolution kernel, $M_{i-1:i+1,\, j-1:j+1}$ is the submatrix of $M$ spanning rows $i-1$ to $i+1$ and columns $j-1$ to $j+1$, and $\odot$ denotes multiplying the elements at corresponding positions of two matrices and summing all the products. The convolution layer uses ReLU as its activation function. After the convolution layer, max pooling is applied to the second and third dimensions of the three-dimensional tensor $A$ in the pooling layer to obtain a 100-dimensional vector $I$, where $I_t$ is the $t$-th element of $I$:

$$I_t = \max_{i,j} A_{t,i,j}$$

The output vector $I$ is the final user interest matching vector.
The convolution kernels are 3×3 in size, and each user has at least 3 historical search records.
The personalized re-ranking step is implemented as follows: the matching score score(d|I) between the candidate document and the user's interests is obtained by feeding the interest matching vector I through a multi-layer perceptron; the relevance score score(d|q) between the candidate document and the current query is computed by a multi-layer perceptron from three click features: the number of clicks, the original click position, and the click entropy. The final score of the candidate document is the sum of the interest matching score score(d|I) and the relevance score score(d|q), and the final personalized ranking result is obtained by re-ordering the original document list by this score.
In computing the relevance score of candidate documents against the current query, the score is trained with the LambdaRank algorithm: the clicked document is taken as a relevant document sample and the remaining documents as irrelevant samples, and one relevant document $d_i$ and one irrelevant document $d_j$ are paired to compute the loss. The loss function also introduces, as a corresponding weight, the degree to which swapping the order of the document pair affects the evaluation metric MAP: pairs with a larger difference (a larger MAP change after swapping) receive a larger weight. The loss function is the cross entropy between the actual probability and the predicted probability multiplied by the change in the MAP evaluation metric:

$$\mathcal{L} = -\Delta \left( \bar{p}_{ij} \log p_{ij} + \bar{p}_{ji} \log p_{ji} \right)$$

where $\Delta$ is the change in the MAP evaluation metric after swapping the positions of document $d_i$ and document $d_j$, $\bar{p}_{ij}$ denotes the actual probability that document $d_i$ is more relevant than document $d_j$, and $p_{ij}$ denotes the predicted probability, computed as:

$$p_{ij} = \frac{1}{1 + e^{-\left(\mathrm{score}(d_i) - \mathrm{score}(d_j)\right)}}$$
the invention has the technical effects that:
(1) The method introduces a model idea based on interactive matching, does not convert the text into a unique integral expression vector, and interacts the historical query of the user with the candidate document at the word level to obtain a more accurate and complete matching signal.
(2) The attention mechanism is introduced, and the corresponding matching signals are weighted according to the contribution degree of different historical queries to the current matching, so that the influence of irrelevant information in the search history is reduced.
(3) The weighted matching signals are subjected to feature extraction by using a convolutional neural network to generate final interest matching vectors of the document, so that more accurate interest matching scores are obtained.
Drawings
FIG. 1 is a framework of an interactive matching based personalized search module;
Detailed Description
A preferred embodiment of the present invention and its technical solution are further described below with reference to the accompanying drawings, but the present invention is not limited to this embodiment.
In order to achieve the above object, the present invention provides a personalized search system based on interactive matching.
The system comprises an input module, a personalized search module based on interaction matching, and an output module. The input module reads the user's query history and the candidate documents and feeds them, in a standardized format, into the personalized search module based on interaction matching; the output module outputs the document matching scores and the personalized re-ranking result.
The personalized search module based on interaction matching processes the bottom-level matching signals with a convolutional neural network to obtain the final interest matching result for each candidate document.
The personalized search module based on interaction matching considers the word-level matching signals between the historical queries in the user's behavior history and the candidate documents. Given the user's historical query list $\{q_1, q_2, q_3, \ldots, q_n\}$ and the current candidate document $d$, the user's search log is first processed by the interaction-based K-NRM model to obtain, for each historical query $q_i$, a fine-grained matching vector $v_i$ with the candidate document $d$ (where $1 \leq i \leq n$), together with the fine-grained matching vector $v$ of the current query $q$ and the candidate document $d$. Then, considering that user interests change dynamically and that user queries are sometimes incidental, different queries in the user's search history contribute differently to the current query. According to each historical query's contribution to the current query, a multi-layer perceptron weights the matching vectors $\{v_1, v_2, v_3, \ldots, v_n\}$ produced by the K-NRM model, giving the weighted matching vector list $\{V_1, V_2, V_3, \ldots, V_n\}$. A convolutional neural network then processes these vectors to obtain the matching vector between the candidate document and the user's interests. Finally, the interest matching score and the relevance score of the current candidate document are computed from the interest matching vector and the click feature vector respectively, and summed to give the final document matching score:

$$\mathrm{score}(d) = \mathrm{score}(d|I) + \mathrm{score}(d|q)$$

where score(d|I) is the matching score between the current candidate document and the user's search interests, and score(d|q) is the relevance score between the current candidate document and the current query.
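As a sketch of how the two scores could be combined, the following assumes small multi-layer perceptron heads; the hidden-layer widths and the class name `ScoringHead` are illustrative, since the patent only states that both scores come from multi-layer perceptrons over the stated inputs.

```python
import torch
import torch.nn as nn

class ScoringHead(nn.Module):
    """score(d) = score(d|I) + score(d|q), each score from a small MLP.

    Hidden sizes are assumptions; the patent fixes only the inputs: the
    100-dim interest matching vector I and three click features (click
    count, original click position, click entropy).
    """
    def __init__(self, interest_dim: int = 100, click_dim: int = 3):
        super().__init__()
        self.interest_mlp = nn.Sequential(   # produces score(d|I)
            nn.Linear(interest_dim, 32), nn.Tanh(), nn.Linear(32, 1))
        self.click_mlp = nn.Sequential(      # produces score(d|q)
            nn.Linear(click_dim, 8), nn.Tanh(), nn.Linear(8, 1))

    def forward(self, interest_vec, click_feats):
        return self.interest_mlp(interest_vec) + self.click_mlp(click_feats)

# Toy usage: one candidate document
score = ScoringHead()(torch.randn(1, 100), torch.randn(1, 3))
print(score.shape)  # torch.Size([1, 1])
```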
The framework of the personalized search module based on interaction matching is shown in FIG. 1 and is divided into the following four parts according to the processing flow:
Step one: bottom-level matching modeling of the user search history. A bottom-level matching model is built from the user's historical search information, and the user's historical queries interact with the candidate document word by word to obtain fine-grained bottom-level matching signals.
Step two: attention weight calculation. An attention mechanism is introduced, and the corresponding matching signals are weighted according to the contribution of different query records in the user's search history to the current query.
Step three: user interest matching vector generation. A convolutional neural network extracts features from the weighted matching signals to generate the final matching vector between the document and the user's interests.
Step four: personalized re-ranking. The personalized score of each candidate document is computed from the obtained interest matching vector, the relevance score is computed from the click feature vector, and the sum of the two serves as the final document matching score for personalized re-ranking.
The bottom layer matching modeling step of the user search history:
the user's search history can provide rich information for the acquisition of the user's search interests. In the past, most algorithms model the interests of a user based on the historical behavior information of the user to obtain an interest vector representing the search preference of the user, and then the interest vector is interacted with the document vector. The method comprises the steps of adopting a K-NRM framework, establishing a bottom layer matching model by utilizing historical search information of each user U, and carrying out interactive matching on each historical query in the historical search of the user with candidate documents at the bottom layer.
The user's historical query list is $\{q_1, q_2, q_3, \ldots, q_n\}$ and the current candidate document is $d$. For each historical query–candidate document pair $\langle q_i, d \rangle$, both texts are first mapped to word vectors represented by a word2vec model: $q_i$ is processed into a set of word vectors $\{qw_1, qw_2, qw_3, \ldots, qw_x\}$ and $d$ into $\{dw_1, dw_2, dw_3, \ldots, dw_y\}$. Every vector in one set interacts pairwise with every vector in the other to obtain the word matching matrix $T$ of $\langle q_i, d \rangle$. Each element of the matching matrix $T$ is given by:

$$T_{i,j} = \cos(qw_i, dw_j)$$

where $T_{i,j}$ is the element in row $i$, column $j$ of $T$, $qw_i$ is the word vector of the $i$-th word in the historical query, and $dw_j$ is the word vector of the $j$-th word in the candidate document (with $1 \leq i \leq x$, $1 \leq j \leq y$); the matching value is computed by the cosine function.

From the above, the $i$-th row of the matching matrix represents the matching signals between the $i$-th word of the historical query and the candidate document. In the K-NRM model, $K$ RBF kernels are applied to each row of the matching matrix to obtain a $K$-dimensional feature vector

$$\vec{K}(T_i) = \left[K_1(T_i), K_2(T_i), \ldots, K_K(T_i)\right]$$

The formula for each RBF kernel is:

$$K_k(T_i) = \sum_{j=1}^{y} \exp\left(-\frac{(T_{i,j} - \mu_k)^2}{2\sigma_k^2}\right)$$

where $K_k(T_i)$ is the value of the $k$-th RBF kernel applied to the $i$-th row of the matching matrix $T$, with values ranging from 0 to $y$; $\mu_k$ and $\sigma_k$ are hyperparameters. In the K-NRM model used here, $\mu$ takes uniformly spaced values from −1 to 1, because the cosine similarity of vectors lies between −1 and 1. The logarithms of the feature vectors of all rows of the matching matrix are then summed to give the final bottom-level matching result of historical query $q_i$ and the candidate document:

$$v_i = \sum_{r=1}^{x} \log \vec{K}(T_r)$$

Each historical query $q_i$ thus has a $K$-dimensional matching vector with the current candidate document: the fine-grained matching vector $v_i$ of $q_i$ and candidate document $d$. The fine-grained matching vector $v$ of the current query $q$ and the candidate document $d$ is computed by the same procedure. We have thus obtained the bottom-level matching vectors computed from the user's historical search information, denoted $\{v_1, v_2, v_3, \ldots, v_n\}$.
The attention weight calculation step:

Because the user's search interests and search patterns change dynamically and user queries carry a certain randomness, different query records in the user's search history influence the current query to different degrees. Based on this consideration, the method introduces an attention mechanism and further refines each bottom-level matching vector according to the contribution of different historical queries to the current matching.

The previous step produced the bottom-level matching vectors $\{v_1, v_2, v_3, \ldots, v_n\}$ computed from the user's historical search information. Based on the fine-grained matching vector $v$ of the current query $q$ and candidate document $d$, an attention weight is computed for the bottom-level matching vector of each historical query record. The input to the attention layer is the bottom-level matching vectors $\{v_1, v_2, v_3, \ldots, v_n\}$ from the previous step together with $v$; the calculation is:

$$e_i = g(v, v_i)$$

$$\alpha_i = \frac{\exp(e_i)}{\sum_{j=1}^{n} \exp(e_j)}$$

where $g(\cdot)$ is a multi-layer perceptron with tanh as its activation function, and $\alpha_i$ is the weight the attention layer assigns to the bottom-level matching vector $v_i$. The weighted bottom-level matching vector is given by:

$$V_i = \alpha_i v_i$$

According to how much information each historical query in the user's search history contributes to the current matching, the attention layer pays more attention to the bottom-level matching vectors of the historical queries with larger contributions, producing bottom-level matching information weighted by contribution. We thus obtain the weighted fine-grained matching vectors $\{V_1, V_2, V_3, \ldots, V_n\}$ corresponding to the user's historical queries.
The user interest matching vector generation step:

The weighted fine-grained matching vectors $\{V_1, V_2, V_3, \ldots, V_n\}$ are concatenated column by column into a matching feature matrix $M = [V_1, V_2, V_3, \ldots, V_n] \in \mathbb{R}^{K \times n}$. The traditional approach applies max pooling or average pooling directly on the matching feature matrix to obtain the user interest matching vector. However, given that a user's search history may contain a large number of records, pooling directly on the matching feature matrix may discard useful information, such as the relationships between the bottom-level matching vectors of neighboring historical queries.

To compensate for this deficiency, this step convolves the matching feature matrix $M$ with 100 convolution kernels $f_1, f_2, \ldots, f_{100}$ of size $3 \times 3$ to obtain a three-dimensional tensor $A \in \mathbb{R}^{100 \times (K-2) \times (n-2)}$. Each element of tensor $A$ is given by:

$$A_{t,i,j} = \mathrm{ReLU}\left(f_t \odot M_{i-1:i+1,\, j-1:j+1} + b_t\right)$$

where $1 \leq t \leq 100$, $b_t$ is the $t$-th element of the bias vector $b \in \mathbb{R}^{100}$, $f_t$ is the $t$-th $3 \times 3$ convolution kernel, $M_{i-1:i+1,\, j-1:j+1}$ is the submatrix of $M$ spanning rows $i-1$ to $i+1$ and columns $j-1$ to $j+1$, and $\odot$ denotes multiplying the elements at corresponding positions of two matrices and summing all the products. Since the convolution layer uses $3 \times 3$ kernels, each user's search history must contain at least 3 historical queries. In other words, the model does not support users with fewer than three historical queries: too few historical queries cannot provide enough information for extracting the user's search interests, and in that case personalized re-ranking of the documents would instead interfere with the accurate calculation of document scores. In addition, the convolution layer uses ReLU as its activation function; compared with other activation functions such as sigmoid, ReLU is computationally cheap and avoids the vanishing gradient problem.

After the convolution layer, max pooling is applied to the second and third dimensions of the three-dimensional tensor $A$ in the pooling layer to obtain a 100-dimensional vector $I$, where $I_t$ is the $t$-th element of $I$:

$$I_t = \max_{i,j} A_{t,i,j}$$

The purpose of the pooling layer is to further extract features from the feature tensor $A$; the output vector $I$ is the final user interest matching vector.
The personalized re-ranking step:

The score of a candidate document consists of two parts: the matching score between the candidate document and the user's interests, and the relevance score against the current query. The matching score score(d|I) between the candidate document and the user's interests is obtained by feeding the interest matching vector I through a multi-layer perceptron; the relevance score score(d|q) between the candidate document and the current query is computed by a multi-layer perceptron from three click features: the number of clicks, the original click position, and the click entropy. The final score of the candidate document is the sum of the interest matching score score(d|I) and the relevance score score(d|q), and the final personalized ranking result is obtained by re-ordering the original document list by this score.

The method trains with the LambdaRank algorithm: the clicked document is taken as a relevant document sample and the remaining documents as irrelevant samples, and one relevant document $d_i$ and one irrelevant document $d_j$ are paired to compute the loss. The loss function is the cross entropy between the actual probability and the predicted probability multiplied by the change in the MAP evaluation metric:

$$\mathcal{L} = -\Delta \left( \bar{p}_{ij} \log p_{ij} + \bar{p}_{ji} \log p_{ji} \right)$$

where $\Delta$ is the change in the MAP evaluation metric, $\bar{p}_{ij}$ denotes the actual probability that document $d_i$ is more relevant than document $d_j$ and $p_{ij}$ its predicted counterpart, and $\bar{p}_{ji}$ denotes the actual probability that document $d_j$ is more relevant than document $d_i$ and $p_{ji}$ its predicted counterpart. The predicted probability $p_{ij}$ is computed by:

$$p_{ij} = \frac{1}{1 + e^{-\left(\mathrm{score}(d_i) - \mathrm{score}(d_j)\right)}}$$
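For a clicked d_i and an unclicked d_j the actual probabilities are 1 and 0, so the objective reduces to a weighted negative log likelihood; the sketch below assumes the MAP change Δ for the pair is precomputed.

```python
import torch

def lambdarank_pair_loss(score_i, score_j, delta_map):
    """Loss for one (relevant d_i, irrelevant d_j) pair: cross entropy between
    actual and predicted order probabilities, scaled by the MAP change from
    swapping the pair. With p_bar_ij = 1 and p_bar_ji = 0 this is
    -|delta| * log p_ij."""
    p_ij = torch.sigmoid(score_i - score_j)  # predicted P(d_i ranked above d_j)
    return -delta_map.abs() * torch.log(p_ij.clamp_min(1e-10))

# Toy usage: model scores for the two documents and a precomputed MAP delta
loss = lambdarank_pair_loss(torch.tensor(0.8), torch.tensor(0.3), torch.tensor(0.05))
print(float(loss))  # ~0.024
```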
and outputting the finally obtained personalized sequencing result to an output module for outputting.

Claims (4)

1. A personalized search system based on interaction matching, characterized in that: the system comprises an input module, a personalized search module based on interaction matching, and an output module;
the input module reads the user's query history and the candidate documents and feeds them, in a standardized format, into the personalized search module based on interaction matching;
the operation of the personalized search module based on interaction matching is divided into four steps:
step one: bottom-level matching modeling of the user search history, in which a bottom-level matching model is built from the user's historical search information and the user's historical queries interact with the candidate document word by word to obtain fine-grained bottom-level matching signals;
step two: attention weight calculation, in which an attention mechanism is introduced and the corresponding matching signals are weighted according to the contribution of different query records in the user's search history to the current query;
step three: user interest matching vector generation, in which a convolutional neural network extracts features from the weighted matching signals to generate the final matching vector between the document and the user's interests;
step four: personalized re-ranking, in which the personalized score of each candidate document is computed from the user interest matching vector obtained in step three, the relevance score of the candidate document is computed from click feature vectors, and the sum of the two scores serves as the final document matching score for personalized re-ranking;
the output module outputs the document matching scores and the personalized re-ranking result;
the bottom-level matching modeling step of the user search history is implemented as follows: define the user's historical query list as $\{q_1, q_2, q_3, \ldots, q_n\}$, where $n \geq 3$ is an integer, and the current candidate document as $d$; for each historical query–candidate document pair $\langle q_i, d \rangle$, both texts are first mapped to word vectors represented by a word2vec model: $q_i$ is processed into a set of word vectors $\{qw_1, qw_2, qw_3, \ldots, qw_x\}$ and $d$ into $\{dw_1, dw_2, dw_3, \ldots, dw_y\}$; every vector in one set interacts pairwise with every vector in the other to obtain the word matching matrix $T$ of $\langle q_i, d \rangle$, each element of which is:

$$T_{i,j} = \cos(qw_i, dw_j)$$

where $T_{i,j}$ is the element in row $i$, column $j$ of $T$, $qw_i$ is the word vector of the $i$-th word in the historical query, and $dw_j$ is the word vector of the $j$-th word in the candidate document, with $1 \leq i \leq x$, $1 \leq j \leq y$, and $i, j, x, y$ integers; the matching value is computed by the cosine function; in the K-NRM model, $K$ RBF kernels are applied to each row of the matching matrix to obtain a $K$-dimensional feature vector

$$\vec{K}(T_i) = \left[K_1(T_i), K_2(T_i), \ldots, K_K(T_i)\right]$$

the formula for each RBF kernel being:

$$K_k(T_i) = \sum_{j=1}^{y} \exp\left(-\frac{(T_{i,j} - \mu_k)^2}{2\sigma_k^2}\right)$$

where $K_k(T_i)$ is the value of the $k$-th RBF kernel applied to the $i$-th row of the matching matrix $T$, with values ranging from 0 to $y$; $\mu_k$ and $\sigma_k$ are hyperparameters, with $\mu$ taking uniformly spaced values from −1 to 1; the logarithms of the feature vectors of all rows of the matching matrix are then summed to give the final bottom-level matching result of historical query $q_i$ and the candidate document:

$$v_i = \sum_{r=1}^{x} \log \vec{K}(T_r)$$

the bottom-level matching vectors computed from the user's historical search information are denoted $\{v_1, v_2, v_3, \ldots, v_n\}$, and the fine-grained matching vector $v$ of the current query and the candidate document is computed in the same way;
using the fine-grained matching vector $v$ of the current query $q$ and candidate document $d$, an attention weight is computed for the bottom-level matching vector of each historical query record:

$$e_i = g(v, v_i)$$

$$\alpha_i = \frac{\exp(e_i)}{\sum_{j=1}^{n} \exp(e_j)}$$

where $g$ is a multi-layer perceptron with tanh as its activation function, and $\alpha_i$ is the weight the attention layer assigns to the bottom-level matching vector $v_i$; the weighted bottom-level matching vector is:

$$V_i = \alpha_i v_i$$

the weighted fine-grained matching vectors corresponding to the user's historical queries are $\{V_1, V_2, V_3, \ldots, V_n\}$;
the user interest matching vector generation step is implemented as follows: the weighted fine-grained matching vectors $\{V_1, V_2, V_3, \ldots, V_n\}$ are concatenated column by column into a matching feature matrix $M = [V_1, V_2, V_3, \ldots, V_n] \in \mathbb{R}^{K \times n}$, and $M$ is convolved with 100 convolution kernels to obtain a three-dimensional tensor $A \in \mathbb{R}^{100 \times (K-2) \times (n-2)}$, each element of which is:

$$A_{t,i,j} = \mathrm{ReLU}\left(f_t \odot M_{i-1:i+1,\, j-1:j+1} + b_t\right)$$

where $t$ is an integer from 1 to 100, $b_t$ is the $t$-th element of the bias vector $b \in \mathbb{R}^{100}$, $f_t$ is the $t$-th $3 \times 3$ convolution kernel, $M_{i-1:i+1,\, j-1:j+1}$ is the submatrix of $M$ spanning rows $i-1$ to $i+1$ and columns $j-1$ to $j+1$, and $\odot$ denotes multiplying the elements at corresponding positions of two matrices and summing all the products; the convolution layer uses ReLU as its activation function; after the convolution layer, max pooling is applied to the second and third dimensions of the three-dimensional tensor $A$ in the pooling layer to obtain a 100-dimensional vector $I$, where $I_t$ is the $t$-th element of $I$:

$$I_t = \max_{i,j} A_{t,i,j}$$

the output vector $I$ is the final user interest matching vector.
2. The personalized search system based on interaction matching according to claim 1, wherein: the convolution kernels are 3×3 in size, and each user has at least 3 historical search records.
3. The personalized search system based on interaction matching according to claim 2, wherein: the personalized re-ranking step is implemented as follows: the matching score score(d|I) between the candidate document and the user's interests is obtained by feeding the interest matching vector I through a multi-layer perceptron; the relevance score score(d|q) between the candidate document and the current query is computed by a multi-layer perceptron from three click features: the number of clicks, the original click position, and the click entropy; the final score of the candidate document is the sum of the interest matching score score(d|I) and the relevance score score(d|q), and the final personalized ranking result is obtained by re-ordering the original document list by this score.
4. The personalized search system based on interaction matching according to claim 3, wherein: in computing the relevance score of the candidate documents against the current query, the score is trained with the LambdaRank algorithm: the clicked document is taken as a relevant document sample and the remaining documents as irrelevant samples, and one relevant document $d_i$ and one irrelevant document $d_j$ are paired to compute the loss; the loss function also introduces, as a corresponding weight, the degree to which swapping the order of the document pair affects the evaluation metric MAP, i.e., the larger the MAP change after swapping, the larger the difference between the documents and the larger the weight given; the loss function is the cross entropy between the actual probability and the predicted probability multiplied by the change in the MAP evaluation metric:

$$\mathcal{L} = -\Delta \left( \bar{p}_{ij} \log p_{ij} + \bar{p}_{ji} \log p_{ji} \right)$$

$$p_{ij} = \frac{1}{1 + e^{-\left(\mathrm{score}(d_i) - \mathrm{score}(d_j)\right)}}$$

where $\Delta$ is the change in the MAP evaluation metric after swapping the positions of document $d_i$ and document $d_j$, $\bar{p}_{ij}$ denotes the actual probability that document $d_i$ is more relevant than document $d_j$, and $p_{ij}$ denotes the predicted probability.
CN202010861245.9A 2020-08-25 2020-08-25 Personalized search system based on interaction matching Active CN112069399B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010861245.9A CN112069399B (en) 2020-08-25 2020-08-25 Personalized search system based on interaction matching

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010861245.9A CN112069399B (en) 2020-08-25 2020-08-25 Personalized search system based on interaction matching

Publications (2)

Publication Number Publication Date
CN112069399A CN112069399A (en) 2020-12-11
CN112069399B true CN112069399B (en) 2023-06-02

Family

ID=73658899

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010861245.9A Active CN112069399B (en) 2020-08-25 2020-08-25 Personalized search system based on interaction matching

Country Status (1)

Country Link
CN (1) CN112069399B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113987155B (en) * 2021-11-25 2024-03-26 中国人民大学 Conversational retrieval method integrating knowledge graph and large-scale user log
CN114357231B (en) * 2022-03-09 2022-06-28 城云科技(中国)有限公司 Text-based image retrieval method and device and readable storage medium
CN117851444A (en) * 2024-03-07 2024-04-09 北京谷器数据科技有限公司 Advanced searching method based on semantic understanding

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107291871A (en) * 2017-06-15 2017-10-24 北京百度网讯科技有限公司 Matching degree appraisal procedure, equipment and the medium of many domain informations based on artificial intelligence
CN107957993A (en) * 2017-12-13 2018-04-24 北京邮电大学 The computational methods and device of english sentence similarity
CN111125538A (en) * 2019-12-31 2020-05-08 中国人民大学 Searching method for enhancing personalized retrieval effect by using entity information
CN111177357A (en) * 2019-12-31 2020-05-19 中国人民大学 Memory neural network-based conversational information retrieval method
CN111310023A (en) * 2020-01-15 2020-06-19 中国人民大学 Personalized search method and system based on memory network

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10268646B2 (en) * 2017-06-06 2019-04-23 Facebook, Inc. Tensor-based deep relevance model for search on online social networks
SG10202108020VA (en) * 2017-10-16 2021-09-29 Illumina Inc Deep learning-based techniques for training deep convolutional neural networks

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107291871A (en) * 2017-06-15 2017-10-24 北京百度网讯科技有限公司 Matching degree appraisal procedure, equipment and the medium of many domain informations based on artificial intelligence
CN107957993A (en) * 2017-12-13 2018-04-24 北京邮电大学 The computational methods and device of english sentence similarity
CN111125538A (en) * 2019-12-31 2020-05-08 中国人民大学 Searching method for enhancing personalized retrieval effect by using entity information
CN111177357A (en) * 2019-12-31 2020-05-19 中国人民大学 Memory neural network-based conversational information retrieval method
CN111310023A (en) * 2020-01-15 2020-06-19 中国人民大学 Personalized search method and system based on memory network

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
End-to-End Neural Ad-hoc Ranking with Kernel Pooling; Chenyan Xiong et al.; Research and Development in Information Retrieval; 55-64 *
Dynamic personalized search algorithm based on recurrent neural networks and attention mechanisms (基于递归神经网络与注意力机制的动态个性化搜索算法); Zhou Yujia et al.; Chinese Journal of Computers (计算机学报); 812-826 *

Also Published As

Publication number Publication date
CN112069399A (en) 2020-12-11

Similar Documents

Publication Publication Date Title
CN111667884B (en) Convolutional neural network model for predicting protein interactions using protein primary sequences based on attention mechanism
CN112069399B (en) Personalized search system based on interaction matching
CN110298037B (en) Convolutional neural network matching text recognition method based on enhanced attention mechanism
CN110188358B (en) Training method and device for natural language processing model
CN110929164A (en) Interest point recommendation method based on user dynamic preference and attention mechanism
Hofmann The cluster-abstraction model: Unsupervised learning of topic hierarchies from text data
Khrulkov et al. Tensorized embedding layers for efficient model compression
KR102203065B1 (en) Triple verification device and method
Chitty-Venkata et al. Neural architecture search for transformers: A survey
CN111782961B (en) Answer recommendation method oriented to machine reading understanding
CN112328900A (en) Deep learning recommendation method integrating scoring matrix and comment text
CN111737578A (en) Recommendation method and system
CN111723914A (en) Neural network architecture searching method based on convolution kernel prediction
Sadr et al. Convolutional neural network equipped with attention mechanism and transfer learning for enhancing performance of sentiment analysis
CN112527993A (en) Cross-media hierarchical deep video question-answer reasoning framework
CN112115371A (en) Neural attention mechanism mobile phone application recommendation model based on factorization machine
Sokkhey et al. Development and optimization of deep belief networks applied for academic performance prediction with larger datasets
CN116976505A (en) Click rate prediction method of decoupling attention network based on information sharing
Dinov et al. Black box machine-learning methods: Neural networks and support vector machines
Paul et al. Non-iterative online sequential learning strategy for autoencoder and classifier
CN116561314A (en) Text classification method for selecting self-attention based on self-adaptive threshold
Zhou et al. Gan-based recommendation with positive-unlabeled sampling
Ganguly et al. Evaluating CNN architectures using attention mechanisms: Convolutional Block Attention Module, Squeeze, and Excitation for image classification on CIFAR10 dataset
CN115422369B (en) Knowledge graph completion method and device based on improved TextRank
Pourbahman et al. Deep neural ranking model using distributed smoothing

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant